Files
baselines/baselines/acer
ethanwaldie b55eda1dde Added required arguments to the policy builder in the ACER model to (#784)
* Added required arguments to the policy builder in the ACER model to
fix the issue #783

* Changed the step model from nbatch to nenvs

* Updated nsteps to be 1.
2019-01-22 19:22:28 -08:00
..
2018-10-23 10:01:25 -07:00
2018-08-16 14:53:49 -07:00
2018-10-23 10:01:25 -07:00

ACER

  • Original paper: https://arxiv.org/abs/1611.01224
  • python -m baselines.run --alg=acer --env=PongNoFrameskip-v4 runs the algorithm for 40M frames = 10M timesteps on an Atari Pong. See help (-h) for more options.
  • also refer to the repo-wide README.md