Files

ethanwaldie b55eda1dde Added required arguments to the policy builder in the ACER model to (#784 )

* Added required arguments to the policy builder in the ACER model to
fix the issue #783

* Changed the step model from nbatch to nenvs

* Updated nsteps to be 1.

2019-01-22 19:22:28 -08:00

| File | Last commit | Date |
| --- | --- | --- |
| __init__.py | Add ACER, PPO2, and results_plotter.py | 2017-11-16 10:02:32 -08:00 |
| acer.py | Added required arguments to the policy builder in the ACER model to (#784) | 2019-01-22 19:22:28 -08:00 |
| buffer.py | refactor ACER (#664) | 2018-10-23 10:01:25 -07:00 |
| defaults.py | refactor a2c, acer, acktr, ppo2, deepq, and trpo_mpi (#490) | 2018-08-13 09:56:44 -07:00 |
| policies.py | refactor a2c, acer, acktr, ppo2, deepq, and trpo_mpi (#490) | 2018-08-13 09:56:44 -07:00 |
| README.md | update readmes (#514) | 2018-08-16 14:53:49 -07:00 |
| runner.py | refactor ACER (#664) | 2018-10-23 10:01:25 -07:00 |

README.md

ACER

- Original paper: https://arxiv.org/abs/1611.01224
- `python -m baselines.run --alg=acer --env=PongNoFrameskip-v4` runs the algorithm for 40M frames (10M timesteps) on Atari Pong. See help (`-h`) for more options.
- Also refer to the repo-wide README.md.
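
The relationship between frames and timesteps quoted above can be checked with a short sketch. It assumes the usual Atari setup in which each agent action is repeated for 4 emulator frames, so 40M frames correspond to 10M timesteps. The helper names `frames_to_timesteps` and `build_acer_command` are hypothetical and not part of baselines, and the `--num_timesteps` flag is assumed to be accepted by `baselines.run`; check `-h` before relying on it.

```python
import shlex

# Assumption: each agent timestep spans 4 emulator frames (standard Atari frame skip).
FRAME_SKIP = 4


def frames_to_timesteps(frames, frame_skip=FRAME_SKIP):
    """Convert a count of emulator frames into agent timesteps."""
    return frames // frame_skip


def build_acer_command(env="PongNoFrameskip-v4", num_timesteps=None):
    """Build the argv list for the ACER run described in this README.

    Hypothetical helper: it only assembles the command, it does not run it.
    """
    cmd = ["python", "-m", "baselines.run", "--alg=acer", f"--env={env}"]
    if num_timesteps is not None:
        # Assumed flag; verify with `python -m baselines.run -h`.
        cmd.append(f"--num_timesteps={num_timesteps}")
    return cmd


if __name__ == "__main__":
    steps = frames_to_timesteps(40_000_000)
    print(steps)
    print(shlex.join(build_acer_command(num_timesteps=steps)))
```

To launch training, the returned list could be passed to `subprocess.run`; the sketch stops at building the command so it can be inspected first.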