* update per-algorithm READMEs to reflect new way of running algorithms * adding a link to repo-wide README * updated README files and deepq.train_cartpole example
7 lines
301 B
Markdown
7 lines
301 B
Markdown
# ACER
|
|
|
|
- Original paper: https://arxiv.org/abs/1611.01224
|
|
- `python -m baselines.run --alg=acer --env=PongNoFrameskip-v4` runs the algorithm for 40M frames = 10M timesteps on an Atari Pong. See help (`-h`) for more options.
|
|
- also refer to the repo-wide [README.md](../../README.md#training-models)
|
|
|