diff --git a/baselines/ppo1/README.md b/baselines/ppo1/README.md index 70325b7..f45d141 100644 --- a/baselines/ppo1/README.md +++ b/baselines/ppo1/README.md @@ -2,6 +2,6 @@ - Original paper: https://arxiv.org/abs/1707.06347 - Baselines blog post: https://blog.openai.com/openai-baselines-ppo/ -- `mpirun -np 8 python -m baselines.ppo.run_atari` runs the algorithm for 40M frames = 10M timesteps on an Atari game. See help (`-h`) for more options. -- `python -m baselines.ppo.run_mujoco` runs the algorithm for 1M frames on a Mujoco environment. +- `mpirun -np 8 python -m baselines.ppo1.run_atari` runs the algorithm for 40M frames = 10M timesteps on an Atari game. See help (`-h`) for more options. +- `python -m baselines.ppo1.run_mujoco` runs the algorithm for 1M frames on a Mujoco environment. diff --git a/baselines/trpo_mpi/README.md b/baselines/trpo_mpi/README.md index c7e379d..b3d9b9d 100644 --- a/baselines/trpo_mpi/README.md +++ b/baselines/trpo_mpi/README.md @@ -2,5 +2,5 @@ - Original paper: https://arxiv.org/abs/1502.05477 - Baselines blog post https://blog.openai.com/openai-baselines-ppo/ -- `mpirun -np 16 python -m baselines.ppo1.run_atari` runs the algorithm for 40M frames = 10M timesteps on an Atari game. See help (`-h`) for more options. -- `python -m baselines.ppo1.run_mujoco` runs the algorithm for 1M timesteps on a Mujoco environment. \ No newline at end of file +- `mpirun -np 16 python -m baselines.trpo_mpi.run_atari` runs the algorithm for 40M frames = 10M timesteps on an Atari game. See help (`-h`) for more options. +- `python -m baselines.trpo_mpi.run_mujoco` runs the algorithm for 1M timesteps on a Mujoco environment.