From 3d3ea6cb16dc9915f0061408ffb4e78ee005f3be Mon Sep 17 00:00:00 2001 From: Louie Helm Date: Tue, 5 Sep 2017 06:04:37 -0700 Subject: [PATCH 1/2] fix trpo command in readme --- baselines/trpo_mpi/README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/baselines/trpo_mpi/README.md b/baselines/trpo_mpi/README.md index c7e379d..b3d9b9d 100644 --- a/baselines/trpo_mpi/README.md +++ b/baselines/trpo_mpi/README.md @@ -2,5 +2,5 @@ - Original paper: https://arxiv.org/abs/1502.05477 - Baselines blog post https://blog.openai.com/openai-baselines-ppo/ -- `mpirun -np 16 python -m baselines.ppo1.run_atari` runs the algorithm for 40M frames = 10M timesteps on an Atari game. See help (`-h`) for more options. -- `python -m baselines.ppo1.run_mujoco` runs the algorithm for 1M timesteps on a Mujoco environment. \ No newline at end of file +- `mpirun -np 16 python -m baselines.trpo_mpi.run_atari` runs the algorithm for 40M frames = 10M timesteps on an Atari game. See help (`-h`) for more options. +- `python -m baselines.trpo_mpi.run_mujoco` runs the algorithm for 1M timesteps on a Mujoco environment. From 589387403b048f0d4f04a7f66b53da058a8dfaed Mon Sep 17 00:00:00 2001 From: Louie Helm Date: Tue, 5 Sep 2017 06:06:19 -0700 Subject: [PATCH 2/2] fix ppo command in readme --- baselines/ppo1/README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/baselines/ppo1/README.md b/baselines/ppo1/README.md index 70325b7..f45d141 100644 --- a/baselines/ppo1/README.md +++ b/baselines/ppo1/README.md @@ -2,6 +2,6 @@ - Original paper: https://arxiv.org/abs/1707.06347 - Baselines blog post: https://blog.openai.com/openai-baselines-ppo/ -- `mpirun -np 8 python -m baselines.ppo.run_atari` runs the algorithm for 40M frames = 10M timesteps on an Atari game. See help (`-h`) for more options. -- `python -m baselines.ppo.run_mujoco` runs the algorithm for 1M frames on a Mujoco environment. +- `mpirun -np 8 python -m baselines.ppo1.run_atari` runs the algorithm for 40M frames = 10M timesteps on an Atari game. See help (`-h`) for more options. +- `python -m baselines.ppo1.run_mujoco` runs the algorithm for 1M frames on a Mujoco environment.