diff --git a/README.md b/README.md index a9b7bf6..c5e1fde 100644 --- a/README.md +++ b/README.md @@ -15,7 +15,7 @@ sudo apt-get update && sudo apt-get install cmake libopenmpi-dev python3-dev zli ``` ### Mac OS X -Installation of system packages on Mac requires [Homebrew](https://brew.sh). With Homebrew installed, run the follwing: +Installation of system packages on Mac requires [Homebrew](https://brew.sh). With Homebrew installed, run the following: ```bash brew install cmake openmpi ``` @@ -84,7 +84,7 @@ The hyperparameters for both network and the learning algorithm can be controlle ```bash python -m baselines.run --alg=ppo2 --env=Humanoid-v2 --network=mlp --num_timesteps=2e7 --ent_coef=0.1 --num_hidden=32 --num_layers=3 --value_network=copy ``` -will set entropy coeffient to 0.1, and construct fully connected network with 3 layers with 32 hidden units in each, and create a separate network for value function estimation (so that its parameters are not shared with the policy network, but the structure is the same) +will set entropy coefficient to 0.1, and construct fully connected network with 3 layers with 32 hidden units in each, and create a separate network for value function estimation (so that its parameters are not shared with the policy network, but the structure is the same) See docstrings in [common/models.py](baselines/common/models.py) for description of network parameters for each type of model, and docstring for [baselines/ppo2/ppo2.py/learn()](baselines/ppo2/ppo2.py#L152) for the description of the ppo2 hyperparamters.