Fixed typos in README (#635)
This commit is contained in:
@@ -15,7 +15,7 @@ sudo apt-get update && sudo apt-get install cmake libopenmpi-dev python3-dev zli
|
|||||||
```
|
```
|
||||||
|
|
||||||
### Mac OS X
|
### Mac OS X
|
||||||
Installation of system packages on Mac requires [Homebrew](https://brew.sh). With Homebrew installed, run the follwing:
|
Installation of system packages on Mac requires [Homebrew](https://brew.sh). With Homebrew installed, run the following:
|
||||||
```bash
|
```bash
|
||||||
brew install cmake openmpi
|
brew install cmake openmpi
|
||||||
```
|
```
|
||||||
@@ -84,7 +84,7 @@ The hyperparameters for both network and the learning algorithm can be controlle
|
|||||||
```bash
|
```bash
|
||||||
python -m baselines.run --alg=ppo2 --env=Humanoid-v2 --network=mlp --num_timesteps=2e7 --ent_coef=0.1 --num_hidden=32 --num_layers=3 --value_network=copy
|
python -m baselines.run --alg=ppo2 --env=Humanoid-v2 --network=mlp --num_timesteps=2e7 --ent_coef=0.1 --num_hidden=32 --num_layers=3 --value_network=copy
|
||||||
```
|
```
|
||||||
will set entropy coeffient to 0.1, and construct fully connected network with 3 layers with 32 hidden units in each, and create a separate network for value function estimation (so that its parameters are not shared with the policy network, but the structure is the same)
|
will set entropy coefficient to 0.1, and construct fully connected network with 3 layers with 32 hidden units in each, and create a separate network for value function estimation (so that its parameters are not shared with the policy network, but the structure is the same)
|
||||||
|
|
||||||
See docstrings in [common/models.py](baselines/common/models.py) for description of network parameters for each type of model, and
|
See docstrings in [common/models.py](baselines/common/models.py) for description of network parameters for each type of model, and
|
||||||
docstring for [baselines/ppo2/ppo2.py/learn()](baselines/ppo2/ppo2.py#L152) for the description of the ppo2 hyperparamters.
|
docstring for [baselines/ppo2/ppo2.py/learn()](baselines/ppo2/ppo2.py#L152) for the description of the ppo2 hyperparamters.
|
||||||
|
Reference in New Issue
Block a user