Files
baselines/baselines/a2c/README.md
Thomas Simonini 8158f35611 Wrote some comments to explain the A2C and PPO2 implementation (#607)
* added comments in A2C and PPO2

* Fixed format errors to respect PEP 8 style guide
2018-09-21 13:12:31 -07:00

833 B

A2C

Files

  • run_atari: file used to run the algorithm.
  • policies.py: contains the different versions of the A2C architecture (MlpPolicy, CNNPolicy, LstmPolicy...).
  • a2c.py: - Model : class used to initialize the step_model (sampling) and train_model (training)
    • learn : Main entrypoint for A2C algorithm. Train a policy with given network architecture on a given environment using a2c algorithm.
  • runner.py: class used to generates a batch of experiences