14 Commits

Author SHA1 Message Date
pzhokhov
8c2aea2add refactor a2c, acer, acktr, ppo2, deepq, and trpo_mpi (#490)
* exported rl-algs

* more stuff from rl-algs

* run slow tests

* re-exported rl_algs

* re-exported rl_algs - fixed problems with serialization test and test_cartpole

* replaced atari_arg_parser with common_arg_parser

* run.py can run algos from both baselines and rl_algs

* added approximate humanoid reward with ppo2 into the README for reference

* dummy commit to RUN BENCHMARKS

* dummy commit to RUN BENCHMARKS

* dummy commit to RUN BENCHMARKS

* dummy commit to RUN BENCHMARKS

* very dummy commit to RUN BENCHMARKS

* serialize variables as a dict, not as a list

* running_mean_std uses tensorflow variables

* fixed import in vec_normalize

* dummy commit to RUN BENCHMARKS

* dummy commit to RUN BENCHMARKS

* flake8 complaints

* save all variables to make sure we save the vec_normalize normalization

* benchmarks on ppo2 only RUN BENCHMARKS

* make_atari_env compatible with mpi

* run ppo_mpi benchmarks only RUN BENCHMARKS

* hardcode names of retro environments

* add defaults

* changed default ppo2 lr schedule to linear RUN BENCHMARKS

* non-tf normalization benchmark RUN BENCHMARKS

* use ncpu=1 for mujoco sessions - gives a bit of a performance speedup

* reverted running_mean_std to use property decorators for mean, var, count

* reverted VecNormalize to use RunningMeanStd (no tf)

* reverted VecNormalize to use RunningMeanStd (no tf)

* profiling wip

* use VecNormalize with regular RunningMeanStd

* added acer runner (missing import)

* flake8 complaints

* added a note in README about TfRunningMeanStd and serialization of VecNormalize

* dummy commit to RUN BENCHMARKS

* merged benchmarks branch
2018-08-13 09:56:44 -07:00
pzhokhov
9cb7ece338 add opencv-python to the dependencies (#407) 2018-05-14 10:52:19 -07:00
Matthias Plappert
b71152eea0 Adds support for Hindsight Experience Replay (HER) (#299)
* Add Hindsight Experience Replay (HER)

* Minor improvements
2018-02-26 17:40:16 +01:00
Christopher Hesse
edb52c22a5 export: Fix deepq param noise refactoring, remove atari experiments and azure dependency 2018-02-14 21:42:22 -08:00
simontudo
7bd264e0e9 added cloudpickle to requirements 2018-01-31 10:43:17 +01:00
Alex Nichol
8c90f67560 don't list TensorFlow as a requirement
fixes #146

A better (more involved) solution might be to check for a TensorFlow installation manually in setup.py and deal with that accordingly.
2017-12-15 15:54:43 -08:00
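The manual check suggested in the commit message above could look roughly like the sketch below. It is an illustration only, not code from this repository: the helper names `module_available` and `missing_tensorflow_message` and the wording of the message are assumptions. It relies on `importlib.util.find_spec` from the standard library, which looks a module up without importing it.

```python
import importlib.util


def module_available(name):
    """Return True if a top-level module can be located without importing it."""
    return importlib.util.find_spec(name) is not None


def missing_tensorflow_message(module_name="tensorflow"):
    """Return a hint for the user if the module is missing, else None.

    `module_name` is a parameter only so the check can be exercised
    with other module names; the hint text is a hypothetical example.
    """
    if module_available(module_name):
        return None
    return (
        "TensorFlow was not found. Install either the CPU or the GPU "
        "package manually before using this library."
    )


if __name__ == "__main__":
    # In a setup.py this would run before the call to setup().
    message = missing_tensorflow_message()
    if message is not None:
        print(message)
```

Printing a hint rather than failing keeps installation possible for users who install TensorFlow afterwards, which matches the intent of not listing it as a hard requirement.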
John Schulman
6f39148452 fix gym req 2017-08-27 22:49:50 -07:00
Matthias Plappert
882251878f Parameter space noise for DQN and DDPG (#75)
* Export param noise

* Update documentation

* Final finishing touches
2017-07-27 08:10:59 -07:00
John Schulman
da99706046 ppo and trpo 2017-07-20 08:52:35 -07:00
Szymon Sidor
80f94f8ec5 bump version 2017-07-12 14:48:05 -07:00
Zichuan Lin
ef1a2402fc Update setup.py 2017-06-07 17:29:38 +08:00
Yuxin Wu
709c327c40 Update setup.py
`PongNoFrameskip-v4` seems to require `gym>=0.9.1`
2017-05-29 19:39:25 -07:00
Szymon Sidor
7327a1560b update Atari envs to v4 and warn Python 2 users. 2017-05-25 14:53:29 -07:00
Szymon Sidor
958810ed1e Initial commit 2017-05-24 02:34:20 -07:00