baselines

Author	SHA1	Message	Date
pzhokhov	8c2aea2add	refactor a2c, acer, acktr, ppo2, deepq, and trpo_mpi (#490 ) * exported rl-algs * more stuff from rl-algs * run slow tests * re-exported rl_algs * re-exported rl_algs - fixed problems with serialization test and test_cartpole * replaced atari_arg_parser with common_arg_parser * run.py can run algos from both baselines and rl_algs * added approximate humanoid reward with ppo2 into the README for reference * dummy commit to RUN BENCHMARKS * dummy commit to RUN BENCHMARKS * dummy commit to RUN BENCHMARKS * dummy commit to RUN BENCHMARKS * very dummy commit to RUN BENCHMARKS * serialize variables as a dict, not as a list * running_mean_std uses tensorflow variables * fixed import in vec_normalize * dummy commit to RUN BENCHMARKS * dummy commit to RUN BENCHMARKS * flake8 complaints * save all variables to make sure we save the vec_normalize normalization * benchmarks on ppo2 only RUN BENCHMARKS * make_atari_env compatible with mpi * run ppo_mpi benchmarks only RUN BENCHMARKS * hardcode names of retro environments * add defaults * changed default ppo2 lr schedule to linear RUN BENCHMARKS * non-tf normalization benchmark RUN BENCHMARKS * use ncpu=1 for mujoco sessions - gives a bit of a performance speedup * reverted running_mean_std to user property decorators for mean, var, count * reverted VecNormalize to use RunningMeanStd (no tf) * reverted VecNormalize to use RunningMeanStd (no tf) * profiling wip * use VecNormalize with regular RunningMeanStd * added acer runner (missing import) * flake8 complaints * added a note in README about TfRunningMeanStd and serialization of VecNormalize * dummy commit to RUN BENCHMARKS * merged benchmarks branch	2018-08-13 09:56:44 -07:00
pzhokhov	9cb7ece338	add opencv-python to the dependencies (#407 )	2018-05-14 10:52:19 -07:00
Matthias Plappert	b71152eea0	Adds support for Hindsight Experience Replay (HER) (#299 ) * Add Hindsight Experience Replay (HER) * Minor improvements	2018-02-26 17:40:16 +01:00
Christopher Hesse	edb52c22a5	export: Fix deepq param noise refactoring, remove atari experiments and azure dependency	2018-02-14 21:42:22 -08:00
simontudo	7bd264e0e9	added cloudpickle to requirements	2018-01-31 10:43:17 +01:00
Alex Nichol	8c90f67560	don't list TensorFlow as a requirement fixes #146 A better (more involved) solution might be to check for a TensorFlow installation manually in setup.py and deal with that accordingly.	2017-12-15 15:54:43 -08:00
John Schulman	6f39148452	fix gym req	2017-08-27 22:49:50 -07:00
Matthias Plappert	882251878f	Parameter space noise for DQN and DDPG (#75 ) * Export param noise * Update documentation * Final finishing touches	2017-07-27 08:10:59 -07:00
John Schulman	da99706046	ppo and trpo	2017-07-20 08:52:35 -07:00
Szymon Sidor	80f94f8ec5	bump version	2017-07-12 14:48:05 -07:00
Zichuan Lin	ef1a2402fc	Update setup.py	2017-06-07 17:29:38 +08:00
Yuxin Wu	709c327c40	Update setup.py `PongNoFrameskip-v4` seems to require `gym>=0.9.1`	2017-05-29 19:39:25 -07:00
Szymon Sidor	7327a1560b	update Atari envs to v4 and warn Python 2 users.	2017-05-25 14:53:29 -07:00
Szymon Sidor	958810ed1e	Initial commit	2017-05-24 02:34:20 -07:00

14 Commits