baselines

Author	SHA1	Message	Date
Peter Zhokhov	1c5c6563b7	reverted VecNormalize to use RunningMeanStd (no tf)	2018-08-02 10:55:09 -07:00
Peter Zhokhov	f6d1115ead	reverted running_mean_std to user property decorators for mean, var, count	2018-08-02 10:32:22 -07:00
Peter Zhokhov	f6d5a47bed	use ncpu=1 for mujoco sessions - gives a bit of a performance speedup	2018-08-02 10:24:21 -07:00
Peter Zhokhov	c2df27bee4	non-tf normalization benchmark RUN BENCHMARKS	2018-08-02 09:41:41 -07:00
Peter Zhokhov	974c15756e	changed default ppo2 lr schedule to linear RUN BENCHMARKS	2018-08-01 16:24:44 -07:00
Peter Zhokhov	ad43fd9a35	add defaults	2018-08-01 16:15:59 -07:00
Peter Zhokhov	72c357c638	hardcode names of retro environments	2018-08-01 15:18:59 -07:00
Peter Zhokhov	e00e5ca016	run ppo_mpi benchmarks only RUN BENCHMARKS	2018-08-01 14:56:08 -07:00
Peter Zhokhov	705797f2f0	Merge branch 'peterz_migrate_rlalgs' into peterz_benchmarks	2018-08-01 14:46:40 -07:00
Peter Zhokhov	fcd84aa831	make_atari_env compatible with mpi	2018-08-01 14:46:18 -07:00
Peter Zhokhov	390b51597a	benchmarks on ppo2 only RUN BENCHMARKS	2018-08-01 11:01:50 -07:00
Peter Zhokhov	95104a3592	Merge branch 'peterz_migrate_rlalgs' into peterz_benchmarks	2018-08-01 10:50:29 -07:00
Peter Zhokhov	3528f7b992	save all variables to make sure we save the vec_normalize normalization	2018-08-01 10:12:19 -07:00
Peter Zhokhov	151e48009e	flake8 complaints	2018-07-31 16:25:12 -07:00
Peter Zhokhov	92f33335e9	dummy commit to RUN BENCHMARKS	2018-07-31 15:53:18 -07:00
Peter Zhokhov	af729cff15	dummy commit to RUN BENCHMARKS	2018-07-31 15:37:00 -07:00
Peter Zhokhov	10f815fe1d	fixed import in vec_normalize	2018-07-31 15:19:43 -07:00
Peter Zhokhov	8c4adac898	running_mean_std uses tensorflow variables	2018-07-31 14:45:55 -07:00
Peter Zhokhov	2a93ea8782	serialize variables as a dict, not as a list	2018-07-31 11:13:31 -07:00
Peter Zhokhov	9c48f9fad5	very dummy commit to RUN BENCHMARKS	2018-07-31 10:23:43 -07:00
Peter Zhokhov	348cbb4b71	dummy commit to RUN BENCHMARKS	2018-07-31 09:42:23 -07:00
Peter Zhokhov	a1602ab15f	dummy commit to RUN BENCHMARKS	2018-07-30 17:51:16 -07:00
Peter Zhokhov	e63e69bb14	dummy commit to RUN BENCHMARKS	2018-07-30 17:39:22 -07:00
Peter Zhokhov	385e7e5c0d	dummy commit to RUN BENCHMARKS	2018-07-30 17:21:05 -07:00
Peter Zhokhov	d112a2e49f	added approximate humanoid reward with ppo2 into the README for reference	2018-07-30 16:58:31 -07:00
Peter Zhokhov	e662dd6409	run.py can run algos from both baselines and rl_algs	2018-07-30 16:09:48 -07:00
Peter Zhokhov	efc6bffce3	replaced atari_arg_parser with common_arg_parser	2018-07-30 15:58:56 -07:00
Peter Zhokhov	872181d4c3	re-exported rl_algs - fixed problems with serialization test and test_cartpole	2018-07-30 15:49:48 -07:00
Peter Zhokhov	628ddecf6a	re-exported rl_algs	2018-07-30 12:15:46 -07:00
peter	83a4a4be65	run slow tests	2018-07-26 14:39:25 -07:00
peter	7edac38c73	more stuff from rl-algs	2018-07-26 14:26:57 -07:00
peter	a6dca44115	exported rl-algs	2018-07-26 14:02:04 -07:00
Adam Gleave	f272969325	GAIL: bugfix in dataset loading (#447 ) * Fix silly typo * Replace ad-hoc function with NumPy code	2018-07-06 16:12:14 -07:00
pzhokhov	a6b1bc70f1	re-import internal; fix missing tile_images.py (#427 ) * import rl-algs from 2e3a166 commit * extra import of the baselines badge * exported commit with identity test * proper rng seeding in the test_identity * import internal * adding missing tile_images.py	2018-06-08 09:41:45 -07:00
pzhokhov	36ee5d1707	Import internal changes (#422 ) * import rl-algs from 2e3a166 commit * extra import of the baselines badge * exported commit with identity test * proper rng seeding in the test_identity * import internal	2018-06-06 11:39:13 -07:00
pzhokhov	24fe3d6576	Import internal repo (#409 ) * import rl-algs from 2e3a166 commit * extra import of the baselines badge * exported commit with identity test * proper rng seeding in the test_identity	2018-05-21 15:24:00 -07:00
pzhokhov	9cb7ece338	add opencv-python to the dependencies (#407 )	2018-05-14 10:52:19 -07:00
pzhokhov	9cf95a0054	setup travis ci build (#388 ) * simple .travis.yml file * added static syntax checks of common to .travis.yml * dockerizing the build * fix Dockerfile, adding build shield * cleaning up workdir in Dockerfile and .travis.yml * .travis.yml fixed common -> baselines/common for style check	2018-05-03 09:43:28 -07:00
pzhokhov	8b781038cc	put filters and running_stat files in common instead of acktr (#389 )	2018-05-02 18:42:48 -07:00
pzhokhov	69f25c6028	import internal repo (#385 )	2018-05-01 16:54:04 -07:00
pzhokhov	2b0283b9db	Readme.md detailed installation instructions (#377 ) * changes to README.md files with more detailed installation instructions * md-fying the changes better * link on the word homebrew in readme.md * typos in README.md * README.md * removed extra comma sign * removed sudo from brew command	2018-04-25 17:40:48 -07:00
Matthias Plappert	1f8a03f3a6	Update README	2018-03-26 16:50:22 +02:00
Matthias Plappert	3cc7df0608	Minor fixes to HER release (#319 ) * Fix plotting script * Add warning if num_cpu = 1	2018-03-05 11:06:17 +01:00
Alex Nichol	8b3a6c2051	fix DummyVecEnv reusing buffers	2018-03-02 17:18:07 -08:00
Alex Nichol	569bd42629	Merge pull request #308 from araffin/master Bug fix in saving ACER model	2018-03-01 10:45:04 -08:00
Daniel Ziegler	f49a9c3d85	Fix bug in DDPG parameter space noise adaptation (#306 ) The training loop used the rollout step variable `t` rather than the training step variable `t_train` to decide when to adapt the scale of the parameter space noise.	2018-03-01 18:00:34 +01:00
Antonin RAFFIN	14f2f9328c	Bug fix in saving ACER model	2018-03-01 10:24:14 +01:00
Alex Nichol	6bdf2f55a2	Merge pull request #132 from bhatiaabhinav/bug_fixes Bug fix in saving a2c model.	2018-02-27 19:00:37 -08:00
Alex Nichol	97be70d6c8	fixes for DummyVecEnv Fixes various problems running MuJoCo tasks.	2018-02-27 18:55:10 -08:00
Matthias Plappert	b71152eea0	Adds support for Hindsight Experience Replay (HER) (#299 ) * Add Hindsight Experience Replay (HER) * Minor improvements	2018-02-26 17:40:16 +01:00

1 2 3 4

158 Commits