baselines

Author	SHA1	Message	Date
Peter Zhokhov	2c818245d6	dummy commit to RUN BENCHMARKS	2018-07-25 18:09:30 -07:00
Peter Zhokhov	ae8e7fd16b	dummy commit to RUN BENCHMARKS	2018-07-25 18:07:56 -07:00
pzhokhov	24fe3d6576	Import internal repo (#409 ) * import rl-algs from 2e3a166 commit * extra import of the baselines badge * exported commit with identity test * proper rng seeding in the test_identity	2018-05-21 15:24:00 -07:00
pzhokhov	9cf95a0054	setup travis ci build (#388 ) * simple .travis.yml file * added static syntax checks of common to .travis.yml * dockerizing the build * fix Dockerfile, adding build shield * cleaning up workdir in Dockerfile and .travis.yml * .travis.yml fixed common -> baselines/common for style check	2018-05-03 09:43:28 -07:00
pzhokhov	2b0283b9db	Readme.md detailed installation instructions (#377 ) * changes to README.md files with more detailed installation instructions * md-fying the changes better * link on the word homebrew in readme.md * typos in README.md * README.md * removed extra comma sign * removed sudo from brew command	2018-04-25 17:40:48 -07:00
Matthias Plappert	b71152eea0	Adds support for Hindsight Experience Replay (HER) (#299 ) * Add Hindsight Experience Replay (HER) * Minor improvements	2018-02-26 17:40:16 +01:00
John Schulman	459f007bcc	Merge pull request #260 from uidilr/master Add GAIL	2018-01-25 20:54:20 -08:00
John Schulman	9fa8e1baf1	Lots of cleanups Fixes for new gym version Add @olegklimov and @unixpickle to authors list	2018-01-25 18:54:24 -08:00
Yusuke Nakata	d8cce2309f	Add GAIL	2018-01-23 12:02:03 +09:00
John Schulman	2dd7d307d7	Add ACER, PPO2, and results_plotter.py	2017-11-16 10:02:32 -08:00
John Schulman	bb40378118	change atari preprocessing to use faster opencv some logger changes	2017-10-25 09:21:29 -04:00
John Schulman	aa6e58bdf1	fix readmes	2017-08-27 22:22:14 -07:00
John Schulman	3f676f7d1e	ACKTR + A2C	2017-08-18 09:25:39 -07:00
Matthias Plappert	882251878f	Parameter space noise for DQN and DDPG (#75 ) * Export param noise * Update documentation * Final finishing touches	2017-07-27 08:10:59 -07:00
Jonas Schneider	5dc00628fe	readme fiddling	2017-07-20 09:00:24 -07:00
John Schulman	da99706046	ppo and trpo	2017-07-20 08:52:35 -07:00
cxx	5e73387494	Fix README since BreakOut pretrained model doesn't match the correct tensor shape. Therefore, Pong is used instead.	2017-06-16 15:38:42 +08:00
Tiago Carvalho	1f3c3e33e7	Update README.md	2017-05-31 12:14:28 +01:00
Olivier Moindrot	d2c51f5933	Correct path to script "download_model" `python -m baselines.deepq.experiments.download_model` becomes `python -m baselines.deepq.experiments.atari.download_model`	2017-05-24 13:13:30 -07:00
Szymon Sidor	958810ed1e	Initial commit	2017-05-24 02:34:20 -07:00

20 Commits