Commit Graph

20 Commits

Author SHA1 Message Date
Peter Zhokhov
2c818245d6 dummy commit to RUN BENCHMARKS 2018-07-25 18:09:30 -07:00
Peter Zhokhov
ae8e7fd16b dummy commit to RUN BENCHMARKS 2018-07-25 18:07:56 -07:00
pzhokhov
24fe3d6576 Import internal repo (#409)
* import rl-algs from 2e3a166 commit

* extra import of the baselines badge

* exported commit with identity test

* proper rng seeding in the test_identity
2018-05-21 15:24:00 -07:00
pzhokhov
9cf95a0054 setup travis ci build (#388)
* simple .travis.yml file

* added static syntax checks of common to .travis.yml

* dockerizing the build

* fix Dockerfile, adding build shield

* cleaning up workdir in Dockerfile and .travis.yml

* .travis.yml fixed common -> baselines/common for style check
2018-05-03 09:43:28 -07:00
pzhokhov
2b0283b9db Readme.md detailed installation instructions (#377)
* changes to README.md files with more detailed installation instructions

* md-fying the changes better

* link on the word homebrew in readme.md

* typos in README.md

* README.md

* removed extra comma sign

* removed sudo from brew command
2018-04-25 17:40:48 -07:00
Matthias Plappert
b71152eea0 Adds support for Hindsight Experience Replay (HER) (#299)
* Add Hindsight Experience Replay (HER)

* Minor improvements
2018-02-26 17:40:16 +01:00
John Schulman
459f007bcc Merge pull request #260 from uidilr/master
Add GAIL
2018-01-25 20:54:20 -08:00
John Schulman
9fa8e1baf1 Lots of cleanups
Fixes for new gym version
Add @olegklimov and @unixpickle to authors list
2018-01-25 18:54:24 -08:00
Yusuke Nakata
d8cce2309f Add GAIL 2018-01-23 12:02:03 +09:00
John Schulman
2dd7d307d7 Add ACER, PPO2, and results_plotter.py 2017-11-16 10:02:32 -08:00
John Schulman
bb40378118 change atari preprocessing to use faster opencv
some logger changes
2017-10-25 09:21:29 -04:00
John Schulman
aa6e58bdf1 fix readmes 2017-08-27 22:22:14 -07:00
John Schulman
3f676f7d1e ACKTR + A2C 2017-08-18 09:25:39 -07:00
Matthias Plappert
882251878f Parameter space noise for DQN and DDPG (#75)
* Export param noise

* Update documentation

* Final finishing touches
2017-07-27 08:10:59 -07:00
Jonas Schneider
5dc00628fe readme fiddling 2017-07-20 09:00:24 -07:00
John Schulman
da99706046 ppo and trpo 2017-07-20 08:52:35 -07:00
cxx
5e73387494 Fix README since BreakOut pretrained model doesn't match the correct tensor shape. Therefore, Pong is used instead. 2017-06-16 15:38:42 +08:00
Tiago Carvalho
1f3c3e33e7 Update README.md 2017-05-31 12:14:28 +01:00
Olivier Moindrot
d2c51f5933 Correct path to script "download_model"
`python -m baselines.deepq.experiments.download_model` becomes `python -m baselines.deepq.experiments.atari.download_model`
2017-05-24 13:13:30 -07:00
Szymon Sidor
958810ed1e Initial commit 2017-05-24 02:34:20 -07:00