Commit Graph

262 Commits

Author SHA1 Message Date
Antonin RAFFIN
14f2f9328c Bug fix in saving ACER model 2018-03-01 10:24:14 +01:00
Alex Nichol
6bdf2f55a2 Merge pull request #132 from bhatiaabhinav/bug_fixes
Bug fix in saving a2c model.
2018-02-27 19:00:37 -08:00
Alex Nichol
97be70d6c8 fixes for DummyVecEnv
Fixes various problems running MuJoCo tasks.
2018-02-27 18:55:10 -08:00
Matthias Plappert
b71152eea0 Adds support for Hindsight Experience Replay (HER) (#299)
* Add Hindsight Experience Replay (HER)

* Minor improvements
2018-02-26 17:40:16 +01:00
Christopher Hesse
df2e846ab7 export: fix accidental rename 2018-02-14 22:01:16 -08:00
Christopher Hesse
edb52c22a5 export: Fix deepq param noise refactoring, remove atari experiments and azure dependency 2018-02-14 21:42:22 -08:00
Andrei Kashin
98257ef8c9 Flush temporary file before compressing it.
We need to flush the buffer after `pickle.dump`, otherwise the resulting zip archive might be incomplete (reproducible, if the state consists of a single integer).
2018-02-06 07:04:44 -08:00
Oleg Klimov
d9b36601d9 comment about loading weights in ppo2 2018-02-05 12:25:05 -08:00
Oleg Klimov
2793971c10 fix gail tf_util usage 2018-02-05 07:51:27 -08:00
John Schulman
16d7d23b7d Merge pull request #271 from simontudo/add-requirement-cloudpickle
added cloudpickle to requirements
2018-02-02 23:04:53 -08:00
John Schulman
9175b770c6 Merge pull request #273 from simontudo/videorecorder-import
updated videorecorder import
2018-02-02 23:03:51 -08:00
simontudo
615870ad6b updated videorecorder import 2018-02-01 12:09:08 +01:00
simontudo
7bd264e0e9 added cloudpickle to requirements 2018-01-31 10:43:17 +01:00
John Schulman
8d03102d4d Merge pull request #265 from 20chase/patch-1
fix logger error for trpo_mpi
2018-01-29 00:54:51 -08:00
20chase
4a77855529 using mujoco_arg_parser as args
remove origin parser
2018-01-29 16:52:01 +08:00
John Schulman
2e29b41592 Merge pull request #268 from ei-grad/master
Fix fc call in AcerLstmPolicy
2018-01-27 18:42:31 -08:00
Andrew Grigorev
634e37c5b8 Fix fc call in AcerLstmPolicy
The `act` keyword was removed from baselines.a2c.utils.fc in commit 9fa8e1b.
2018-01-27 23:18:02 +03:00
20chase
452b548c2a Merge branch 'master' into patch-1 2018-01-26 14:34:01 +08:00
John Schulman
ebb8afff2e fix trpo_mpi bug where logstd wasn’t included 2018-01-25 21:17:40 -08:00
John Schulman
c9613b2293 Merge pull request #259 from andrewliao11/openai_gail
Add gail maintainer list
2018-01-25 20:54:34 -08:00
John Schulman
459f007bcc Merge pull request #260 from uidilr/master
Add GAIL
2018-01-25 20:54:20 -08:00
John Schulman
9fa8e1baf1 Lots of cleanups
Fixes for new gym version
Add @olegklimov and @unixpickle to authors list
2018-01-25 18:54:24 -08:00
20chase
ac2ea4f31f fix logger error for MPI
Can't run logger.configure() if rank != 0
2018-01-25 22:09:00 +08:00
Yusuke Nakata
d8cce2309f Add GAIL 2018-01-23 12:02:03 +09:00
andrew
0c207f0185 fix typo 2018-01-21 22:13:01 -08:00
andrew
41d41fabe3 add gail maintainer list 2018-01-21 22:12:03 -08:00
John Schulman
b5be53dc92 Merge pull request #229 from andrewliao11/gail
GAIL implementation
2018-01-21 20:30:20 -05:00
Matthias Plappert
49c1a8ec26 Fix bug in parameter space noise DQN 2018-01-16 10:24:30 -08:00
andrew
e5a714b070 fix relative import 2018-01-12 15:12:45 -08:00
John Schulman
f9d1d3349a remove mpirun from ppo2 instructions 2018-01-12 11:05:29 -08:00
Alex Nichol
8c90f67560 don't list TensorFlow as a requirement
fixes #146

A better (more involved) solution might be to check for a TensorFlow installation manually in setup.py and deal with that accordingly.
2017-12-15 15:54:43 -08:00
Andrew
f22bee085d Add files via upload 2017-12-12 19:03:42 -08:00
andrew
4acc71fe23 add x, y, axis name 2017-12-12 18:58:57 -08:00
andrew
2f1b629ecc Merge branch 'gail' of https://github.com/andrewliao11/baselines into gail 2017-12-12 18:56:00 -08:00
andrew
00573cf5e9 add x, y axis name 2017-12-12 18:54:03 -08:00
Andrew
cfa1236d78 Update README.md 2017-12-11 21:21:56 -08:00
Andrew
64288f9f84 Update gail-result.md 2017-12-11 21:19:47 -08:00
Andrew
5f647d4d34 Update README.md 2017-12-11 21:18:05 -08:00
Andrew
6723455b75 Update gail-result.md 2017-12-11 21:15:30 -08:00
Andrew
45a93cf2b9 add training curve from tensorboard 2017-12-11 21:06:04 -08:00
andrew
11604f7cc9 add download link to readme and add description to python file 2017-12-07 12:08:20 -08:00
John Schulman
2444034d11 Merge pull request #194 from ryanjulian/env_lines
Force shebang lines to Python 3
2017-12-04 14:07:01 -08:00
John Schulman
041b6b76b7 Merge pull request #215 from chris-chris/feature/typo-2017-11-19
fix misspellings
2017-12-04 14:02:49 -08:00
John Schulman
5d62b5bdaa Merge pull request #221 from jvmancuso/patch-1
Docstring fix
2017-12-04 14:01:38 -08:00
John Schulman
2fcc9b9572 Merge pull request #226 from definitelyuncertain/master
Call ppo2 and not ppo1 in ppo2 README.md
2017-12-04 14:01:12 -08:00
Andrew
000033973b Update gail-result.md 2017-12-03 15:50:24 -08:00
andrew
6090ee8292 add comparison for expert/BC/gail 2017-12-03 15:46:52 -08:00
andrew
7954327c5f add behavior cloning learn/eval code 2017-12-03 13:55:44 -08:00
andrew
8495890534 add gail, file_writer for tf.summary, and allow specifying var_list for tf.train.Saver 2017-12-03 01:49:42 -08:00
definitelyuncertain
643184935e Call ppo2 and not ppo1 2017-12-02 22:00:28 +05:30
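
Note on commit 98257ef8c9 ("Flush temporary file before compressing it"): the message says the buffer must be flushed after `pickle.dump`, otherwise the zip archive built from the temporary file may be incomplete. The sketch below illustrates that failure mode under stated assumptions and is not the code from that commit. The names `save_state_zipped`, `load_state_zipped`, and `state.pkl` are hypothetical, chosen only for this example.

```python
import os
import pickle
import tempfile
import zipfile


def save_state_zipped(state, zip_path):
    """Pickle `state` to a temporary file, then store that file in a zip archive.

    Hypothetical helper for illustration only.
    """
    with tempfile.TemporaryDirectory() as tmp_dir:
        tmp_path = os.path.join(tmp_dir, "state.pkl")
        with open(tmp_path, "wb") as tmp_file:
            pickle.dump(state, tmp_file)
            # The pickled bytes may still sit in Python's write buffer here.
            # Flush them so that a second reader opening the file by name
            # (zipfile, below) sees the complete content. A small state,
            # such as a single integer, fits entirely in the buffer, so
            # without the flush the archived file could be empty.
            tmp_file.flush()
            with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
                archive.write(tmp_path, arcname="state.pkl")


def load_state_zipped(zip_path):
    """Read the pickled state back out of the archive."""
    with zipfile.ZipFile(zip_path) as archive:
        with archive.open("state.pkl") as member:
            return pickle.load(member)
```

Closing the temporary file before compressing it would have the same effect as the explicit flush; the flush is shown here because it is the fix the commit message describes.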