Matthias Plappert
a418758cbe
Fix plotting script
2018-03-05 11:00:20 +01:00
Alex Nichol
8b3a6c2051
fix DummyVecEnv reusing buffers
2018-03-02 17:18:07 -08:00
Alex Nichol
569bd42629
Merge pull request #308 from araffin/master
...
Bug fix in saving ACER model
2018-03-01 10:45:04 -08:00
Daniel Ziegler
f49a9c3d85
Fix bug in DDPG parameter space noise adaptation (#306)
...
The training loop used the rollout step variable `t` rather than the
training step variable `t_train` to decide when to adapt the scale of
the parameter space noise.
2018-03-01 18:00:34 +01:00
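A toy sketch of why the choice of counter matters in the commit above. This is not the actual DDPG training loop: the function name, its arguments, and the loop structure are all assumptions made for illustration. Only the names `t` and `t_train` come from the commit message.

```python
def adaptation_points(nb_rollout_steps, nb_train_steps, interval, use_train_step=True):
    """Return the training steps at which the noise scale would be adapted.

    Hypothetical helper: it mimics a rollout phase followed by a training
    phase and records when `step % interval == 0` holds.
    """
    t = 0  # rollout step counter
    for _ in range(nb_rollout_steps):
        t += 1  # stand-in for one environment step

    adapted_at = []
    for t_train in range(nb_train_steps):
        # The fix: decide on `t_train`; the bug: decide on `t`, which no
        # longer changes once the rollout phase is over.
        step = t_train if use_train_step else t
        if step % interval == 0:
            adapted_at.append(t_train)
    return adapted_at
```

With 100 rollout steps, 50 training steps and an interval of 50, the `t_train` variant adapts once (at training step 0), while the `t` variant adapts on every training step because `t` is stuck at 100. With an interval of 30 the `t` variant never adapts at all.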
Antonin RAFFIN
14f2f9328c
Bug fix in saving ACER model
2018-03-01 10:24:14 +01:00
Alex Nichol
6bdf2f55a2
Merge pull request #132 from bhatiaabhinav/bug_fixes
...
Bug fix in saving a2c model.
2018-02-27 19:00:37 -08:00
Alex Nichol
97be70d6c8
fixes for DummyVecEnv
...
Fixes various problems running MuJoCo tasks.
2018-02-27 18:55:10 -08:00
Matthias Plappert
b71152eea0
Adds support for Hindsight Experience Replay (HER) (#299)
...
* Add Hindsight Experience Replay (HER)
* Minor improvements
2018-02-26 17:40:16 +01:00
Christopher Hesse
df2e846ab7
export: fix accidental rename
2018-02-14 22:01:16 -08:00
Christopher Hesse
edb52c22a5
export: Fix deepq param noise refactoring, remove atari experiments and azure dependency
2018-02-14 21:42:22 -08:00
Andrei Kashin
98257ef8c9
Flush temporary file before compressing it.
...
We need to flush the buffer after `pickle.dump`, otherwise the resulting zip archive might be incomplete (reproducible, if the state consists of a single integer).
2018-02-06 07:04:44 -08:00
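A minimal sketch of the pattern the commit above describes, assuming the state is pickled to a temporary file that is still open when it is added to a zip archive. The function names `save_state` and `load_state`, the archive member name `data`, and the file layout are hypothetical and not taken from the repository.

```python
import os
import pickle
import tempfile
import zipfile


def save_state(state, archive_path):
    """Pickle `state` to a temporary file and store it in a zip archive."""
    with tempfile.TemporaryDirectory() as tmp_dir:
        tmp_path = os.path.join(tmp_dir, "state.pkl")
        with open(tmp_path, "wb") as f:
            pickle.dump(state, f)
            # Small payloads can sit entirely in the write buffer; flush so
            # the bytes are on disk before the file is read back for zipping.
            f.flush()
            with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as zf:
                zf.write(tmp_path, "data")


def load_state(archive_path):
    """Read the pickled state back out of the archive."""
    with zipfile.ZipFile(archive_path) as zf:
        return pickle.loads(zf.read("data"))
```

Round-tripping a single integer, the case the commit message calls out, is a quick check: `save_state(7, path)` followed by `load_state(path)` should give back `7`.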
Oleg Klimov
d9b36601d9
comment about loading weights in ppo2
2018-02-05 12:25:05 -08:00
Oleg Klimov
2793971c10
fix gail tf_util usage
2018-02-05 07:51:27 -08:00
John Schulman
16d7d23b7d
Merge pull request #271 from simontudo/add-requirement-cloudpickle
...
added cloudpickle to requirements
2018-02-02 23:04:53 -08:00
John Schulman
9175b770c6
Merge pull request #273 from simontudo/videorecorder-import
...
updated videorecorder import
2018-02-02 23:03:51 -08:00
simontudo
615870ad6b
updated videorecorder import
2018-02-01 12:09:08 +01:00
simontudo
7bd264e0e9
added cloudpickle to requirements
2018-01-31 10:43:17 +01:00
John Schulman
8d03102d4d
Merge pull request #265 from 20chase/patch-1
...
fix logger error for trpo_mpi
2018-01-29 00:54:51 -08:00
20chase
4a77855529
using mujoco_arg_parser as args
...
remove origin parser
2018-01-29 16:52:01 +08:00
John Schulman
2e29b41592
Merge pull request #268 from ei-grad/master
...
Fix fc call in AcerLstmPolicy
2018-01-27 18:42:31 -08:00
Andrew Grigorev
634e37c5b8
Fix fc call in AcerLstmPolicy
...
The `act` keyword was removed from baselines.a2c.utils.fc in commit 9fa8e1b.
2018-01-27 23:18:02 +03:00
20chase
452b548c2a
Merge branch 'master' into patch-1
2018-01-26 14:34:01 +08:00
John Schulman
ebb8afff2e
fix trpo_mpi bug where logstd wasn’t included
2018-01-25 21:17:40 -08:00
John Schulman
c9613b2293
Merge pull request #259 from andrewliao11/openai_gail
...
Add gail maintainer list
2018-01-25 20:54:34 -08:00
John Schulman
459f007bcc
Merge pull request #260 from uidilr/master
...
Add GAIL
2018-01-25 20:54:20 -08:00
John Schulman
9fa8e1baf1
Lots of cleanups
...
Fixes for new gym version
Add @olegklimov and @unixpickle to authors list
2018-01-25 18:54:24 -08:00
20chase
ac2ea4f31f
fix logger error for MPI
...
Can't run logger.configure() if rank != 0
2018-01-25 22:09:00 +08:00
Yusuke Nakata
d8cce2309f
Add GAIL
2018-01-23 12:02:03 +09:00
andrew
0c207f0185
fix typo
2018-01-21 22:13:01 -08:00
andrew
41d41fabe3
add gail maintainer list
2018-01-21 22:12:03 -08:00
John Schulman
b5be53dc92
Merge pull request #229 from andrewliao11/gail
...
GAIL implementation
2018-01-21 20:30:20 -05:00
Matthias Plappert
49c1a8ec26
Fix bug in parameter space noise DQN
2018-01-16 10:24:30 -08:00
andrew
e5a714b070
fix relative import
2018-01-12 15:12:45 -08:00
John Schulman
f9d1d3349a
remove mpirun from ppo2 instructions
2018-01-12 11:05:29 -08:00
Alex Nichol
8c90f67560
don't list TensorFlow as a requirement
...
fixes #146
A better (more involved) solution might be to check for a TensorFlow installation manually in setup.py and deal with that accordingly.
2017-12-15 15:54:43 -08:00
Andrew
f22bee085d
Add files via upload
2017-12-12 19:03:42 -08:00
andrew
4acc71fe23
add x, y, axis name
2017-12-12 18:58:57 -08:00
andrew
2f1b629ecc
Merge branch 'gail' of https://github.com/andrewliao11/baselines into gail
2017-12-12 18:56:00 -08:00
andrew
00573cf5e9
add x, y axis name
2017-12-12 18:54:03 -08:00
Andrew
cfa1236d78
Update README.md
2017-12-11 21:21:56 -08:00
Andrew
64288f9f84
Update gail-result.md
2017-12-11 21:19:47 -08:00
Andrew
5f647d4d34
Update README.md
2017-12-11 21:18:05 -08:00
Andrew
6723455b75
Update gail-result.md
2017-12-11 21:15:30 -08:00
Andrew
45a93cf2b9
add training curve from tensorboard
2017-12-11 21:06:04 -08:00
andrew
11604f7cc9
add download link to readme and add description to python file
2017-12-07 12:08:20 -08:00
John Schulman
2444034d11
Merge pull request #194 from ryanjulian/env_lines
...
Force shebang lines to Python 3
2017-12-04 14:07:01 -08:00
John Schulman
041b6b76b7
Merge pull request #215 from chris-chris/feature/typo-2017-11-19
...
fix misspellings
2017-12-04 14:02:49 -08:00
John Schulman
5d62b5bdaa
Merge pull request #221 from jvmancuso/patch-1
...
Docstring fix
2017-12-04 14:01:38 -08:00
John Schulman
2fcc9b9572
Merge pull request #226 from definitelyuncertain/master
...
Call ppo2 and not ppo1 in ppo2 README.md
2017-12-04 14:01:12 -08:00
Andrew
000033973b
Update gail-result.md
2017-12-03 15:50:24 -08:00