Commit Graph

128 Commits

Author SHA1 Message Date
andrew
00573cf5e9 add x, y axis name 2017-12-12 18:54:03 -08:00
Andrew
cfa1236d78 Update README.md 2017-12-11 21:21:56 -08:00
Andrew
64288f9f84 Update gail-result.md 2017-12-11 21:19:47 -08:00
Andrew
5f647d4d34 Update README.md 2017-12-11 21:18:05 -08:00
Andrew
6723455b75 Update gail-result.md 2017-12-11 21:15:30 -08:00
Andrew
45a93cf2b9 add training curve from tensorboard 2017-12-11 21:06:04 -08:00
andrew
11604f7cc9 add download link to readme and add description to python file 2017-12-07 12:08:20 -08:00
John Schulman
2444034d11 Merge pull request #194 from ryanjulian/env_lines
Force shebang lines to Python 3
2017-12-04 14:07:01 -08:00
John Schulman
041b6b76b7 Merge pull request #215 from chris-chris/feature/typo-2017-11-19
fix misspellings
2017-12-04 14:02:49 -08:00
John Schulman
5d62b5bdaa Merge pull request #221 from jvmancuso/patch-1
Docstring fix
2017-12-04 14:01:38 -08:00
John Schulman
2fcc9b9572 Merge pull request #226 from definitelyuncertain/master
Call ppo2 and not ppo1 in ppo2 README.md
2017-12-04 14:01:12 -08:00
Andrew
000033973b Update gail-result.md 2017-12-03 15:50:24 -08:00
andrew
6090ee8292 add comparison for expert/BC/gail 2017-12-03 15:46:52 -08:00
andrew
7954327c5f add behavior cloning learn/eval code 2017-12-03 13:55:44 -08:00
andrew
8495890534 add gail, file_writer for tf.summary, and allow specifying var_list for tf.train.Saver 2017-12-03 01:49:42 -08:00
definitelyuncertain
643184935e Call ppo2 and not ppo1 2017-12-02 22:00:28 +05:30
jvmancuso
36e074da56 Update replay_buffer.py 2017-11-27 14:45:50 -05:00
Ubuntu
c33640932f fix misspellings 2017-11-19 01:29:30 +00:00
John Schulman
b05be68c55 add missing files, fix Issue #209 2017-11-16 22:14:30 -08:00
John Schulman
2dd7d307d7 Add ACER, PPO2, and results_plotter.py 2017-11-16 10:02:32 -08:00
Ryan Julian
df889caf11 Force shebang lines to Python 3
This is a Python 3-only library. A shebang with `#!/usr/bin/env python`
will launch python2 on many systems which do not have python3
installed. Setting the shebang to `#!/usr/bin/env python3` will show a
useful error on systems without Python 3.
2017-11-05 15:22:16 -08:00
John Schulman
6a3cbb4bc5 switch append mode to write mode 2017-10-25 22:20:30 -04:00
John Schulman
bb40378118 change atari preprocessing to use faster opencv
some logger changes
2017-10-25 09:21:29 -04:00
John Schulman
4993286230 Merge pull request #160 from mkarutz/fixFrameStackingA2C
Fixes frame stacking in A2C and ACKTR for multi-channel observations
2017-10-09 14:12:28 -07:00
Malcolm Karutz
cc8818f49e Fixes frame stacking in A2C and ACKTR for multi-channel observation spaces. 2017-10-09 13:08:41 +11:00
John Schulman
3eb71a0ece Merge pull request #151 from emansim/master
Fixes the NaN issues in ACKTR + bug in run_mujoco.py
2017-09-30 14:51:56 -07:00
Elman Mansimov
f8663eaf11 fixes acktr_cont issues 2017-09-30 17:21:04 -04:00
Abhinav Bhatia
3d1e171b3a Bug fix in saving a2c model. 2017-09-12 02:35:43 +08:00
John Schulman
699919f1cf Merge pull request #64 from jhumplik/master
Use standardized advantages in trpo.
2017-09-07 01:57:04 -07:00
John Schulman
498b4cfead Merge pull request #128 from louiehelm/louiehelm-patch-1
Fix command lines
2017-09-06 01:04:47 -07:00
Louie Helm
589387403b fix ppo command in readme 2017-09-05 06:06:19 -07:00
Louie Helm
3d3ea6cb16 fix trpo command in readme 2017-09-05 06:04:37 -07:00
John Schulman
902ffcb767 Merge pull request #120 from hamzamerzic/tensorflow_global_variable
Deprecated VARIABLES -> GLOBAL_VARIABLES.
2017-08-28 21:27:23 -07:00
Hamza Merzic
a7320b80c0 Deprecated VARIABLES -> GLOBAL_VARIABLES. 2017-08-28 16:51:48 +02:00
John Schulman
4e2a570eb4 Merge pull request #104 from stevenschmatz/patch-1
Fix relative links in README.md
2017-08-27 22:54:52 -07:00
John Schulman
6f39148452 fix gym req 2017-08-27 22:49:50 -07:00
John Schulman
2f30833043 Merge branch 'master' of github.com:openai/baselines 2017-08-27 22:36:44 -07:00
John Schulman
00cdeff35e add __init__.py 2017-08-27 22:36:24 -07:00
John Schulman
410ef38898 Merge pull request #103 from learnercys/master
Adding links to source files
2017-08-27 22:31:46 -07:00
John Schulman
aa6e58bdf1 fix readmes 2017-08-27 22:22:14 -07:00
John Schulman
d9f194f797 Fix atari wrapper (affecting a2c perf) and pposgd mujoco performance
- removed vf clipping in pposgd - that was severely degrading performance on mujoco because it didn’t account for scale of returns
- switched adam epsilon in pposgd_simple
- brought back no-ops in atari wrapper (oops)
- added readmes
- revamped run_X_benchmark scripts to have standard form
- cleaned up DDPG a little, removed deprecated SimpleMonitor and non-idiomatic usage of logger
2017-08-27 22:14:59 -07:00
Steven Schmatz
06b071c105 Fix relative links in README.md 2017-08-18 13:35:22 -04:00
John Schulman
3f676f7d1e ACKTR + A2C 2017-08-18 09:25:39 -07:00
Carlos Hernandez
b7966b31a5 Adding links to source files 2017-08-18 01:16:00 -06:00
Matthias Plappert
882251878f Parameter space noise for DQN and DDPG (#75)
* Export param noise
* Update documentation
* Final finishing touches
2017-07-27 08:10:59 -07:00
Jan Humplik
4862140cea Use standardized advantages in trpo. 2017-07-23 22:42:55 +02:00
Peter Welinder
df82a15fd3 Fix broken links in DQN readme 2017-07-23 09:58:10 -07:00
Jonas Schneider
5dc00628fe readme fiddling 2017-07-20 09:00:24 -07:00
John Schulman
79b4a8a88e Merge pull request #60 from openai/ppo-trpo
ppo and trpo
2017-07-20 08:55:43 -07:00
John Schulman
da99706046 ppo and trpo 2017-07-20 08:52:35 -07:00