Commit Graph

  • 390b51597a benchmarks on ppo2 only RUN BENCHMARKS Peter Zhokhov 2018-08-01 11:01:50 -07:00
  • 95104a3592 Merge branch 'peterz_migrate_rlalgs' into peterz_benchmarks Peter Zhokhov 2018-08-01 10:50:29 -07:00
  • 3528f7b992 save all variables to make sure we save the vec_normalize normalization Peter Zhokhov 2018-08-01 10:12:19 -07:00
  • 151e48009e flake8 complaints Peter Zhokhov 2018-07-31 16:25:12 -07:00
  • 92f33335e9 dummy commit to RUN BENCHMARKS Peter Zhokhov 2018-07-31 15:53:18 -07:00
  • af729cff15 dummy commit to RUN BENCHMARKS Peter Zhokhov 2018-07-31 15:37:00 -07:00
  • 10f815fe1d fixed import in vec_normalize Peter Zhokhov 2018-07-31 15:19:43 -07:00
  • 8c4adac898 running_mean_std uses tensorflow variables Peter Zhokhov 2018-07-31 14:45:55 -07:00
  • 2a93ea8782 serialize variables as a dict, not as a list Peter Zhokhov 2018-07-31 11:13:31 -07:00
  • 9c48f9fad5 very dummy commit to RUN BENCHMARKS Peter Zhokhov 2018-07-31 10:23:43 -07:00
  • 348cbb4b71 dummy commit to RUN BENCHMARKS Peter Zhokhov 2018-07-31 09:42:23 -07:00
  • a1602ab15f dummy commit to RUN BENCHMARKS Peter Zhokhov 2018-07-30 17:51:16 -07:00
  • e63e69bb14 dummy commit to RUN BENCHMARKS Peter Zhokhov 2018-07-30 17:39:22 -07:00
  • 385e7e5c0d dummy commit to RUN BENCHMARKS Peter Zhokhov 2018-07-30 17:21:05 -07:00
  • d112a2e49f added approximate humanoid reward with ppo2 into the README for reference Peter Zhokhov 2018-07-30 16:58:31 -07:00
  • e662dd6409 run.py can run algos from both baselines and rl_algs Peter Zhokhov 2018-07-30 16:09:48 -07:00
  • efc6bffce3 replaced atari_arg_parser with common_arg_parser Peter Zhokhov 2018-07-30 15:58:56 -07:00
  • 872181d4c3 re-exported rl_algs - fixed problems with serialization test and test_cartpole Peter Zhokhov 2018-07-30 15:49:48 -07:00
  • 628ddecf6a re-exported rl_algs Peter Zhokhov 2018-07-30 12:15:46 -07:00
  • 83a4a4be65 run slow tests peter 2018-07-26 14:39:25 -07:00
  • 7edac38c73 more stuff from rl-algs peter 2018-07-26 14:26:57 -07:00
  • a6dca44115 exported rl-algs peter 2018-07-26 14:02:04 -07:00
  • 2c818245d6 dummy commit to RUN BENCHMARKS [peterz_test_benchmarks] Peter Zhokhov 2018-07-25 18:09:30 -07:00
  • ae8e7fd16b dummy commit to RUN BENCHMARKS Peter Zhokhov 2018-07-25 18:07:56 -07:00
  • f272969325 GAIL: bugfix in dataset loading (#447) Adam Gleave 2018-07-06 16:12:14 -07:00
  • 622915c473 fix dtype for wrapper observation spaces [observation-dtype] Karl Cobbe 2018-06-12 14:48:39 -07:00
  • a1d3c18ec0 fix dtype for wrapper observation spaces Karl Cobbe 2018-06-11 13:35:47 -07:00
  • a6b1bc70f1 re-import internal; fix missing tile_images.py (#427) pzhokhov 2018-06-08 09:41:45 -07:00
  • d1021f5885 Merge branch 'master' of https://github.com/openai/baselines into peterz_import_internal [peterz_import_internal] Peter Zhokhov 2018-06-07 14:00:58 -07:00
  • 6fd15ec92d adding missing tile_images.py Peter Zhokhov 2018-06-07 13:59:58 -07:00
  • 36ee5d1707 Import internal changes (#422) pzhokhov 2018-06-06 11:39:13 -07:00
  • fab274b9af import internal Peter Zhokhov 2018-06-06 10:34:05 -07:00
  • 0392dda802 Merge branch 'master' of https://github.com/openai/baselines into peterz_import_internal Peter Zhokhov 2018-06-06 10:33:45 -07:00
  • 24fe3d6576 Import internal repo (#409) pzhokhov 2018-05-21 15:24:00 -07:00
  • 4dc709faae proper rng seeding in the test_identity Peter Zhokhov 2018-05-21 14:46:49 -07:00
  • ea55240732 exported commit with identity test Peter Zhokhov 2018-05-21 13:05:39 -07:00
  • 3d7ed16f1f extra import of the baselines badge Peter Zhokhov 2018-05-16 12:05:35 -07:00
  • efb071949e import rl-algs from 2e3a166 commit [peterz_import_internal_2e3a166] Peter Zhokhov 2018-05-16 12:00:23 -07:00
  • 9cb7ece338 add opencv-python to the dependencies (#407) pzhokhov 2018-05-14 10:52:19 -07:00
  • 9cf95a0054 setup travis ci build (#388) pzhokhov 2018-05-03 09:43:28 -07:00
  • 8b781038cc put filters and running_stat files in common instead of acktr (#389) pzhokhov 2018-05-02 18:42:48 -07:00
  • 69f25c6028 import internal repo (#385) pzhokhov 2018-05-01 16:54:04 -07:00
  • 2b0283b9db Readme.md detailed installation instructions (#377) pzhokhov 2018-04-25 17:40:48 -07:00
  • 1f8a03f3a6 Update README Matthias Plappert 2018-03-26 16:50:22 +02:00
  • 3cc7df0608 Minor fixes to HER release (#319) Matthias Plappert 2018-03-05 11:06:17 +01:00
  • 4ef879d597 Add warning if num_cpu = 1 [her-fixes] Matthias Plappert 2018-03-05 11:00:39 +01:00
  • a418758cbe Fix plotting script Matthias Plappert 2018-03-05 11:00:20 +01:00
  • 8b3a6c2051 fix DummyVecEnv reusing buffers Alex Nichol 2018-03-02 17:18:07 -08:00
  • 569bd42629 Merge pull request #308 from araffin/master Alex Nichol 2018-03-01 10:45:04 -08:00
  • f49a9c3d85 Fix bug in DDPG parameter space noise adaptation (#306) Daniel Ziegler 2018-03-01 09:00:34 -08:00
  • 14f2f9328c Bug fix in saving ACER model Antonin RAFFIN 2018-03-01 10:24:14 +01:00
  • 6bdf2f55a2 Merge pull request #132 from bhatiaabhinav/bug_fixes Alex Nichol 2018-02-27 19:00:37 -08:00
  • 97be70d6c8 fixes for DummyVecEnv Alex Nichol 2018-02-27 18:55:10 -08:00
  • b71152eea0 Adds support for Hindsight Experience Replay (HER) (#299) Matthias Plappert 2018-02-26 17:40:16 +01:00
  • d90638b565 Minor improvements [matthias-her] Matthias Plappert 2018-02-26 10:32:11 +01:00
  • f4953c3c2d Add Hindsight Experience Replay (HER) Matthias Plappert 2018-02-26 10:00:17 +01:00
  • df2e846ab7 export: fix accidental rename Christopher Hesse 2018-02-14 22:01:16 -08:00
  • edb52c22a5 export: Fix deepq param noise refactoring, remove atari experiments and azure dependency Christopher Hesse 2018-02-14 21:42:22 -08:00
  • 98257ef8c9 Flush temporary file before compressing it. Andrei Kashin 2017-10-03 01:29:07 +03:00
  • d9b36601d9 comment about loading weights in ppo2 Oleg Klimov 2018-02-05 12:25:05 -08:00
  • 2793971c10 fix gail tf_util usage Oleg Klimov 2018-02-05 07:51:27 -08:00
  • 16d7d23b7d Merge pull request #271 from simontudo/add-requirement-cloudpickle John Schulman 2018-02-02 23:04:53 -08:00
  • 9175b770c6 Merge pull request #273 from simontudo/videorecorder-import John Schulman 2018-02-02 23:03:51 -08:00
  • 615870ad6b updated videorecorder import simontudo 2018-02-01 12:09:08 +01:00
  • 7bd264e0e9 added cloudpickle to requirements simontudo 2018-01-31 10:43:17 +01:00
  • 8d03102d4d Merge pull request #265 from 20chase/patch-1 John Schulman 2018-01-29 00:54:51 -08:00
  • 4a77855529 using mujoco_arg_parser as args 20chase 2018-01-29 16:52:01 +08:00
  • 2e29b41592 Merge pull request #268 from ei-grad/master John Schulman 2018-01-27 18:42:31 -08:00
  • 634e37c5b8 Fix fc call in AcerLstmPolicy Andrew Grigorev 2018-01-27 23:18:02 +03:00
  • 452b548c2a Merge branch 'master' into patch-1 20chase 2018-01-26 14:34:01 +08:00
  • ebb8afff2e fix trpo_mpi bug where logstd wasn’t included John Schulman 2018-01-25 21:17:40 -08:00
  • c9613b2293 Merge pull request #259 from andrewliao11/openai_gail John Schulman 2018-01-25 20:54:34 -08:00
  • 459f007bcc Merge pull request #260 from uidilr/master John Schulman 2018-01-25 20:54:20 -08:00
  • 9fa8e1baf1 Lots of cleanups; Fixes for new gym version; Add @olegklimov and @unixpickle to authors list John Schulman 2018-01-25 18:33:48 -08:00
  • ac2ea4f31f fix logger error for MPI 20chase 2018-01-25 22:09:00 +08:00
  • d8cce2309f Add GAIL Yusuke Nakata 2018-01-23 12:00:00 +09:00
  • 0c207f0185 fix typo andrew 2018-01-21 22:13:01 -08:00
  • 41d41fabe3 add gail maintainer list andrew 2018-01-21 22:12:03 -08:00
  • b5be53dc92 Merge pull request #229 from andrewliao11/gail John Schulman 2018-01-21 20:30:20 -05:00
  • 49c1a8ec26 Fix bug in parameter space noise DQN Matthias Plappert 2018-01-16 10:24:30 -08:00
  • e5a714b070 fix relative import andrew 2018-01-12 15:12:45 -08:00
  • f9d1d3349a remove mpirun from ppo2 instructions John Schulman 2018-01-12 11:05:29 -08:00
  • 8c90f67560 don't list TensorFlow as a requirement Alex Nichol 2017-12-15 15:54:43 -08:00
  • f22bee085d Add files via upload Andrew 2017-12-12 19:03:42 -08:00
  • 4acc71fe23 add x, y, axis name andrew 2017-12-12 18:58:57 -08:00
  • 2f1b629ecc Merge branch 'gail' of https://github.com/andrewliao11/baselines into gail andrew 2017-12-12 18:56:00 -08:00
  • 00573cf5e9 add x, y axis name andrew 2017-12-12 18:54:03 -08:00
  • cfa1236d78 Update README.md Andrew 2017-12-11 21:21:56 -08:00
  • 64288f9f84 Update gail-result.md Andrew 2017-12-11 21:19:47 -08:00
  • 5f647d4d34 Update README.md Andrew 2017-12-11 21:18:05 -08:00
  • 6723455b75 Update gail-result.md Andrew 2017-12-11 21:15:30 -08:00
  • 45a93cf2b9 add training curve from tensorboard Andrew 2017-12-11 21:06:04 -08:00
  • 11604f7cc9 add download link to readme and add description to python file andrew 2017-12-07 12:08:20 -08:00
  • 2444034d11 Merge pull request #194 from ryanjulian/env_lines John Schulman 2017-12-04 14:07:01 -08:00
  • 041b6b76b7 Merge pull request #215 from chris-chris/feature/typo-2017-11-19 John Schulman 2017-12-04 14:02:49 -08:00
  • 5d62b5bdaa Merge pull request #221 from jvmancuso/patch-1 John Schulman 2017-12-04 14:01:38 -08:00
  • 2fcc9b9572 Merge pull request #226 from definitelyuncertain/master John Schulman 2017-12-04 14:01:12 -08:00
  • 000033973b Update gail-result.md Andrew 2017-12-03 15:50:24 -08:00
  • 6090ee8292 add comparison for expert/BC/gail andrew 2017-12-03 15:46:52 -08:00
  • 7954327c5f add behavior cloning learn/eval code andrew 2017-12-03 13:55:44 -08:00