Commit Graph

  • 7446e6ea34 fix typo fix_monitor_close Peter Zhokhov 2020-01-31 05:23:33 -08:00
  • 3bd068c82c actually close the file with the results on Monitor.close() Peter Zhokhov 2020-01-31 05:09:29 -08:00
  • ea25b9e8b2 Monitor should close what it inherits (#1076) master Harry Uglow 2020-01-31 13:06:18 +00:00
  • 9ee399f5b2 Fix build with latest gym (#1034) pzhokhov 2019-11-10 11:10:01 -08:00
  • 8d9e20fec3 narrow down gym version to 0.15.4 <= gym < 0.16.0 fix_build Peter Zhokhov 2019-11-10 11:08:59 -08:00
  • fc23c78c77 fix imports Peter Zhokhov 2019-11-08 15:39:57 -08:00
  • 25f750d84f update to use latest version of gym Peter Zhokhov 2019-11-08 15:31:40 -08:00
  • 391811d98c SubprocVecEnv uses CloudpickleWrapper to send specs (#1028) Tomasz Wrona 2019-11-09 00:23:49 +01:00
  • b99a73afe3 entrypoint variable made public (#970) and Fix RuntimeError (#910) (#1015) (#1032) tf2 SOLARIS 2019-11-09 08:20:54 +09:00
  • 517433f22a flake8 complaints Peter Zhokhov 2019-11-08 15:15:38 -08:00
  • 713f1a0aec tf2: Updated setup.py dependencies. (#1002) Isaac Lascasas 2019-10-26 00:50:04 +02:00
  • 665b888eeb Fix behavior cloning due to API changes (#1014) Yen-Chen Lin 2019-10-25 18:44:43 -04:00
  • f40a477a17 fix tf2 branch name Christopher Hesse 2019-10-25 15:27:30 -07:00
  • c6144bdb6a Fix RuntimeError (#910) (#1015) johannespitz 2019-10-26 00:24:41 +02:00
  • adba88b218 add quote marks to tensorflow < 2 to avoid bash logic Peter Zhokhov 2019-10-11 17:13:43 -07:00
  • bfbc3bae14 update status, fix the tensorflow version in the build Peter Zhokhov 2019-10-11 15:23:14 -07:00
  • f703776c91 fix a bug in acer saving and loading model (#990) Haiyang Chen 2019-09-27 23:39:41 +01:00
  • 53797293e5 use allreduce instead of Allreduce (send pickled data instead of floats) - probably affects performance somewhat, but avoid element number mismatch. Fixes 998 (#1000) pzhokhov 2019-09-27 14:45:31 -07:00
  • 0e423a0108 use allreduce instead of Allreduce (send pickled data instead of floats) - probably affects performance somewhat, but avoid element number mismatch. Fixes 998 fix998 Peter Zhokhov 2019-09-06 14:36:35 -07:00
  • 229a772b81 Release notes for Tensorflow 2.0 support. (#997) tanzhenyu 2019-08-29 14:25:44 -07:00
  • d80b075904 Make SubprocVecEnv works with DummyVecEnv (#908) Tomasz Wrona 2019-08-29 21:16:25 +02:00
  • d1a05a0dd2 Baselines for Tensorflow 2.0. (#978) tanzhenyu 2019-08-08 11:03:17 -07:00
  • 0182fe1877 entrypoint variable made public (#970) NicoBach 2019-08-06 01:03:19 +02:00
  • 1fb4dfb780 Fix typo in GAIL dataset log (#950) Seungjae Ryan Lee 2019-08-06 08:02:43 +09:00
  • 7cadef715f Fix typo (#930) Timo Kaufmann 2019-08-06 01:02:21 +02:00
  • fce4370ba2 Remove duplicate code in adaptive param noise. (#976) tanzhenyu 2019-08-05 16:01:54 -07:00
  • c57528573e Remove model def from deepq. (#946) tanzhenyu 2019-06-27 10:12:38 -07:00
  • 2bca7901f5 Updating the version to 0.1.6 (#933) Marcin Michalski 2019-06-24 19:19:01 +02:00
  • ba2b017820 add log_path flag to command line utility (#917) albert 2019-06-07 15:05:52 -07:00
  • 7c520852d9 Fix converting list of LazyFrames to ndarray (#907) Anton Grigoryev 2019-06-01 02:49:46 +03:00
  • 1c872ca8fd run test_monitor through pytest; fix the test, add flake8 to bench direectory - like PR 891 (#921) pzhokhov 2019-05-31 15:36:20 -07:00
  • ff8d36a7a7 Starting to reassign waiting_step in shmem_vecenv (#915) Jinho Lee 2019-06-01 06:31:35 +09:00
  • 7614b02f7a remove f strings for python back compatibility (#906) Sridhar Thiagarajan 2019-05-31 14:27:11 -07:00
  • f7d5a265e1 suppress excessive messages from unused loggers (#920) Andy Twigg 2019-05-31 14:26:45 -07:00
  • 21776e8f57 Support Tuple observation spaces (#911) Joshua Meier 2019-05-31 17:06:20 -04:00
  • 401a89e515 add tuple pdtype tuple_pdtype Jonathan Raiman 2019-05-23 15:43:48 -07:00
  • 9b68103b73 release Internal changes (#895) pzhokhov 2019-05-08 11:36:10 -07:00
  • 1ab9fae0b5 test fixes internal Peter Zhokhov 2019-05-03 16:36:03 -07:00
  • 75200671c4 fix tests - add matplotlib to setup_requires, put mpi4py import in try-except Peter Zhokhov 2019-05-03 16:29:10 -07:00
  • 46fa1b6453 merge master Peter Zhokhov 2019-05-03 15:57:31 -07:00
  • a1a9bd6174 Merge branch 'internal' of github.com:openai/baselines into internal Peter Zhokhov 2019-05-03 15:56:04 -07:00
  • ef7ac116cb (onp, np) -> (np, jp), switch jax code to use mark_slow decorator (#363) John Schulman 2019-04-26 17:42:43 -07:00
  • 1fa6ac38f1 JRL PPO test with delayed identity env (#355) pzhokhov 2019-04-24 17:04:36 -07:00
  • 07536451ee Procgen fixes (#352) Karl Cobbe 2019-04-23 16:33:30 -07:00
  • 64dfabb8eb Add initializer for process-level setup in SubprocVecEnv (#276) Greg Brockman 2019-04-23 13:40:08 -07:00
  • f5daca8c22 delete unnecessary stuff (#338) John Schulman 2019-04-22 14:45:01 -07:00
  • 8e0282ee94 ci/runtests.sh - pass all folders to pytest (#342) pzhokhov 2019-04-22 14:41:46 -07:00
  • ddcab1606d Procgen Benchmark Updates (#328) Karl Cobbe 2019-04-22 13:12:45 -07:00
  • bc4eef6053 fix tests (#335) Christopher Hesse 2019-04-20 15:06:18 -07:00
  • 967fc8c37f Fixed sequence env minor (#333) John Schulman 2019-04-20 10:08:09 -07:00
  • a93dde3b2b extra functionality in baselines.common.plot_util (#310) pzhokhov 2019-04-17 15:17:27 -07:00
  • b83a66527d Add jrl19 as backend for workbench (#324) John Schulman 2019-04-17 10:13:12 -07:00
  • 07cbf1e26a Grad clipping in MpiAdamOptimizer, transformer changes (#304) John Schulman 2019-04-16 09:20:09 -07:00
  • 5082e5d34b Workbench (#303) Karl Cobbe 2019-04-04 13:52:00 -07:00
  • 376fd88bb8 fix vec monitor infos Christopher Hesse 2019-04-03 16:28:29 -07:00
  • 3301089b48 remove bullet extra, constrain gym version to be >= 0.10.0 (#885) pzhokhov 2019-04-26 16:14:49 -07:00
  • a07fad9066 change rms 2 tfrms switch in vec_normalize to be more explicit (#886) pzhokhov 2019-04-26 16:14:21 -07:00
  • fc0c43b199 RNN support for PPO2 (#859) stateful_rnn JongGyun Kim 2019-04-27 07:17:57 +09:00
  • 5d8041d18e Fix indexing LazyFrames (#875) Taeyeong Jeong 2019-04-20 07:00:09 +09:00
  • fa37beb52e fix commit on atari bms page to point to a public commit Peter Zhokhov 2019-04-06 20:03:32 -07:00
  • 8a97e0df10 fix shuffling bug in ppo1 Peter Zhokhov 2019-04-05 15:23:46 -07:00
  • fabbf2c611 short-circuit framestack wrapper with size 1 (#871) pzhokhov 2019-04-05 15:18:15 -07:00
  • 5d285b318f [Update misc_util.py]: clean up unused helper functions (#751) Xingdong Zuo 2019-04-06 00:16:26 +02:00
  • 3e13bb8dc8 short-circuit framestack wrapper with size 1 like_pr_787 Peter Zhokhov 2019-04-05 15:13:59 -07:00
  • 49a99c7d23 Add eps to normalization (#797) Tim Zaman 2019-04-05 14:46:01 -07:00
  • c79b3373bf parse colon-separated env_id's Peter Zhokhov 2019-04-05 14:43:09 -07:00
  • 96b6a31848 Merge branch 'internal' of github.com:openai/baselines into internal Peter Zhokhov 2019-04-05 14:11:09 -07:00
  • 0a48a1fda9 Merge branch 'master' of github.com:openai/baselines into internal Jacob Hilton 2019-04-03 16:21:48 -07:00
  • ea20c8a034 add score calculator wrapper, forward property lookups on vecenv wrap… (#300) Christopher Hesse 2019-03-27 14:36:28 -07:00
  • a08af5d07d make tests use single-threaded session for determinism of KfacOptimizer (#298) pzhokhov 2019-03-22 16:28:29 -07:00
  • cc88c8e4c0 remove tensorflow dependency from VecEnv Oleg Klimov 2019-03-18 10:58:23 -07:00
  • f2654082b2 Symshapes - gives codegen ability to evaluate same algo on envs with different ob/ac shapes (#262) pzhokhov 2019-03-01 17:11:47 -08:00
  • dadc2c2eb6 Rl19 metalearning (#261) Karl Cobbe 2019-03-01 16:41:17 -08:00
  • d9702e7ccb codegen continuous control experiment pr (#256) pzhokhov 2019-02-28 20:13:05 -08:00
  • f641810ef9 update dmlab30 env (#258) Christopher Hesse 2019-02-27 22:46:43 +00:00
  • 3265098cc6 Merge branch 'master' of github.com:openai/baselines into internal Peter Zhokhov 2019-04-01 16:26:25 -07:00
  • 6d1c6c78d3 Interface for U.make_session changed (#865) Sridhar Thiagarajan 2019-04-01 16:24:02 -07:00
  • 62a9c76f18 fix the definition of TfInput.make_feed_dict. (#812) JongGyun Kim 2019-04-02 07:49:25 +09:00
  • 282c9cc91f fix small bug in plot_results() (#864) Hao-Chih, Lin 2019-04-02 00:48:35 +02:00
  • 096f4d9cf0 neaten up stacking logic in mujoco_dset in gail Peter Zhokhov 2019-04-01 15:47:13 -07:00
  • 16136ddca7 fix bugs: obs_ph normalization in adversary.py (#823) Mingfei 2019-04-02 06:44:31 +08:00
  • b1644157d6 Fixed typo on #092 (#824) Darío Hereñú 2019-04-01 19:41:52 -03:00
  • 58541db226 MPI refer to workers as ranks, not threads. (#833) Yu Feng 2019-04-01 15:38:45 -07:00
  • c02b575f01 ppo2: use time.perf_counter() instead of time.time() for time measurement (#847) zlsh80826 2019-04-02 06:37:32 +08:00
  • 897fa31548 Avoid using default config while requesting available GPUs (#863) Pastafarianist 2019-03-29 23:25:56 +03:00
  • d51f8be8f9 Report episode rewards/length in A2C and ACKTR (#856) Brett Daley 2019-03-28 12:21:48 -04:00
  • 3f2f45acef Merge pull request #860 from openai/build-retro-env-framestack-fix Jacob Hilton 2019-03-25 14:33:15 -07:00
  • b64974eb90 build_env now doesn't apply frame stack to retro games twice Jacob Hilton 2019-03-24 12:27:14 -07:00
  • 6bbc4635e6 Update cmd_util with initializer, env_kwargs, and force_dummy gdb Greg Brockman 2019-03-18 17:42:10 -07:00
  • 1b092434fc remove f-strings for python 3.5 compatibility (#854) pzhokhov 2019-03-16 11:54:47 -07:00
  • 1259f6ab25 check for environment being vectorized in the play logic in run.py Peter Zhokhov 2019-03-11 17:44:03 -07:00
  • 5bc6f53960 merged master Peter Zhokhov 2019-03-11 17:31:03 -07:00
  • 74101a9f24 fix freeze of ppo2 (#849) pzhokhov 2019-03-11 17:28:51 -07:00
  • 90d66776a4 remove one of duplicated lines. (#813) JongGyun Kim 2019-03-07 08:13:01 +09:00
  • b875fb7b5e release Internal changes (#800) pzhokhov 2019-02-27 15:35:31 -08:00
  • fa5cb1e1f5 merged master Peter Zhokhov 2019-02-27 15:05:24 -08:00
  • 675b100190 raised the tolerance on the test_microbatches test Peter Zhokhov 2019-02-27 14:22:24 -08:00
  • adc4388f6b fixes to catch changes in gym Peter Zhokhov 2019-02-27 12:49:40 -08:00
  • 6dedd5d241 flake8 complaints in baselines/her Peter Zhokhov 2019-02-26 16:50:10 -08:00
  • 5c7da772a4 Merge branch 'master' of github.com:openai/games Peter Zhokhov 2019-02-26 15:44:26 -08:00