Commit Graph

  • 5c62f5c7dd added peterz to baselines authorlist Peter Zhokhov 2018-09-11 12:44:51 -07:00
  • c5d6f2996c Merge branch 'master' of github.com:openai/baselines Peter Zhokhov 2018-09-11 12:40:29 -07:00
  • 29bf587d15 Merge branch 'master' of github.com:openai/baselines Peter Zhokhov 2018-09-11 12:40:29 -07:00
  • 06bdc2860c docstrings about vecenvs Peter Zhokhov 2018-09-11 12:40:23 -07:00
  • 23b2333238 baselines issue #564 (#574) pzhokhov 2018-09-10 11:50:59 -07:00
  • adaa8aefa8 baselines issue #564 (#574) pzhokhov 2018-09-10 11:50:59 -07:00
  • 8614c4ddbf flake8 Peter Zhokhov 2018-09-10 10:41:29 -07:00
  • 59a7ffb84d fixe tests of test_env_after_learn Peter Zhokhov 2018-09-10 10:32:42 -07:00
  • 58b1021b28 Add tensorboard start command for convenience (#569) Daniel Angelov 2018-09-08 01:04:02 +01:00
  • a60e88bff9 git subrepo pull (merge) baselines Peter Zhokhov 2018-09-07 14:42:29 -07:00
  • 75b93b890e implement pdfromlatent in BernoulliPdType (#81) pzhokhov 2018-09-06 16:17:59 -07:00
  • 565b2153d7 Add lots of docstrings (#76) John Schulman 2018-09-07 07:31:30 +09:00
  • 35e95ee85a fix python 3.5 string format compatibility Peter Zhokhov 2018-09-06 12:00:19 -07:00
  • ad219e205d VecNormalize: set env. returns to zero on resets. (#556) Isaac Lascasas 2018-09-06 19:21:50 +02:00
  • be9118bcd8 git subrepo pull (merge) baselines Peter Zhokhov 2018-09-06 10:17:55 -07:00
  • 02a5e7aed5 fixes to readme and baselines/run.py (#80) pzhokhov 2018-09-06 10:17:21 -07:00
  • 87ac8bc317 install roboschool in install.py (#55) pzhokhov 2018-09-05 14:03:13 -07:00
  • cc4215ef4b refactor common.models via registering reflection (#565) Tom 2018-09-07 01:16:06 +08:00
  • 1e9051e87e fixed warning (#464) Clayton Thorrez 2018-09-05 18:12:01 -04:00
  • 43ed76944b Fix mean reward per episode after training Pong. (#562) uronce-cc 2018-09-06 06:06:29 +08:00
  • 7f08c675bb git subrepo pull (merge) baselines Peter Zhokhov 2018-09-04 10:23:29 -07:00
  • b3f966aa02 use env.render in dummy_vec_env.render when num_envs == 1 (#74) pzhokhov 2018-09-04 10:22:32 -07:00
  • 51cefc933b make load_variables compatible with old list format (#71) pzhokhov 2018-08-30 15:32:55 -07:00
  • 7bccb2969f baselines: default logger similar to configure() logger, rcall: don't call logger.configure() for new rl_algs Christopher Hesse 2018-08-30 15:04:40 -07:00
  • 0a40206c6c ncpu needs to be an integer. (#558) old_acktr_cont uronce-cc 2018-09-01 00:02:18 +08:00
  • 1937826784 Fix alien syntax and apply PEP 8 style (#554) Alfredo Canziani 2018-08-30 20:21:25 -04:00
  • b29c8020d7 remove saving model as a pickle file in ppo2 (tries to pull environment in; bad idea - may need to use constructor argument pickling or somesuch if at all necessary) (#69) pzhokhov 2018-08-30 13:40:40 -07:00
  • 4ec308aaa4 fixed syntax Peter Zhokhov 2018-08-30 10:27:18 -07:00
  • 3bbf3f3511 allow_early_resets=True in create_vec_env Peter Zhokhov 2018-08-30 09:40:42 -07:00
  • e5de29a954 instructions for tensorboard (#61) Joshua Meier 2018-08-29 15:25:47 -07:00
  • 2507d335f9 Tensorboard util (#60) Joshua Meier 2018-08-29 15:17:43 -07:00
  • bdd4d385a6 Fix result_plotters in vectorized mujoco environments (#533) Damien Lancry 2018-08-29 01:48:56 +01:00
  • 0961f5dd94 git subrepo pull (merge) baselines Peter Zhokhov 2018-08-27 16:39:51 -07:00
  • 337d913a8f remove reset_task from subproc vec env (#45) Christopher Hesse 2018-08-27 12:48:05 -07:00
  • 34af61a132 baselines: fix dummy vec env render mode (#42) Karl Cobbe 2018-08-26 20:54:38 -07:00
  • 1ea5ec647c export SimpleEnv and assert_envs_equal, fix minor bug in action space (#46) Christopher Hesse 2018-08-24 15:44:56 -07:00
  • 2fc7a1cbee Trigger benchmarks from buildkite (#40) pzhokhov 2018-08-23 13:20:01 -07:00
  • 14c1d69ef4 Reduce duplication in VecEnv subclasses. (#38) John Schulman 2018-08-22 13:54:34 -07:00
  • c8f6d8bac7 address rl-algs issue #169 (missing util functions from rcall) (#30) pzhokhov 2018-08-21 20:51:09 -07:00
  • 3a006ba50e flake8 fixes (#35) pzhokhov 2018-08-21 17:41:25 -07:00
  • c6c0f45cb1 fix 'async' is a reserved word in Python >= 3.7 (#495) (#542) Tom 2018-08-28 03:36:43 +08:00
  • e92a6ad8f4 Update README.md (#537) wangjksjtu 2018-08-28 03:35:48 +08:00
  • 92b9a37257 Updated example commands to run ppo2 (#534) HelgeS 2018-08-24 00:58:27 +02:00
  • cb14da96ca Fix typo on policies documentation (#535) Armin Primadi 2018-08-24 05:56:13 +07:00
  • 3900f2a447 baselines issue 146 (remove tensorflow from setup.py) (#34) pzhokhov 2018-08-21 16:51:37 -07:00
  • 20d22a5d79 Fix baselines build (fails due to lack of mujoco in public baselines container) (#29) pzhokhov 2018-08-20 19:15:17 -07:00
  • caf7b08b4d Baselines issue #525 (lack of docs for recurrent policies) (#27) pzhokhov 2018-08-20 13:54:56 -07:00
  • ca0165cdf5 flake8 complaints Peter Zhokhov 2018-08-17 18:09:13 -07:00
  • eb5b605f86 restore subrepo conftest.py files (#22) pzhokhov 2018-08-17 17:01:40 -07:00
  • a89bee3c8d Merge commit 'refs/subrepo/baselines/fetch' into subrepo/baselines Peter Zhokhov 2018-08-17 13:55:27 -07:00
  • 353bb15e90 deduplicate algorithms in rl-algs and baselines (#18) pzhokhov 2018-08-17 09:40:35 -07:00
  • 64c0c0a043 Setup travis (#12) pzhokhov 2018-08-15 10:34:38 -07:00
  • 5fee99e771 Setup travis (#12) pzhokhov 2018-08-15 10:34:38 -07:00
  • 5edcd6886e Fix argument error in deepq (#508) Youngjin Kim 2018-08-17 06:55:57 +09:00
  • bcde04e710 Fix argument error in deepq (#508) games/master Youngjin Kim 2018-08-17 06:55:57 +09:00
  • cd375ab209 update readmes (#514) pzhokhov 2018-08-16 14:53:49 -07:00
  • 5622a09fa4 update readmes (#514) pzhokhov 2018-08-16 14:53:49 -07:00
  • 0f8d640554 updated README files and deepq.train_cartpole example peterz_update_READMEs Peter Zhokhov 2018-08-16 13:15:51 -07:00
  • 44b91f3454 Merge branch 'master' of github.com:openai/baselines into peterz_update_READMEs Peter Zhokhov 2018-08-16 12:26:51 -07:00
  • 0c2a6936c4 adding a link to repo-wide README Peter Zhokhov 2018-08-16 12:23:06 -07:00
  • 2614f0f65a update per-algorithm READMEs to reflect new way of running algorithms Peter Zhokhov 2018-08-16 12:18:06 -07:00
  • e2da7cd42f Several bugfixes for #504, #505, #506 related to Classic Control and deepq (#507) Pim de Haan 2018-08-16 12:08:53 -07:00
  • 0f281fd0ca flake8 complaint peterz_alex_propagate_vecenv_changes Peter Zhokhov 2018-08-15 10:31:34 -07:00
  • ef4146005a propagate Alex's changes to vecenv module (needs to be done manually until baselines is removed from rl-algs) Peter Zhokhov 2018-08-15 10:26:44 -07:00
  • 11176eedce tests on ubuntu18.04 peterz_ubuntu18_04 Peter Zhokhov 2018-08-14 16:43:13 -07:00
  • b222dd0610 updated links in README to point to master Peter Zhokhov 2018-08-13 16:01:24 -07:00
  • 1870685071 Publish benchmark results (#502) pzhokhov 2018-08-13 15:59:43 -07:00
  • 841da92f4d add code coverage report peterz_codecov_report Peter Zhokhov 2018-08-13 10:44:49 -07:00
  • 8c2aea2add refactor a2c, acer, acktr, ppo2, deepq, and trpo_mpi (#490) pzhokhov 2018-08-13 09:56:44 -07:00
  • 624231827c merged benchmarks branch Peter Zhokhov 2018-08-13 09:28:10 -07:00
  • b650cd862e lstm network builders using tf lstm peterz_tflstm_1 Peter Zhokhov 2018-08-10 14:24:55 -07:00
  • dbcc4e0252 lstm network builders using tf lstm peterz_tflstm Peter Zhokhov 2018-08-10 14:23:45 -07:00
  • 2ebdc28791 lstm network builders using tf lstm peterz_tflstm_with_ppo2 Peter Zhokhov 2018-08-10 14:21:30 -07:00
  • 217b111c88 merged refactor Peter Zhokhov 2018-08-10 14:14:46 -07:00
  • ea68f3b7e6 dummy commit to RUN BENCHMARKS peterz_benchmarks Peter Zhokhov 2018-08-10 09:46:43 -07:00
  • ca721a4be6 Merge branch 'observation-dtype' of github.com:openai/baselines into peterz_benchmarks Peter Zhokhov 2018-08-10 09:45:50 -07:00
  • 72f3572a10 fixed syntax in conv_only RUN BENCHMARKS Peter Zhokhov 2018-08-08 16:24:39 -07:00
  • b9cd941471 dummy commit to RUN BENCHMARKS Peter Zhokhov 2018-08-08 15:59:59 -07:00
  • 0899b71ede scale the images in conv_only RUN BENCHMARKS Peter Zhokhov 2018-08-08 15:15:03 -07:00
  • cc8c9541fb dummy commit to RUN BENCHMARKS Peter Zhokhov 2018-08-08 15:10:39 -07:00
  • cb32522394 enable all benchmarks Peter Zhokhov 2018-08-08 15:10:00 -07:00
  • 1e40ec22be dummy commit to RUN BENCHMARKS Peter Zhokhov 2018-08-08 10:45:18 -07:00
  • 701a36cdfa added a note in README about TfRunningMeanStd and serialization of VecNormalize Peter Zhokhov 2018-08-08 10:44:58 -07:00
  • 366f486e34 Update README.md (#416) Tony Yu Cao 2018-08-08 18:42:10 +01:00
  • 5a7f9847d8 flake8 complaints Peter Zhokhov 2018-08-03 13:59:58 -07:00
  • b63134e5c5 added acer runner (missing import) Peter Zhokhov 2018-08-03 13:31:37 -07:00
  • db314cdeda Merge branch 'peterz_profile_vec_normalize' into peterz_migrate_rlalgs Peter Zhokhov 2018-08-03 11:47:36 -07:00
  • b08c083d91 use VecNormalize with regular RunningMeanStd Peter Zhokhov 2018-08-03 11:44:12 -07:00
  • bfbbe66d9e profiling wip Peter Zhokhov 2018-08-02 11:23:12 -07:00
  • 1c5c6563b7 reverted VecNormalize to use RunningMeanStd (no tf) Peter Zhokhov 2018-08-02 10:55:09 -07:00
  • 1fa8c58da5 reverted VecNormalize to use RunningMeanStd (no tf) Peter Zhokhov 2018-08-02 10:54:07 -07:00
  • f6d1115ead reverted running_mean_std to user property decorators for mean, var, count Peter Zhokhov 2018-08-02 10:32:22 -07:00
  • f6d5a47bed use ncpu=1 for mujoco sessions - gives a bit of a performance speedup Peter Zhokhov 2018-08-02 10:24:21 -07:00
  • c2df27bee4 non-tf normalization benchmark RUN BENCHMARKS Peter Zhokhov 2018-08-02 09:41:41 -07:00
  • 974c15756e changed default ppo2 lr schedule to linear RUN BENCHMARKS Peter Zhokhov 2018-08-01 16:24:44 -07:00
  • ad43fd9a35 add defaults Peter Zhokhov 2018-08-01 16:15:59 -07:00
  • 72c357c638 hardcode names of retro environments Peter Zhokhov 2018-08-01 15:18:59 -07:00
  • e00e5ca016 run ppo_mpi benchmarks only RUN BENCHMARKS Peter Zhokhov 2018-08-01 14:56:08 -07:00
  • 705797f2f0 Merge branch 'peterz_migrate_rlalgs' into peterz_benchmarks Peter Zhokhov 2018-08-01 14:46:40 -07:00
  • fcd84aa831 make_atari_env compatible with mpi Peter Zhokhov 2018-08-01 14:46:18 -07:00