* change rms 2 tfrms switch in vec_normalize to be more explicit
* modify the vec_normalize / use_tf logic a little bit
* typo
* use_tf = False by default
* viz docs
* writing vizualization docs
* documenting plot_util
* docstrings in plot_util
* autopep8 and flake8
* spelling (using default vim spellchecker and ingoring things like dataframe, docstring and etc)
* rephrased viz.md a little bit
* more examples of viz code usage in the docs
* replaced vizualization doc with notebook
* viz docs
* writing vizualization docs
* documenting plot_util
* docstrings in plot_util
* autopep8 and flake8
* spelling (using default vim spellchecker and ingoring things like dataframe, docstring and etc)
* rephrased viz.md a little bit
* move vec_env
* cleaning up rl_common
* tests are passing (but mosts tests are deleted as moved to baselines)
* add benchmark runner for smoke tests
* removed duplicated algos
* route references to rl_algs.a2c to baselines.a2c
* route references to rl_algs.a2c to baselines.a2c
* unify conftest.py
* removing references to duplicated algs from codegen
* removing references to duplicated algs from codegen
* alex's changes to dummy_vec_env
* fixed test_carpole[deepq] testcase by decreasing number of training steps... alex's changes seemed to have fixed the bug and make it train better, but at seed=0 there is a dip in the training curve at 30k steps that fails the test
* codegen tests with atol=1e-6 seem to be unstable
* rl_common.vec_env -> baselines.common.vec_env mass replace
* fixed reference in trpo_mpi
* a2c.util references
* restored rl_algs.bench in sonic_prob
* fix reference in ci/runtests.sh
* simplifed expression in baselines/common/cmd_util
* further increased rtol to 1e-3 in codegen tests
* switched vecenvs to use SimpleImageViewer from gym instead of cv2
* make run.py --play option work with num_envs > 1
* make rosenbrock test reproducible
* git subrepo pull (merge) baselines
subrepo:
subdir: "baselines"
merged: "e23524a5"
upstream:
origin: "git@github.com:openai/baselines.git"
branch: "master"
commit: "bcde04e7"
git-subrepo:
version: "0.4.0"
origin: "git@github.com:ingydotnet/git-subrepo.git"
commit: "74339e8"
* updated baselines README (num-timesteps --> num_timesteps)
* typo in deepq/README.md
* updated benchmark pages with final rewards
* use htmlpreview to render pages
* use htmlpreview to render pages
* use htmlpreview to render pages
* updated README to reflect ppo1 being obsolete
* removed navbars from published benchmark pages
* fixed link in README
* exported rl-algs
* more stuff from rl-algs
* run slow tests
* re-exported rl_algs
* re-exported rl_algs - fixed problems with serialization test and test_cartpole
* replaced atari_arg_parser with common_arg_parser
* run.py can run algos from both baselines and rl_algs
* added approximate humanoid reward with ppo2 into the README for reference
* dummy commit to RUN BENCHMARKS
* dummy commit to RUN BENCHMARKS
* dummy commit to RUN BENCHMARKS
* dummy commit to RUN BENCHMARKS
* very dummy commit to RUN BENCHMARKS
* serialize variables as a dict, not as a list
* running_mean_std uses tensorflow variables
* fixed import in vec_normalize
* dummy commit to RUN BENCHMARKS
* dummy commit to RUN BENCHMARKS
* flake8 complaints
* save all variables to make sure we save the vec_normalize normalization
* benchmarks on ppo2 only RUN BENCHMARKS
* make_atari_env compatible with mpi
* run ppo_mpi benchmarks only RUN BENCHMARKS
* hardcode names of retro environments
* add defaults
* changed default ppo2 lr schedule to linear RUN BENCHMARKS
* non-tf normalization benchmark RUN BENCHMARKS
* use ncpu=1 for mujoco sessions - gives a bit of a performance speedup
* reverted running_mean_std to user property decorators for mean, var, count
* reverted VecNormalize to use RunningMeanStd (no tf)
* reverted VecNormalize to use RunningMeanStd (no tf)
* profiling wip
* use VecNormalize with regular RunningMeanStd
* added acer runner (missing import)
* flake8 complaints
* added a note in README about TfRunningMeanStd and serialization of VecNormalize
* dummy commit to RUN BENCHMARKS
* merged benchmarks branch
* import rl-algs from 2e3a166 commit
* extra import of the baselines badge
* exported commit with identity test
* proper rng seeding in the test_identity
* simple .travis.yml file
* added static syntax checks of common to .travis.yml
* dockerizing the build
* fix Dockerfile, adding build shield
* cleaning up workdir in Dockerfile and .travis.yml
* .travis.yml fixed common -> baselines/common for style check
* changes to README.md files with more detailed installation instructions
* md-fying the changes better
* link on the word homebrew in readme.md
* typos in README.md
* README.md
* removed extra comma sign
* removed sudo from brew command