* fixes to enjoy_cartpole, enjoy_mountaincar.py
* fixed {train,enjoy}_pong, removed enjoy_retro
* set number of timesteps to 1e7 in train_pong
* flake8 complaints
* use synchronous version fo acktr in test_env_after_learn
* flake8
* fixes to enjoy_cartpole, enjoy_mountaincar.py
* fixed {train,enjoy}_pong, removed enjoy_retro
* set number of timesteps to 1e7 in train_pong
* flake8 complaints
* use synchronous version fo acktr in test_env_after_learn
* flake8
* implement pdfromlatent in BernoulliPdType
* remove env.close() at the end of algorithms
* test case for environment after learn
* closing env in run.py
* fixes for acktr and trpo_mpi
* add make_session with new graph for every call in test_env_after_learn
* remove extra prints from test_env_after_learn
* Add lots of docstrings
Change hyperparameter transformations for slightly better efficiency and to avoid circular dependency.
Now all parameters are stored in a “human-readable” form.
* improve pretty-print of nodes and trees
* newlines at end-of-file, return graph in render(), assert_valid() fix
* split run_algo_search.py into several simpler scripts
* add joint_train option to get_prob
* minor changes to soln_db and embedding script
* Arguments: -> Args:
* fix replay, part 1
* fix behavior when using unpickled algos
* re-add retrieve_weights
* make training scripts more consistent
* lint
* lint
* lint + remove rendering some rendering functionality from trex env as it’s also elsewhere
* get rid of warnings
* refactor functionality for getting final q-function and losses. revive code for removing useless terms & tests for simplification.
* fix vecenv closing
* finish removing algo folder (most useful functionality has been moved out of it)
* control verbosity of trex
* fix tests
* rename spec => choice_spec, some comments, asserts, debug prints
* fix some tests
* putting instructions from README.md into a script
* install roboschool as a part of setup.py
* install roboschool from install.py
* export pkg_config_path
* remove compilation step from roboschool/setup.py
* removed roboschool install from games install due to extra compilation step
* removed unused import from roboschool/setup.py
* error if logger looks wrong
* check version of logger, call logger.configure() on import
* remove changes entry
* add version to rl-algs
* fix typo
* add comment
* switch version to string
* set logger env variable
* I investigated a bit about running a training in a vectorized monitored mujoco env and found out that the 0.monitor.csv file could not be plotted using baselines.results_plotter.py functions. Moreover the seed is the same in every parallel environments due to the particular behaviour of lambda. this fixes both issues without breaking the function in other files (baselines.acktr.run_mujoco still works)
* unifies make_atari_env and make_mujoco_env
* redefine make_mujoco_env because of run_mujoco in acktr not compatible with DummyVecEnv and SubprocVecEnv
* fix if else
* Update run.py
* rig buildkite pipeline to run benchmarks when commit ends with RUN BENCHMARKS
* fix the buildkite pipeline file
* fix the buildkite pipeline file
* fix the buildkite pipeline file
* fix the buildkite pipeline file
* fix the buildkite pipeline file
* fix the buildkite pipeline file
* fix the buildkite pipeline file - merge test and benchmark steps
* fix the buildkite pipeline file - merge test and benchmark steps
* fix buildkite pipeline file
* fix buildkite pipeline file
* dry RUN BENCHMARKS
* dry RUN BENCHMARKS
* dry not run BENCHMARKS
* not run benchmarks
* not running benchmarks
* no running benchmarks
* no running benchmarks
* still not running benchmarks
* dummy commit to RUN BENCHMARKS
* trigger benchmarks from buildkite RUN BENCHMARKS
* specifying RCALL_KUBE_CLUSTER RUN BENCHMARKS
* remove rl-algs/run-benchmarks-new.py (moved to ci), merged baselines/common/console_util and baselines/common/util.py
* added missing imports in console_util
* clone subrepo over https
* Reduce duplication in VecEnv subclasses.
Now VecEnv base class handles rendering and closing; subclasses should provide get_images and (optionally) close_extras.
* fix tests
* minor docstring change
* raise NotImplementedError
* copied parts of util.py to baselines.common from rcall
* merged fix for baselines.logger, resolved conflicts
* copied ccap to baselines/baselines/common/util.py