baselines

Files

pzhokhov d0cc325e14 store session at policy creation time (#655)

* sync internal changes. Make ddpg work with vecenvs

* B -> nenvs for consistency with other algos, small cleanups

* eval_done[d]==True -> eval_done[d]

* flake8 and numpy.random.random_integers deprecation warning

* store session at policy creation time

* coexistence tests

* fix a typo

* autopep8

* ... and flake8

* updated todo links in test_serialization

2018-10-19 08:54:21 -07:00

envs

store session at policy creation time (#655)

2018-10-19 08:54:21 -07:00

__init__.py

refactor a2c, acer, acktr, ppo2, deepq, and trpo_mpi (#490)

2018-08-13 09:56:44 -07:00

test_cartpole.py

disable async acktr (#129)

2018-10-03 14:38:32 -07:00

test_doc_examples.py

tighten flake8, autopep8 to fix trailing whitespaces and blank lines with whitespaces (#87)