Files
pzhokhov 8e56ddeac2 Multidiscrete action space compatibility for policy gradient-based methods (#677)
* multidiscrete space compatibility

* flake8 and syntax
2018-10-24 11:01:59 -07:00
..
2017-08-18 09:25:39 -07:00
2018-09-20 16:05:26 -07:00
2018-01-25 18:54:24 -08:00
2018-09-20 16:05:26 -07:00
2018-09-20 16:05:26 -07:00
2018-01-25 18:54:24 -08:00

ACKTR

ACKTR with continuous action spaces

The code of ACKTR has been refactored to handle both discrete and continuous action spaces uniformly. In the original version, discrete and continuous action spaces were handled by different code (actkr_disc.py and acktr_cont.py) with little overlap. If interested in the original version of the acktr for continuous action spaces, use old_acktr_cont branch. Note that original code performs better on the mujoco tasks than the refactored version; we are still investigating why.