Update gail-result.md
This commit is contained in:
@@ -3,16 +3,6 @@
|
|||||||
Here's the extensive results for applying GAIL/BC on Mujoco environments, including
|
Here's the extensive results for applying GAIL/BC on Mujoco environments, including
|
||||||
Hopper, Walker2d, HalfCheetah, Humanoid, HumanoidStandup. Eery imitator is evaluated with seed to be 0.
|
Hopper, Walker2d, HalfCheetah, Humanoid, HumanoidStandup. Eery imitator is evaluated with seed to be 0.
|
||||||
|
|
||||||
## details about GAIL imitator
|
|
||||||
|
|
||||||
For all environments, the
|
|
||||||
imitator is trained with 1, 5, 10, 50 trajectories, where each trajectory contains at most
|
|
||||||
1024 transitions, and seed 0, 1, 2, 3, respectively.
|
|
||||||
|
|
||||||
### details about the BC imitators
|
|
||||||
|
|
||||||
All BC imitators are trained with seed 0.
|
|
||||||
|
|
||||||
## Results
|
## Results
|
||||||
|
|
||||||
### Determinstic Polciy (Set std=0)
|
### Determinstic Polciy (Set std=0)
|
||||||
@@ -33,3 +23,12 @@ All BC imitators are trained with seed 0.
|
|||||||
| Humanoid-v1 | <img src='Humanoid-unnormalized-stochastic-scores.png'> | <img src='Humanoid-normalized-stochastic-scores.png'> |
|
| Humanoid-v1 | <img src='Humanoid-unnormalized-stochastic-scores.png'> | <img src='Humanoid-normalized-stochastic-scores.png'> |
|
||||||
| HumanoidStandup-v1 | <img src='HumanoidStandup-unnormalized-stochastic-scores.png'> | <img src='HumanoidStandup-normalized-stochastic-scores.png'> |
|
| HumanoidStandup-v1 | <img src='HumanoidStandup-unnormalized-stochastic-scores.png'> | <img src='HumanoidStandup-normalized-stochastic-scores.png'> |
|
||||||
|
|
||||||
|
### details about GAIL imitator
|
||||||
|
|
||||||
|
For all environments, the
|
||||||
|
imitator is trained with 1, 5, 10, 50 trajectories, where each trajectory contains at most
|
||||||
|
1024 transitions, and seed 0, 1, 2, 3, respectively.
|
||||||
|
|
||||||
|
### details about the BC imitators
|
||||||
|
|
||||||
|
All BC imitators are trained with seed 0.
|
||||||
|
Reference in New Issue
Block a user