Szymon Sidor
80f94f8ec5
bump version
2017-07-12 14:48:05 -07:00
Szymon Sidor
2b1b437908
Update simple.py
2017-07-12 23:42:36 +02:00
Szymon Sidor
04cd0dcf64
Merge pull request #52 from farbeiza/patch-1
Effectively apply weights from the replay buffer
2017-07-12 23:37:28 +02:00
Szymon Sidor
248aad1c3b
Merge pull request #39 from mirceamironenco/master
Fix TF graph variables deprecation
2017-07-12 23:32:24 +02:00
Fernando Arbeiza
d76cd1297a
Effectively apply weights from the replay buffer
It seems that the weights retrieved from the replay buffer are not applied when training the model. Is there any reason for that or am I missing something?
In any case, I have added a parameter in order for them to be used; just in case it is useful.
2017-07-11 11:09:51 +02:00
MironencoMircea
91b10857d8
Fixed TF graph variables deprecation
2017-06-28 15:48:45 +02:00
Szymon Sidor
0778e9f10f
Merge pull request #28 from zach-nervana/patch-1
remove unnecessary initialization of variable resized_screen
2017-06-23 17:05:25 -07:00
Szymon Sidor
59c7887e6b
Merge pull request #26 from LinZichuan/master
Update setup.py
2017-06-23 17:02:05 -07:00
Szymon Sidor
3d235ae7b8
Merge pull request #33 from cxxgtxy/master
Fix README since BreakOut pretrained model doesn't match the correct …
2017-06-23 16:59:55 -07:00
cxx
5e73387494
Fix README since BreakOut pretrained model doesn't match the correct tensor shape. Therefore, Pong is used instead.
2017-06-16 15:38:42 +08:00
Zach Dwiel
ec38bf460e
remove unnecessary initialization of variable resized_screen
2017-06-09 08:53:10 -04:00
Zichuan Lin
ef1a2402fc
Update setup.py
2017-06-07 17:29:38 +08:00
Szymon Sidor
184440ffd3
Merge pull request #22 from ngc92/doc_fixes
docstring and comment fixes
2017-06-04 00:41:34 -07:00
Szymon Sidor
fba0ac30ca
Merge pull request #15 from tiagosgc/patch-1
Update README.md
2017-06-04 00:40:58 -07:00
Szymon Sidor
584261a94a
Merge pull request #14 from quanvuong/master
Consistent initial type (float) for episode_rewards
2017-06-04 00:40:42 -07:00
Szymon Sidor
9c10c2fc27
Merge pull request #13 from ppwwyyxx/patch-1
Update setup.py
2017-06-04 00:40:31 -07:00
ngc92
02919483f2
docstring and comment fixes
2017-06-02 01:43:51 +02:00
Tiago Carvalho
1f3c3e33e7
Update README.md
2017-05-31 12:14:28 +01:00
Quan Vuong
86054f7a98
Consistent initial type (float) for episode_rewards
2017-05-30 11:49:25 +08:00
Yuxin Wu
709c327c40
Update setup.py
`PongNoFrameskip-v4` seems to require `gym>=0.9.1`
2017-05-29 19:39:25 -07:00
Szymon Sidor
fc2bbed4da
Merge pull request #11 from yenchenlin/fix-typo
Fix typos
2017-05-28 12:56:46 -07:00
YenChenLin
4fd1d21845
Fix typo
2017-05-28 13:13:47 -04:00
Szymon Sidor
5c80997d1b
Merge pull request #9 from AAorris/patch-1
tf_util: spelling / docstring updates
2017-05-26 16:24:57 -07:00
Aaron Morris
2361220dfe
tf_util: spelling / docstring updates
2017-05-25 22:53:09 -07:00
Szymon Sidor
7327a1560b
update Atari envs to v4 and warn Python 2 users.
2017-05-25 14:53:29 -07:00
Szymon Sidor
0071b852fd
Merge pull request #2 from omoindrot/patch-1
Correct path to script "download_model"
2017-05-24 14:55:54 -07:00
Olivier Moindrot
d2c51f5933
Correct path to script "download_model"
`python -m baselines.deepq.experiments.download_model` becomes `python -m baselines.deepq.experiments.atari.download_model`
2017-05-24 13:13:30 -07:00
Szymon Sidor
958810ed1e
Initial commit
2017-05-24 02:34:20 -07:00