Commit Graph

128 Commits

Author SHA1 Message Date
Szymon Sidor
80f94f8ec5 bump version 2017-07-12 14:48:05 -07:00
Szymon Sidor
2b1b437908 Update simple.py 2017-07-12 23:42:36 +02:00
Szymon Sidor
04cd0dcf64 Merge pull request #52 from farbeiza/patch-1
Effectively apply weights from the replay buffer
2017-07-12 23:37:28 +02:00
Szymon Sidor
248aad1c3b Merge pull request #39 from mirceamironenco/master
Fix TF graph variables deprecation
2017-07-12 23:32:24 +02:00
Fernando Arbeiza
d76cd1297a Effectively apply weights from the replay buffer
It seems that the weights retrieved from the replay buffer are not applied when training the model. Is there any reason for that or am I missing something?

In any case, I have added a parameter in order for them to be used; just in case it is useful.
2017-07-11 11:09:51 +02:00
MironencoMircea
91b10857d8 Fixed TF graph variables deprecation 2017-06-28 15:48:45 +02:00
Szymon Sidor
0778e9f10f Merge pull request #28 from zach-nervana/patch-1
remove unnecessary initialization of variable resized_screen
2017-06-23 17:05:25 -07:00
Szymon Sidor
59c7887e6b Merge pull request #26 from LinZichuan/master
Update setup.py
2017-06-23 17:02:05 -07:00
Szymon Sidor
3d235ae7b8 Merge pull request #33 from cxxgtxy/master
Fix README since BreakOut pretrained model doesn't match the correct …
2017-06-23 16:59:55 -07:00
cxx
5e73387494 Fix README since BreakOut pretrained model doesn't match the correct tensor shape. Therefore, Pong is used instead. 2017-06-16 15:38:42 +08:00
Zach Dwiel
ec38bf460e remove unnecessary initialization of variable resized_screen 2017-06-09 08:53:10 -04:00
Zichuan Lin
ef1a2402fc Update setup.py 2017-06-07 17:29:38 +08:00
Szymon Sidor
184440ffd3 Merge pull request #22 from ngc92/doc_fixes
docstring and comment fixes
2017-06-04 00:41:34 -07:00
Szymon Sidor
fba0ac30ca Merge pull request #15 from tiagosgc/patch-1
Update README.md
2017-06-04 00:40:58 -07:00
Szymon Sidor
584261a94a Merge pull request #14 from quanvuong/master
Consistent initial type (float) for episode_rewards
2017-06-04 00:40:42 -07:00
Szymon Sidor
9c10c2fc27 Merge pull request #13 from ppwwyyxx/patch-1
Update setup.py
2017-06-04 00:40:31 -07:00
ngc92
02919483f2 docstring and comment fixes 2017-06-02 01:43:51 +02:00
Tiago Carvalho
1f3c3e33e7 Update README.md 2017-05-31 12:14:28 +01:00
Quan Vuong
86054f7a98 Consistent initial type (float) for episode_rewards 2017-05-30 11:49:25 +08:00
Yuxin Wu
709c327c40 Update setup.py
`PongNoFrameskip-v4` seems to require `gym>=0.9.1`
2017-05-29 19:39:25 -07:00
Szymon Sidor
fc2bbed4da Merge pull request #11 from yenchenlin/fix-typo
Fix typos
2017-05-28 12:56:46 -07:00
YenChenLin
4fd1d21845 Fix typo 2017-05-28 13:13:47 -04:00
Szymon Sidor
5c80997d1b Merge pull request #9 from AAorris/patch-1
tf_util: spelling / docstring updates
2017-05-26 16:24:57 -07:00
Aaron Morris
2361220dfe tf_util: spelling / docstring updates 2017-05-25 22:53:09 -07:00
Szymon Sidor
7327a1560b update Atari envs to v4 and warn Python 2 users. 2017-05-25 14:53:29 -07:00
Szymon Sidor
0071b852fd Merge pull request #2 from omoindrot/patch-1
Correct path to script "download_model"
2017-05-24 14:55:54 -07:00
Olivier Moindrot
d2c51f5933 Correct path to script "download_model"
`python -m baselines.deepq.experiments.download_model` becomes `python -m baselines.deepq.experiments.atari.download_model`
2017-05-24 13:13:30 -07:00
Szymon Sidor
958810ed1e Initial commit 2017-05-24 02:34:20 -07:00