baselines

Author	SHA1	Message	Date
Szymon Sidor	80f94f8ec5	bump version	2017-07-12 14:48:05 -07:00
Szymon Sidor	2b1b437908	Update simple.py	2017-07-12 23:42:36 +02:00
Szymon Sidor	04cd0dcf64	Merge pull request #52 from farbeiza/patch-1 Effectively apply weights from the replay buffer	2017-07-12 23:37:28 +02:00
Szymon Sidor	248aad1c3b	Merge pull request #39 from mirceamironenco/master Fix TF graph variables deprecation	2017-07-12 23:32:24 +02:00
Fernando Arbeiza	d76cd1297a	Effectively apply weights from the replay buffer It seems that the weights retrieved from the replay buffer are not applied when training the model. Is there any reason for that or am I missing something? In any case, I have added a parameter in order for them to be used; just in case it is useful.	2017-07-11 11:09:51 +02:00
MironencoMircea	91b10857d8	Fixed TF graph variables deprecation	2017-06-28 15:48:45 +02:00
Szymon Sidor	0778e9f10f	Merge pull request #28 from zach-nervana/patch-1 remove unnecessary initialization of variable resized_screen	2017-06-23 17:05:25 -07:00
Szymon Sidor	59c7887e6b	Merge pull request #26 from LinZichuan/master Update setup.py	2017-06-23 17:02:05 -07:00
Szymon Sidor	3d235ae7b8	Merge pull request #33 from cxxgtxy/master Fix README since BreakOut pretrained model doesn't match the correct …	2017-06-23 16:59:55 -07:00
cxx	5e73387494	Fix README since BreakOut pretrained model doesn't match the correct tensor shape. Therefore, Pong is used instead.	2017-06-16 15:38:42 +08:00
Zach Dwiel	ec38bf460e	remove unnecessary initialization of variable resized_screen	2017-06-09 08:53:10 -04:00
Zichuan Lin	ef1a2402fc	Update setup.py	2017-06-07 17:29:38 +08:00
Szymon Sidor	184440ffd3	Merge pull request #22 from ngc92/doc_fixes docstring and comment fixes	2017-06-04 00:41:34 -07:00
Szymon Sidor	fba0ac30ca	Merge pull request #15 from tiagosgc/patch-1 Update README.md	2017-06-04 00:40:58 -07:00
Szymon Sidor	584261a94a	Merge pull request #14 from quanvuong/master Consistent initial type (float) for episode_rewards	2017-06-04 00:40:42 -07:00
Szymon Sidor	9c10c2fc27	Merge pull request #13 from ppwwyyxx/patch-1 Update setup.py	2017-06-04 00:40:31 -07:00
ngc92	02919483f2	docstring and comment fixes	2017-06-02 01:43:51 +02:00
Tiago Carvalho	1f3c3e33e7	Update README.md	2017-05-31 12:14:28 +01:00
Quan Vuong	86054f7a98	Consistent initial type (float) for episode_rewards	2017-05-30 11:49:25 +08:00
Yuxin Wu	709c327c40	Update setup.py `PongNoFrameskip-v4` seems to require `gym>=0.9.1`	2017-05-29 19:39:25 -07:00
Szymon Sidor	fc2bbed4da	Merge pull request #11 from yenchenlin/fix-typo Fix typos	2017-05-28 12:56:46 -07:00
YenChenLin	4fd1d21845	Fix typo	2017-05-28 13:13:47 -04:00
Szymon Sidor	5c80997d1b	Merge pull request #9 from AAorris/patch-1 tf_util: spelling / docstring updates	2017-05-26 16:24:57 -07:00
Aaron Morris	2361220dfe	tf_util: spelling / docstring updates	2017-05-25 22:53:09 -07:00
Szymon Sidor	7327a1560b	update Atari envs to v4 and warn Python 2 users.	2017-05-25 14:53:29 -07:00
Szymon Sidor	0071b852fd	Merge pull request #2 from omoindrot/patch-1 Correct path to script "download_model"	2017-05-24 14:55:54 -07:00
Olivier Moindrot	d2c51f5933	Correct path to script "download_model" `python -m baselines.deepq.experiments.download_model` becomes `python -m baselines.deepq.experiments.atari.download_model`	2017-05-24 13:13:30 -07:00
Szymon Sidor	958810ed1e	Initial commit	2017-05-24 02:34:20 -07:00

1 2 3

128 Commits