Commit Graph

1491 Commits

Author SHA1 Message Date
Trevor Blackwell
f719298cf0 Revert "Revert "Revert "Optimize Atari envs"""
This reverts commit 8f62c0ab91.
2017-03-28 10:50:48 -07:00
Trevor Blackwell
8f62c0ab91 Revert "Revert "Optimize Atari envs""
This reverts commit c8a84d3c1d.
2017-03-28 10:50:09 -07:00
Trevor Blackwell
c8a84d3c1d Revert "Optimize Atari envs"
This reverts commit 3bc4692d12.
2017-03-28 10:49:51 -07:00
gwern
c1bc07bcac Generalized Kelly Coinflip game: persist the hyperparameters as well to avoid confusing users
before, hyperparameters would get reset to the defaults by _reset since any user-specified pareto/beta/normal hyperparameters weren't saved after the original init; tediously store them and pass them back into init on reset
2017-03-28 10:44:56 -07:00
gwern
a844c3b6de Generalized Kelly Coinflip game: add a check to init in case anyone accidentally calls init with seed=False, per tlbtlbtlb 2017-03-28 10:44:56 -07:00
gwern
7c813e87c2 Generalized Kelly Coinflip game: right, observation space has to match the sufficient statistics... 2017-03-28 10:44:56 -07:00
gwern
1de6febf46 Generalized Kelly Coinflip game: preserve the random seed across resets 2017-03-28 10:44:56 -07:00
gwern
dc9e36b895 Kelly Coinflip game: add a generalized version with randomized parameters that turns it into a POMDP for deep RL 2017-03-28 10:44:56 -07:00
gwern
61bf52da6f Kelly Coinflip Game: document 2017-03-28 10:44:56 -07:00
Trevor Blackwell
f19f90452c bump atari-py 2017-03-28 10:41:20 -07:00
Trevor Blackwell
3bc4692d12 Optimize Atari envs
Get screen pixels in a way that doesn’t require swapping the RGB channels.
SeaquestNoFrameskip used to take 3.5 seconds to do 10k steps, now it takes 2.6 seconds.
When also using opencv to resize to (84,84,3), it used to take 7.8 seconds and now takes 7.0.
2017-03-28 09:47:08 -07:00
Szymon Sidor
255a261e5d bump atari-py requirement 2017-03-22 16:45:10 -07:00
Szymon Sidor
33e96e7341 add hero 2017-03-19 16:14:17 -07:00
andrewschreiber
1b6cb28192 fix: remove duplicate PyOpenGL declaration 2017-03-16 10:51:58 -07:00
Trevor Blackwell
dfd2c475fd Fencepost bug on kellycoinflip observation space 2017-03-16 10:50:14 -07:00
gwern
7a2adc688a +toy text game: The Kelly Coinflip Game ( https://www.gwern.net/Coin-flip )
this is a gambling game introduced by 'Rational Decision-Making Under Uncertainty: Observed Betting Patterns on a Biased Coin', by Haghani & Dewey 2016; starting from $25, one makes double-or-nothing bets trying to reach the $250 ceiling in 300 rounds
2017-03-16 10:50:14 -07:00
Trevor Blackwell
85b2623a95 Fix FAQ link 2017-03-15 12:38:44 -07:00
Trevor Blackwell
72da2d5a16 Link to FAQ 2017-03-15 12:36:39 -07:00
Trevor Blackwell
c4321c2791 Bump version 2017-03-15 11:55:15 -07:00
Trevor Blackwell
bd36540a19 Clarify atari observation space comment 2017-03-15 11:55:07 -07:00
Trevor Blackwell
cb8ca5a4aa Address https://github.com/openai/gym/issues/523 2017-03-15 11:54:44 -07:00
Trevor Blackwell
2a0a2a3a2b Forward spec down the tree, rather than copy at init time, since it can be set after wrapping 2017-03-09 18:16:26 -08:00
Jie Tang
7c7d2f820e Add retries when creating benchmark runs 2017-03-08 11:45:56 -08:00
Jie Tang
28913dad94 Add retries to benchmark upload 2017-03-06 13:46:37 -08:00
Jie Tang
07d9885471 Tighten up this docstring 2017-03-06 13:46:37 -08:00
Jie Tang
f869ed01fe Add skip_videos flag for skipping video uploads on large benchmarks 2017-03-06 13:46:37 -08:00
Jie Tang
ffc30879de Refactor benchmark / evaluation uploads 2017-03-06 13:46:37 -08:00
Jie Tang
b76ea32a46 Revert unused scorer descriptions 2017-03-06 13:46:37 -08:00
Oleg Klimov
d6714824ec README: link to What’s New, fix two other glitches 2017-03-07 00:13:51 +03:00
John Schulman
ed35158524 simplify renderer closing 2017-03-05 20:11:24 -08:00
Greg Brockman
c16f6c8a23 Bump version 2017-03-05 16:30:17 -08:00
Greg Brockman
3174039cbf Revert render change for now 2017-03-05 16:30:04 -08:00
Tom Brown
842de12611 Clarify instructions for resolving env.configure related issues 2017-03-05 14:11:43 -08:00
Tom Brown
90a776b4b3 Bump version, note backwards incompatibility 2017-03-05 13:44:39 -08:00
John Schulman
0ee49635f1 add *args, **kwargs to (non-functional) configure 2017-03-05 13:17:32 -08:00
John Schulman
1fb81d4e3f remove MultiDiscrete adaptors 2017-03-05 13:13:14 -08:00
John Schulman
387f14d381 remove configure, simplify core.py 2017-03-05 12:00:51 -08:00
Jie Tang
6f277090ed * Add descriptions of scorers
* Make null_score a property
* Scorers are proper classes
2017-02-27 19:53:02 -08:00
Falcon Dai
ceda767e34 fix pyglet.gl imports 2017-02-27 18:48:19 -08:00
Falcon Dai
1e75f670ed move rendering import. fix issue-518 2017-02-27 18:48:19 -08:00
Trevor Blackwell
874b6fe2af Better error when called on non-discrete action spaces 2017-02-27 10:00:58 -08:00
Trevor Blackwell
8c6468b34d Don't crash if render called before reset 2017-02-27 10:00:48 -08:00
Toshiki Kataoka
93d554bdbb fix typo 2017-02-25 18:51:37 -08:00
Alex Ray
f724a72006 Lint fixes to mujoco environment (#377) 2017-02-22 17:24:27 -08:00
Tom Brown
6d7e50e67b Update environments.md 2017-02-22 17:23:07 -08:00
Jie Tang
77568accd7 Thread episode lengths through when scoring, add tests 2017-02-13 12:29:11 -08:00
Jie Tang
f1f13a2de1 * Print local benchmark score
* Turn off video recording for benchmark_runner
* Add force argument to benchmark runner
2017-02-13 12:28:48 -08:00
John Schulman
6f4f5653de switch to pytest (#495)
* switch to pytest
* remove observation space sampling
* fix test
2017-02-11 22:17:02 -08:00
John Schulman
33a6112e07 don't unit test atari except pong -- it takes too long 2017-02-11 17:52:18 -08:00
John Schulman
1ef3996010 fix to previous commit 2017-02-11 17:43:11 -08:00