2016-04-27 08:00:58 -07:00
|
|
|
import numpy as np
|
2018-11-29 02:27:27 +01:00
|
|
|
|
2018-02-02 23:01:45 -08:00
|
|
|
import gym
|
|
|
|
from gym import logger
|
2019-01-30 22:39:55 +01:00
|
|
|
from .space import Space
|
2016-04-27 08:00:58 -07:00
|
|
|
|
2019-01-30 22:39:55 +01:00
|
|
|
|
|
|
|
class Box(Space):
|
2016-04-27 08:00:58 -07:00
|
|
|
"""
|
|
|
|
A box in R^n.
|
|
|
|
I.e., each coordinate is bounded.
|
2016-06-11 23:10:58 -07:00
|
|
|
|
|
|
|
Example usage:
|
|
|
|
self.action_space = spaces.Box(low=-10, high=10, shape=(1,))
|
2016-04-27 08:00:58 -07:00
|
|
|
"""
|
Cleanup, removal of unmaintained code (#836)
* add dtype to Box
* remove board_game, debugging, safety, parameter_tuning environments
* massive set of breaking changes
- remove python logging module
- _step, _reset, _seed, _close => non underscored method
- remove benchmark and scoring folder
* Improve render("human"), now resizable, closable window.
* get rid of default step and reset in wrappers, so it doesn’t silently fail for people with underscore methods
* CubeCrash unit test environment
* followup fixes
* MemorizeDigits unit test envrionment
* refactored spaces a bit
fixed indentation
disabled test_env_semantics
* fix unit tests
* fixes
* CubeCrash, MemorizeDigits tested
* gym backwards compatibility patch
* gym backwards compatibility, followup fixes
* changelist, add spaces to main namespaces
* undo_logger_setup for backwards compat
* remove configuration.py
2018-01-25 18:20:14 -08:00
|
|
|
def __init__(self, low=None, high=None, shape=None, dtype=None):
|
2016-04-27 08:00:58 -07:00
|
|
|
"""
|
|
|
|
Two kinds of valid input:
|
Cleanup, removal of unmaintained code (#836)
* add dtype to Box
* remove board_game, debugging, safety, parameter_tuning environments
* massive set of breaking changes
- remove python logging module
- _step, _reset, _seed, _close => non underscored method
- remove benchmark and scoring folder
* Improve render("human"), now resizable, closable window.
* get rid of default step and reset in wrappers, so it doesn’t silently fail for people with underscore methods
* CubeCrash unit test environment
* followup fixes
* MemorizeDigits unit test envrionment
* refactored spaces a bit
fixed indentation
disabled test_env_semantics
* fix unit tests
* fixes
* CubeCrash, MemorizeDigits tested
* gym backwards compatibility patch
* gym backwards compatibility, followup fixes
* changelist, add spaces to main namespaces
* undo_logger_setup for backwards compat
* remove configuration.py
2018-01-25 18:20:14 -08:00
|
|
|
Box(low=-1.0, high=1.0, shape=(3,4)) # low and high are scalars, and shape is provided
|
2018-03-07 18:09:12 +05:00
|
|
|
Box(low=np.array([-1.0,-2.0]), high=np.array([2.0,4.0])) # low and high are arrays of the same shape
|
2016-04-27 08:00:58 -07:00
|
|
|
"""
|
|
|
|
if shape is None:
|
|
|
|
assert low.shape == high.shape
|
Cleanup, removal of unmaintained code (#836)
* add dtype to Box
* remove board_game, debugging, safety, parameter_tuning environments
* massive set of breaking changes
- remove python logging module
- _step, _reset, _seed, _close => non underscored method
- remove benchmark and scoring folder
* Improve render("human"), now resizable, closable window.
* get rid of default step and reset in wrappers, so it doesn’t silently fail for people with underscore methods
* CubeCrash unit test environment
* followup fixes
* MemorizeDigits unit test envrionment
* refactored spaces a bit
fixed indentation
disabled test_env_semantics
* fix unit tests
* fixes
* CubeCrash, MemorizeDigits tested
* gym backwards compatibility patch
* gym backwards compatibility, followup fixes
* changelist, add spaces to main namespaces
* undo_logger_setup for backwards compat
* remove configuration.py
2018-01-25 18:20:14 -08:00
|
|
|
shape = low.shape
|
2016-04-27 08:00:58 -07:00
|
|
|
else:
|
|
|
|
assert np.isscalar(low) and np.isscalar(high)
|
Cleanup, removal of unmaintained code (#836)
* add dtype to Box
* remove board_game, debugging, safety, parameter_tuning environments
* massive set of breaking changes
- remove python logging module
- _step, _reset, _seed, _close => non underscored method
- remove benchmark and scoring folder
* Improve render("human"), now resizable, closable window.
* get rid of default step and reset in wrappers, so it doesn’t silently fail for people with underscore methods
* CubeCrash unit test environment
* followup fixes
* MemorizeDigits unit test envrionment
* refactored spaces a bit
fixed indentation
disabled test_env_semantics
* fix unit tests
* fixes
* CubeCrash, MemorizeDigits tested
* gym backwards compatibility patch
* gym backwards compatibility, followup fixes
* changelist, add spaces to main namespaces
* undo_logger_setup for backwards compat
* remove configuration.py
2018-01-25 18:20:14 -08:00
|
|
|
low = low + np.zeros(shape)
|
|
|
|
high = high + np.zeros(shape)
|
|
|
|
if dtype is None: # Autodetect type
|
|
|
|
if (high == 255).all():
|
|
|
|
dtype = np.uint8
|
|
|
|
else:
|
|
|
|
dtype = np.float32
|
2018-11-29 02:27:27 +01:00
|
|
|
logger.warn("gym.spaces.Box autodetected dtype as {}. Please provide explicit dtype.".format(dtype))
|
Cleanup, removal of unmaintained code (#836)
* add dtype to Box
* remove board_game, debugging, safety, parameter_tuning environments
* massive set of breaking changes
- remove python logging module
- _step, _reset, _seed, _close => non underscored method
- remove benchmark and scoring folder
* Improve render("human"), now resizable, closable window.
* get rid of default step and reset in wrappers, so it doesn’t silently fail for people with underscore methods
* CubeCrash unit test environment
* followup fixes
* MemorizeDigits unit test envrionment
* refactored spaces a bit
fixed indentation
disabled test_env_semantics
* fix unit tests
* fixes
* CubeCrash, MemorizeDigits tested
* gym backwards compatibility patch
* gym backwards compatibility, followup fixes
* changelist, add spaces to main namespaces
* undo_logger_setup for backwards compat
* remove configuration.py
2018-01-25 18:20:14 -08:00
|
|
|
self.low = low.astype(dtype)
|
|
|
|
self.high = high.astype(dtype)
|
2019-02-07 11:29:04 -08:00
|
|
|
super(Box, self).__init__(shape, dtype)
|
2019-01-30 22:39:55 +01:00
|
|
|
self.np_random = np.random.RandomState()
|
|
|
|
|
|
|
|
def seed(self, seed):
|
|
|
|
self.np_random.seed(seed)
|
Cleanup, removal of unmaintained code (#836)
* add dtype to Box
* remove board_game, debugging, safety, parameter_tuning environments
* massive set of breaking changes
- remove python logging module
- _step, _reset, _seed, _close => non underscored method
- remove benchmark and scoring folder
* Improve render("human"), now resizable, closable window.
* get rid of default step and reset in wrappers, so it doesn’t silently fail for people with underscore methods
* CubeCrash unit test environment
* followup fixes
* MemorizeDigits unit test envrionment
* refactored spaces a bit
fixed indentation
disabled test_env_semantics
* fix unit tests
* fixes
* CubeCrash, MemorizeDigits tested
* gym backwards compatibility patch
* gym backwards compatibility, followup fixes
* changelist, add spaces to main namespaces
* undo_logger_setup for backwards compat
* remove configuration.py
2018-01-25 18:20:14 -08:00
|
|
|
|
2016-04-27 08:00:58 -07:00
|
|
|
def sample(self):
|
2019-02-05 17:49:29 -08:00
|
|
|
high = self.high if self.dtype.kind == 'f' else self.high.astype('int64') + 1
|
|
|
|
return self.np_random.uniform(low=self.low, high=high, size=self.low.shape).astype(self.dtype)
|
2018-09-24 20:11:03 +02:00
|
|
|
|
2016-04-27 08:00:58 -07:00
|
|
|
def contains(self, x):
|
|
|
|
return x.shape == self.shape and (x >= self.low).all() and (x <= self.high).all()
|
|
|
|
|
|
|
|
def to_jsonable(self, sample_n):
|
|
|
|
return np.array(sample_n).tolist()
|
2018-09-24 20:11:03 +02:00
|
|
|
|
2016-04-27 08:00:58 -07:00
|
|
|
def from_jsonable(self, sample_n):
|
|
|
|
return [np.asarray(sample) for sample in sample_n]
|
|
|
|
|
|
|
|
def __repr__(self):
|
|
|
|
return "Box" + str(self.shape)
|
2018-11-29 02:27:27 +01:00
|
|
|
|
2016-04-27 08:00:58 -07:00
|
|
|
def __eq__(self, other):
|
2019-03-23 23:18:19 -07:00
|
|
|
return isinstance(other, Box) and np.allclose(self.low, other.low) and np.allclose(self.high, other.high)
|