Gymnasium/gym/vector/vector_env.py

import gym
from gym.spaces import Tuple
from gym.vector.utils.spaces import batch_space

__all__ = ['VectorEnv']


class VectorEnv(gym.Env):
    r"""Base class for vectorized environments.

    Each observation returned from vectorized environment is a batch of observations 
    for each sub-environment. And :meth:`step` is also expected to receive a batch of 
    actions for each sub-environment.
    
    .. note::
    
        All sub-environments should share the identical observation and action spaces.
        In other words, a vector of multiple different environments is not supported. 

    Parameters
    ----------
    num_envs : int
        Number of environments in the vectorized environment.

    observation_space : `gym.spaces.Space` instance
        Observation space of a single environment.

    action_space : `gym.spaces.Space` instance
        Action space of a single environment.
    """
    def __init__(self, num_envs, observation_space, action_space):
        super(VectorEnv, self).__init__()
        self.num_envs = num_envs
        self.observation_space = batch_space(observation_space, n=num_envs)
        self.action_space = Tuple((action_space,) * num_envs)

        self.closed = False
        self.viewer = None

        # The observation and action spaces of a single environment are
        # kept in separate properties
        self.single_observation_space = observation_space
        self.single_action_space = action_space

    def reset_async(self):
        pass

    def reset_wait(self, **kwargs):
        raise NotImplementedError()

    def reset(self):
        r"""Reset all sub-environments and return a batch of initial observations.
        
        Returns
        -------
        observations : sample from `observation_space`
            A batch of observations from the vectorized environment.
        """
        self.reset_async()
        return self.reset_wait()

    def step_async(self, actions):
        pass

    def step_wait(self, **kwargs):
        raise NotImplementedError()

    def step(self, actions):
        r"""Take an action for each sub-environments. 

        Parameters
        ----------
        actions : iterable of samples from `action_space`
            List of actions.

        Returns
        -------
        observations : sample from `observation_space`
            A batch of observations from the vectorized environment.

        rewards : `np.ndarray` instance (dtype `np.float_`)
            A vector of rewards from the vectorized environment.

        dones : `np.ndarray` instance (dtype `np.bool_`)
            A vector whose entries indicate whether the episode has ended.

        infos : list of dict
            A list of auxiliary diagnostic information dicts from sub-environments.
        """

        self.step_async(actions)
        return self.step_wait()

    def close_extras(self, **kwargs):
        r"""Clean up the extra resources e.g. beyond what's in this base class. """
        raise NotImplementedError()

    def close(self, **kwargs):
        r"""Close all sub-environments and release resources.
        
        It also closes all the existing image viewers, then calls :meth:`close_extras` and set
        :attr:`closed` as ``True``. 
        
        .. warning::
        
            This function itself does not close the environments, it should be handled
            in :meth:`close_extras`. This is generic for both synchronous and asynchronous 
            vectorized environments. 
        
        .. note::
        
            This will be automatically called when garbage collected or program exited. 
            
        """
        if self.closed:
            return
        if self.viewer is not None:
            self.viewer.close()
        self.close_extras(**kwargs)
        self.closed = True

    def seed(self, seeds=None):
        """
        Parameters
        ----------
        seeds : list of int, or int, optional
            Random seed for each individual environment. If `seeds` is a list of
            length `num_envs`, then the items of the list are chosen as random
            seeds. If `seeds` is an int, then each environment uses the random
            seed `seeds + n`, where `n` is the index of the environment (between
            `0` and `num_envs - 1`).
        """
        pass

    def __del__(self):
        if hasattr(self, 'closed'):
            if not self.closed:
                self.close(terminate=True)
Vectorized environments (#1513) * Initial version of vectorized environments * Raise an exception in the main process if child process raises an exception * Add list of exposed functions in vector module * Use deepcopy instead of np.copy * Add documentation for vector utils * Add tests for copy in AsyncVectorEnv * Add example in documentation for batch_space * Add cloudpickle dependency in setup.py * Fix __del__ in VectorEnv * Check if all observation spaces are equal in AsyncVectorEnv * Check if all observation spaces are equal in SyncVectorEnv * Fix spaces non equality in SyncVectorEnv for Python 2 * Handle None parameter in create_empty_array * Fix check_observation_space with spaces equality * Raise an exception when operations are out of order in AsyncVectorEnv * Add version requirement for cloudpickle * Use a state instead of binary flags in AsyncVectorEnv * Use numpy.zeros when initializing observations in vectorized environments * Remove poll from public API in AsyncVectorEnv * Remove close_extras from VectorEnv * Add test between AsyncVectorEnv and SyncVectorEnv * Remove close in check_observation_space * Add documentation for seed and close * Refactor exceptions for AsyncVectorEnv * Close pipes if the environment raises an error * Add tests for out of order operations * Change default argument in create_empty_array to np.zeros * Add get_attr and set_attr methods to VectorEnv * Improve consistency in SyncVectorEnv 2019-06-21 17:29:44 -04:00			`import gym`
			`from gym.spaces import Tuple`
			`from gym.vector.utils.spaces import batch_space`

			`__all__ = ['VectorEnv']`


			`class VectorEnv(gym.Env):`
[Update vector_env.py]: add some documentations (#1629) * Update vector_env.py * Update vector_env.py 2019-10-25 23:17:29 +02:00			`r"""Base class for vectorized environments.`

			`Each observation returned from vectorized environment is a batch of observations`
			for each sub-environment. And :meth:`step` is also expected to receive a batch of
			`actions for each sub-environment.`

			`.. note::`

			`All sub-environments should share the identical observation and action spaces.`
			`In other words, a vector of multiple different environments is not supported.`
Vectorized environments (#1513) * Initial version of vectorized environments * Raise an exception in the main process if child process raises an exception * Add list of exposed functions in vector module * Use deepcopy instead of np.copy * Add documentation for vector utils * Add tests for copy in AsyncVectorEnv * Add example in documentation for batch_space * Add cloudpickle dependency in setup.py * Fix __del__ in VectorEnv * Check if all observation spaces are equal in AsyncVectorEnv * Check if all observation spaces are equal in SyncVectorEnv * Fix spaces non equality in SyncVectorEnv for Python 2 * Handle None parameter in create_empty_array * Fix check_observation_space with spaces equality * Raise an exception when operations are out of order in AsyncVectorEnv * Add version requirement for cloudpickle * Use a state instead of binary flags in AsyncVectorEnv * Use numpy.zeros when initializing observations in vectorized environments * Remove poll from public API in AsyncVectorEnv * Remove close_extras from VectorEnv * Add test between AsyncVectorEnv and SyncVectorEnv * Remove close in check_observation_space * Add documentation for seed and close * Refactor exceptions for AsyncVectorEnv * Close pipes if the environment raises an error * Add tests for out of order operations * Change default argument in create_empty_array to np.zeros * Add get_attr and set_attr methods to VectorEnv * Improve consistency in SyncVectorEnv 2019-06-21 17:29:44 -04:00
			`Parameters`
			`----------`
			`num_envs : int`
			`Number of environments in the vectorized environment.`

			observation_space : `gym.spaces.Space` instance
			`Observation space of a single environment.`

			action_space : `gym.spaces.Space` instance
			`Action space of a single environment.`
			`"""`
			`def __init__(self, num_envs, observation_space, action_space):`
			`super(VectorEnv, self).__init__()`
			`self.num_envs = num_envs`
			`self.observation_space = batch_space(observation_space, n=num_envs)`
			`self.action_space = Tuple((action_space,) * num_envs)`

			`self.closed = False`
			`self.viewer = None`

			`# The observation and action spaces of a single environment are`
			`# kept in separate properties`
			`self.single_observation_space = observation_space`
			`self.single_action_space = action_space`

			`def reset_async(self):`
			`pass`

			`def reset_wait(self, **kwargs):`
			`raise NotImplementedError()`

			`def reset(self):`
[Update vector_env.py]: add some documentations (#1629) * Update vector_env.py * Update vector_env.py 2019-10-25 23:17:29 +02:00			`r"""Reset all sub-environments and return a batch of initial observations.`

vector env updates (#1706) * make daemon=True an option of async_vector_env * custom worker in async_vector_env * add compatibility methods to SyncVectorEnv * fix name in sync_vector_env * vectorenv api cleanup * add docstrings for daemon and worker options in AsyncVectorEnv 2019-10-09 15:08:10 -07:00			`Returns`
			`-------`
			observations : sample from `observation_space`
			`A batch of observations from the vectorized environment.`
			`"""`
Vectorized environments (#1513) * Initial version of vectorized environments * Raise an exception in the main process if child process raises an exception * Add list of exposed functions in vector module * Use deepcopy instead of np.copy * Add documentation for vector utils * Add tests for copy in AsyncVectorEnv * Add example in documentation for batch_space * Add cloudpickle dependency in setup.py * Fix __del__ in VectorEnv * Check if all observation spaces are equal in AsyncVectorEnv * Check if all observation spaces are equal in SyncVectorEnv * Fix spaces non equality in SyncVectorEnv for Python 2 * Handle None parameter in create_empty_array * Fix check_observation_space with spaces equality * Raise an exception when operations are out of order in AsyncVectorEnv * Add version requirement for cloudpickle * Use a state instead of binary flags in AsyncVectorEnv * Use numpy.zeros when initializing observations in vectorized environments * Remove poll from public API in AsyncVectorEnv * Remove close_extras from VectorEnv * Add test between AsyncVectorEnv and SyncVectorEnv * Remove close in check_observation_space * Add documentation for seed and close * Refactor exceptions for AsyncVectorEnv * Close pipes if the environment raises an error * Add tests for out of order operations * Change default argument in create_empty_array to np.zeros * Add get_attr and set_attr methods to VectorEnv * Improve consistency in SyncVectorEnv 2019-06-21 17:29:44 -04:00			`self.reset_async()`
			`return self.reset_wait()`

			`def step_async(self, actions):`
			`pass`

			`def step_wait(self, **kwargs):`
			`raise NotImplementedError()`

			`def step(self, actions):`
[Update vector_env.py]: add some documentations (#1629) * Update vector_env.py * Update vector_env.py 2019-10-25 23:17:29 +02:00			`r"""Take an action for each sub-environments.`

vector env updates (#1706) * make daemon=True an option of async_vector_env * custom worker in async_vector_env * add compatibility methods to SyncVectorEnv * fix name in sync_vector_env * vectorenv api cleanup * add docstrings for daemon and worker options in AsyncVectorEnv 2019-10-09 15:08:10 -07:00			`Parameters`
			`----------`
			actions : iterable of samples from `action_space`
			`List of actions.`

			`Returns`
			`-------`
			observations : sample from `observation_space`
			`A batch of observations from the vectorized environment.`

			rewards : `np.ndarray` instance (dtype `np.float_`)
			`A vector of rewards from the vectorized environment.`

			dones : `np.ndarray` instance (dtype `np.bool_`)
			`A vector whose entries indicate whether the episode has ended.`

			`infos : list of dict`
[Update vector_env.py]: add some documentations (#1629) * Update vector_env.py * Update vector_env.py 2019-10-25 23:17:29 +02:00			`A list of auxiliary diagnostic information dicts from sub-environments.`
vector env updates (#1706) * make daemon=True an option of async_vector_env * custom worker in async_vector_env * add compatibility methods to SyncVectorEnv * fix name in sync_vector_env * vectorenv api cleanup * add docstrings for daemon and worker options in AsyncVectorEnv 2019-10-09 15:08:10 -07:00			`"""`
[Update vector_env.py]: add some documentations (#1629) * Update vector_env.py * Update vector_env.py 2019-10-25 23:17:29 +02:00
Vectorized environments (#1513) * Initial version of vectorized environments * Raise an exception in the main process if child process raises an exception * Add list of exposed functions in vector module * Use deepcopy instead of np.copy * Add documentation for vector utils * Add tests for copy in AsyncVectorEnv * Add example in documentation for batch_space * Add cloudpickle dependency in setup.py * Fix __del__ in VectorEnv * Check if all observation spaces are equal in AsyncVectorEnv * Check if all observation spaces are equal in SyncVectorEnv * Fix spaces non equality in SyncVectorEnv for Python 2 * Handle None parameter in create_empty_array * Fix check_observation_space with spaces equality * Raise an exception when operations are out of order in AsyncVectorEnv * Add version requirement for cloudpickle * Use a state instead of binary flags in AsyncVectorEnv * Use numpy.zeros when initializing observations in vectorized environments * Remove poll from public API in AsyncVectorEnv * Remove close_extras from VectorEnv * Add test between AsyncVectorEnv and SyncVectorEnv * Remove close in check_observation_space * Add documentation for seed and close * Refactor exceptions for AsyncVectorEnv * Close pipes if the environment raises an error * Add tests for out of order operations * Change default argument in create_empty_array to np.zeros * Add get_attr and set_attr methods to VectorEnv * Improve consistency in SyncVectorEnv 2019-06-21 17:29:44 -04:00			`self.step_async(actions)`
			`return self.step_wait()`

[gym.vector]: a generic`close` with user-defined `close_extras` (#1631) * Update vector_env.py * Update sync_vector_env.py * Update async_vector_env.py * Update vector_env.py * Update async_vector_env.py 2019-10-26 00:18:54 +02:00			`def close_extras(self, **kwargs):`
			`r"""Clean up the extra resources e.g. beyond what's in this base class. """`
			`raise NotImplementedError()`

			`def close(self, **kwargs):`
			`r"""Close all sub-environments and release resources.`

			It also closes all the existing image viewers, then calls :meth:`close_extras` and set
			:attr:`closed` as ``True``.

			`.. warning::`

			`This function itself does not close the environments, it should be handled`
			in :meth:`close_extras`. This is generic for both synchronous and asynchronous
			`vectorized environments.`

			`.. note::`

			`This will be automatically called when garbage collected or program exited.`

			`"""`
			`if self.closed:`
			`return`
			`if self.viewer is not None:`
			`self.viewer.close()`
			`self.close_extras(**kwargs)`
			`self.closed = True`

vector env updates (#1706) * make daemon=True an option of async_vector_env * custom worker in async_vector_env * add compatibility methods to SyncVectorEnv * fix name in sync_vector_env * vectorenv api cleanup * add docstrings for daemon and worker options in AsyncVectorEnv 2019-10-09 15:08:10 -07:00			`def seed(self, seeds=None):`
			`"""`
			`Parameters`
			`----------`
			`seeds : list of int, or int, optional`
			Random seed for each individual environment. If `seeds` is a list of
			length `num_envs`, then the items of the list are chosen as random
			seeds. If `seeds` is an int, then each environment uses the random
			seed `seeds + n`, where `n` is the index of the environment (between
			`0` and `num_envs - 1`).
			`"""`
			`pass`

Vectorized environments (#1513) * Initial version of vectorized environments * Raise an exception in the main process if child process raises an exception * Add list of exposed functions in vector module * Use deepcopy instead of np.copy * Add documentation for vector utils * Add tests for copy in AsyncVectorEnv * Add example in documentation for batch_space * Add cloudpickle dependency in setup.py * Fix __del__ in VectorEnv * Check if all observation spaces are equal in AsyncVectorEnv * Check if all observation spaces are equal in SyncVectorEnv * Fix spaces non equality in SyncVectorEnv for Python 2 * Handle None parameter in create_empty_array * Fix check_observation_space with spaces equality * Raise an exception when operations are out of order in AsyncVectorEnv * Add version requirement for cloudpickle * Use a state instead of binary flags in AsyncVectorEnv * Use numpy.zeros when initializing observations in vectorized environments * Remove poll from public API in AsyncVectorEnv * Remove close_extras from VectorEnv * Add test between AsyncVectorEnv and SyncVectorEnv * Remove close in check_observation_space * Add documentation for seed and close * Refactor exceptions for AsyncVectorEnv * Close pipes if the environment raises an error * Add tests for out of order operations * Change default argument in create_empty_array to np.zeros * Add get_attr and set_attr methods to VectorEnv * Improve consistency in SyncVectorEnv 2019-06-21 17:29:44 -04:00			`def __del__(self):`
			`if hasattr(self, 'closed'):`
			`if not self.closed:`
[gym.vector]: a generic`close` with user-defined `close_extras` (#1631) * Update vector_env.py * Update sync_vector_env.py * Update async_vector_env.py * Update vector_env.py * Update async_vector_env.py 2019-10-26 00:18:54 +02:00			`self.close(terminate=True)`