import time
from collections import deque

import numpy as np

import gym
class RecordEpisodeStatistics(gym.Wrapper):
    """Track cumulative rewards and episode lengths for the wrapped env.

    Works with both single and vectorized environments. When an episode
    finishes, its statistics are written into that step's ``info`` dict under
    the ``"episode"`` key as ``{"r": return, "l": length, "t": elapsed}`` and
    appended to the rolling ``return_queue`` / ``length_queue``.
    """

    def __init__(self, env, deque_size=100):
        """Wrap *env* and set up the statistics bookkeeping.

        Args:
            env: Environment to wrap (single or vectorized).
            deque_size: Maximum number of finished episodes retained in the
                rolling return/length queues.
        """
        super().__init__(env)
        # Vectorized envs expose num_envs / is_vector_env; default to a
        # single scalar environment when the attributes are absent.
        self.num_envs = getattr(env, "num_envs", 1)
        self.t0 = time.perf_counter()  # reference point for the "t" statistic
        self.episode_count = 0
        # Per-env accumulators; allocated lazily in reset().
        self.episode_returns = None
        self.episode_lengths = None
        self.return_queue = deque(maxlen=deque_size)
        self.length_queue = deque(maxlen=deque_size)
        self.is_vector_env = getattr(env, "is_vector_env", False)

    def reset(self, **kwargs):
        """Reset the environment and zero the per-env accumulators."""
        obs = super().reset(**kwargs)
        self.episode_returns = np.zeros(self.num_envs, dtype=np.float32)
        self.episode_lengths = np.zeros(self.num_envs, dtype=np.int32)
        return obs

    def step(self, action):
        """Step the environment and record stats for any finished episodes."""
        obs, rewards, dones, infos = super().step(action)
        self.episode_returns += rewards
        self.episode_lengths += 1

        # Normalize to list form so scalar and vector envs share one code path.
        if self.is_vector_env:
            infos = list(infos)  # Convert infos to mutable type
        else:
            infos = [infos]
            dones = [dones]

        for idx, done in enumerate(dones):
            if not done:
                continue
            # Copy before mutating so the underlying env's dict is untouched.
            infos[idx] = infos[idx].copy()
            ep_return = self.episode_returns[idx]
            ep_length = self.episode_lengths[idx]
            infos[idx]["episode"] = {
                "r": ep_return,
                "l": ep_length,
                "t": round(time.perf_counter() - self.t0, 6),
            }
            self.return_queue.append(ep_return)
            self.length_queue.append(ep_length)
            self.episode_count += 1
            self.episode_returns[idx] = 0
            self.episode_lengths[idx] = 0

        # Restore the tuple form expected from vectorized envs.
        if self.is_vector_env:
            infos = tuple(infos)

        return (
            obs,
            rewards,
            dones if self.is_vector_env else dones[0],
            infos if self.is_vector_env else infos[0],
        )