Gymnasium/gym/envs/classic_control/pendulum.py

from typing import Optional

import gym
from gym import spaces
from gym.utils import seeding
import numpy as np
from os import path


class PendulumEnv(gym.Env):

    metadata = {"render.modes": ["human", "rgb_array"], "video.frames_per_second": 30}

    def __init__(self, g=10.0):
        self.max_speed = 8
        self.max_torque = 2.0
        self.dt = 0.05
        self.g = g
        self.m = 1.0
        self.l = 1.0
        self.viewer = None

        high = np.array([1.0, 1.0, self.max_speed], dtype=np.float32)
        self.action_space = spaces.Box(
            low=-self.max_torque, high=self.max_torque, shape=(1,), dtype=np.float32
        )
        self.observation_space = spaces.Box(low=-high, high=high, dtype=np.float32)

    def step(self, u):
        th, thdot = self.state  # th := theta

        g = self.g
        m = self.m
        l = self.l
        dt = self.dt

        u = np.clip(u, -self.max_torque, self.max_torque)[0]
        self.last_u = u  # for rendering
        costs = angle_normalize(th) ** 2 + 0.1 * thdot ** 2 + 0.001 * (u ** 2)

        newthdot = thdot + (3 * g / (2 * l) * np.sin(th) + 3.0 / (m * l ** 2) * u) * dt
        newthdot = np.clip(newthdot, -self.max_speed, self.max_speed)
        newth = th + newthdot * dt

        self.state = np.array([newth, newthdot])
        return self._get_obs(), -costs, False, {}

    def reset(self, *, seed: Optional[int] = None, options: Optional[dict] = None):
        super().reset(seed=seed)
        high = np.array([np.pi, 1])
        self.state = self.np_random.uniform(low=-high, high=high)
        self.last_u = None
        return self._get_obs()

    def _get_obs(self):
        theta, thetadot = self.state
        return np.array([np.cos(theta), np.sin(theta), thetadot], dtype=np.float32)

    def render(self, mode="human"):
        if self.viewer is None:
            from gym.utils import pyglet_rendering

            self.viewer = pyglet_rendering.Viewer(500, 500)
            self.viewer.set_bounds(-2.2, 2.2, -2.2, 2.2)
            rod = pyglet_rendering.make_capsule(1, 0.2)
            rod.set_color(0.8, 0.3, 0.3)
            self.pole_transform = pyglet_rendering.Transform()
            rod.add_attr(self.pole_transform)
            self.viewer.add_geom(rod)
            axle = pyglet_rendering.make_circle(0.05)
            axle.set_color(0, 0, 0)
            self.viewer.add_geom(axle)
            fname = path.join(path.dirname(__file__), "assets/clockwise.png")
            self.img = pyglet_rendering.Image(fname, 1.0, 1.0)
            self.imgtrans = pyglet_rendering.Transform()
            self.img.add_attr(self.imgtrans)

        self.viewer.add_onetime(self.img)
        self.pole_transform.set_rotation(self.state[0] + np.pi / 2)
        if self.last_u is not None:
            self.imgtrans.scale = (-self.last_u / 2, np.abs(self.last_u) / 2)

        return self.viewer.render(return_rgb_array=mode == "rgb_array")

    def close(self):
        if self.viewer:
            self.viewer.close()
            self.viewer = None


def angle_normalize(x):
    return ((x + np.pi) % (2 * np.pi)) - np.pi
Seeding update (#2422) * Ditch most of the seeding.py and replace np_random with the numpy default_rng. Let's see if tests pass * Updated a bunch of RNG calls from the RandomState API to Generator API * black; didn't expect that, did ya? * Undo a typo * blaaack * More typo fixes * Fixed setting/getting state in multidiscrete spaces * Fix typo, fix a test to work with the new sampling * Correctly (?) pass the randomly generated seed if np_random is called with None as seed * Convert the Discrete sample to a python int (as opposed to np.int64) * Remove some redundant imports * First version of the compatibility layer for old-style RNG. Mainly to trigger tests. * Removed redundant f-strings * Style fixes, removing unused imports * Try to make tests pass by removing atari from the dockerfile * Try to make tests pass by removing atari from the setup * Try to make tests pass by removing atari from the setup * Try to make tests pass by removing atari from the setup * First attempt at deprecating `env.seed` and supporting `env.reset(seed=seed)` instead. Tests should hopefully pass but throw up a million warnings. * black; didn't expect that, didya? * Rename the reset parameter in VecEnvs back to `seed` * Updated tests to use the new seeding method * Removed a bunch of old `seed` calls. Fixed a bug in AsyncVectorEnv * Stop Discrete envs from doing part of the setup (and using the randomness) in init (as opposed to reset) * Add explicit seed to wrappers reset * Remove an accidental return * Re-add some legacy functions with a warning. * Use deprecation instead of regular warnings for the newly deprecated methods/functions 2021-12-08 22:14:15 +01:00			`from typing import Optional`

Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`import gym`
			`from gym import spaces`
[WIP] add support for seeding environments (#135) * Make environments seedable * Fix monitor bugs - Set monitor_id before setting the infix. This was a bug that would yield incorrect results with multiple monitors. - Remove extra pid from stats recorder filename. This should be purely cosmetic. * Start uploading seeds in episode_batch * Fix _bigint_from_bytes for python3 * Set seed explicitly in random_agent * Pass through seed argument * Also pass through random state to spaces * Pass random state into the observation/action spaces * Make all _seed methods return the list of used seeds * Switch over to np.random where possible * Start hashing seeds, and also seed doom engine * Fixup seeding determinism in many cases * Seed before loading the ROM * Make seeding more Python3 friendly * Make the MuJoCo skipping a bit more forgiving * Remove debugging PDB calls * Make setInt argument into raw bytes * Validate and upload seeds * Skip box2d * Make seeds smaller, and change representation of seeds in upload * Handle long seeds * Fix RandomAgent example to be deterministic * Handle integer types correctly in Python2 and Python3 * Try caching pip * Try adding swap * Add df and free calls * Bump swap * Bump swap size * Try setting overcommit * Try other sysctls * Try fixing overcommit * Try just setting overcommit_memory=1 * Add explanatory comment * Add what's new section to readme * BUG: Mark ElevatorAction-ram-v0 as non-deterministic for now * Document seed * Move nondetermistic check into spec 2016-05-29 09:07:09 -07:00			`from gym.utils import seeding`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`import numpy as np`
			`from os import path`

Clean up pendulum environment. (#1877) 2020-04-24 23:56:04 +02:00
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`class PendulumEnv(gym.Env):`
Readded overwritten changes for offset functionality for Discrete spaces (#2470) Co-authored-by: J K Terry <justinkterry@gmail.com> 2021-10-30 21:42:01 +05:30
Blacken the codebase (#2265) 2021-07-29 02:26:34 +02:00			`metadata = {"render.modes": ["human", "rgb_array"], "video.frames_per_second": 30}`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00
Fix gravity constant (#1515) * Added pendulumV2 with SI units * Added pendulumV2 with SI units * Fix gravity constant * Delete pendulumV2.py * Update pendulum.py * Update pendulum.py * Update pendulum.py 2019-06-22 05:00:04 +05:30			`def __init__(self, g=10.0):`
Clean up pendulum environment. (#1877) 2020-04-24 23:56:04 +02:00			`self.max_speed = 8`
Blacken the codebase (#2265) 2021-07-29 02:26:34 +02:00			`self.max_torque = 2.0`
			`self.dt = 0.05`
Fix gravity constant (#1515) * Added pendulumV2 with SI units * Added pendulumV2 with SI units * Fix gravity constant * Delete pendulumV2.py * Update pendulum.py * Update pendulum.py * Update pendulum.py 2019-06-22 05:00:04 +05:30			`self.g = g`
Blacken the codebase (#2265) 2021-07-29 02:26:34 +02:00			`self.m = 1.0`
			`self.l = 1.0`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`self.viewer = None`
Switch to a global PRNG for action/observation spaces (#144) cf https://github.com/openai/gym/commit/58e6aa95e5af2c738557431f812abb81c505a7cf#commitcomment-17669277 2016-05-30 18:07:59 -07:00
Blacken the codebase (#2265) 2021-07-29 02:26:34 +02:00			`high = np.array([1.0, 1.0, self.max_speed], dtype=np.float32)`
redo black (#2272) 2021-07-29 15:39:42 -04:00			`self.action_space = spaces.Box(`
			`low=-self.max_torque, high=self.max_torque, shape=(1,), dtype=np.float32`
			`)`
Blacken the codebase (#2265) 2021-07-29 02:26:34 +02:00			`self.observation_space = spaces.Box(low=-high, high=high, dtype=np.float32)`
Switch to a global PRNG for action/observation spaces (#144) cf https://github.com/openai/gym/commit/58e6aa95e5af2c738557431f812abb81c505a7cf#commitcomment-17669277 2016-05-30 18:07:59 -07:00
fix: Invalud argument format (#1822) 2020-02-28 15:55:13 -08:00			`def step(self, u):`
Clean up pendulum environment. (#1877) 2020-04-24 23:56:04 +02:00			`th, thdot = self.state # th := theta`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00
Fix gravity constant (#1515) * Added pendulumV2 with SI units * Added pendulumV2 with SI units * Fix gravity constant * Delete pendulumV2.py * Update pendulum.py * Update pendulum.py * Update pendulum.py 2019-06-22 05:00:04 +05:30			`g = self.g`
Unified the physical constants in the constructor (#1716) Thought this was more readable. 2019-10-25 17:22:10 -04:00			`m = self.m`
			`l = self.l`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`dt = self.dt`

			`u = np.clip(u, -self.max_torque, self.max_torque)[0]`
Clean up pendulum environment. (#1877) 2020-04-24 23:56:04 +02:00			`self.last_u = u # for rendering`
Blacken the codebase (#2265) 2021-07-29 02:26:34 +02:00			`costs = angle_normalize(th) ** 2 + 0.1 * thdot ** 2 + 0.001 * (u ** 2)`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00
Pendulum updates (#2423) * Pendulum env updates Simplify the math a bit (no difference in behavior) * Reorder the clipping of angular velocity * Bump version of Pendulum * black * Update mentions of Pendulum-v0 to Pendulum-v1. 2021-09-25 20:00:28 +02:00			`newthdot = thdot + (3 * g / (2 * l) * np.sin(th) + 3.0 / (m * l ** 2) * u) * dt`
Clean up pendulum environment. (#1877) 2020-04-24 23:56:04 +02:00			`newthdot = np.clip(newthdot, -self.max_speed, self.max_speed)`
Pendulum updates (#2423) * Pendulum env updates Simplify the math a bit (no difference in behavior) * Reorder the clipping of angular velocity * Bump version of Pendulum * black * Update mentions of Pendulum-v0 to Pendulum-v1. 2021-09-25 20:00:28 +02:00			`newth = th + newthdot * dt`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00
			`self.state = np.array([newth, newthdot])`
			`return self._get_obs(), -costs, False, {}`

Add options to the signature of `env.reset` (#2515) * First find/replace, now tests * Fixes to the vector env * Make seed keyword only in wrappers * (try to) fix the bug with old environments using new wrappers (with the seed keyword) * black * Change *kwargs to options, try to make it work; black Add OrderEnforcing wrapper to wrapper exports Add a test for compatibility with old (pybullet-like) envs * Add OrderEnforcing wrapper to wrapper exports Add a test for compatibility with old (pybullet-like) envs black * Update the env checker * Update the env checker * Update the env checker to use inspect (might fail tests, let's see) * Allow the signature to include kwargs in env_checker * Minor fix 2022-01-19 23:28:59 +01:00			`def reset(self, *, seed: Optional[int] = None, options: Optional[dict] = None):`
Seeding update (#2422) * Ditch most of the seeding.py and replace np_random with the numpy default_rng. Let's see if tests pass * Updated a bunch of RNG calls from the RandomState API to Generator API * black; didn't expect that, did ya? * Undo a typo * blaaack * More typo fixes * Fixed setting/getting state in multidiscrete spaces * Fix typo, fix a test to work with the new sampling * Correctly (?) pass the randomly generated seed if np_random is called with None as seed * Convert the Discrete sample to a python int (as opposed to np.int64) * Remove some redundant imports * First version of the compatibility layer for old-style RNG. Mainly to trigger tests. * Removed redundant f-strings * Style fixes, removing unused imports * Try to make tests pass by removing atari from the dockerfile * Try to make tests pass by removing atari from the setup * Try to make tests pass by removing atari from the setup * Try to make tests pass by removing atari from the setup * First attempt at deprecating `env.seed` and supporting `env.reset(seed=seed)` instead. Tests should hopefully pass but throw up a million warnings. * black; didn't expect that, didya? * Rename the reset parameter in VecEnvs back to `seed` * Updated tests to use the new seeding method * Removed a bunch of old `seed` calls. Fixed a bug in AsyncVectorEnv * Stop Discrete envs from doing part of the setup (and using the randomness) in init (as opposed to reset) * Add explicit seed to wrappers reset * Remove an accidental return * Re-add some legacy functions with a warning. * Use deprecation instead of regular warnings for the newly deprecated methods/functions 2021-12-08 22:14:15 +01:00			`super().reset(seed=seed)`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`high = np.array([np.pi, 1])`
[WIP] add support for seeding environments (#135) * Make environments seedable * Fix monitor bugs - Set monitor_id before setting the infix. This was a bug that would yield incorrect results with multiple monitors. - Remove extra pid from stats recorder filename. This should be purely cosmetic. * Start uploading seeds in episode_batch * Fix _bigint_from_bytes for python3 * Set seed explicitly in random_agent * Pass through seed argument * Also pass through random state to spaces * Pass random state into the observation/action spaces * Make all _seed methods return the list of used seeds * Switch over to np.random where possible * Start hashing seeds, and also seed doom engine * Fixup seeding determinism in many cases * Seed before loading the ROM * Make seeding more Python3 friendly * Make the MuJoCo skipping a bit more forgiving * Remove debugging PDB calls * Make setInt argument into raw bytes * Validate and upload seeds * Skip box2d * Make seeds smaller, and change representation of seeds in upload * Handle long seeds * Fix RandomAgent example to be deterministic * Handle integer types correctly in Python2 and Python3 * Try caching pip * Try adding swap * Add df and free calls * Bump swap * Bump swap size * Try setting overcommit * Try other sysctls * Try fixing overcommit * Try just setting overcommit_memory=1 * Add explanatory comment * Add what's new section to readme * BUG: Mark ElevatorAction-ram-v0 as non-deterministic for now * Document seed * Move nondetermistic check into spec 2016-05-29 09:07:09 -07:00			`self.state = self.np_random.uniform(low=-high, high=high)`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`self.last_u = None`
			`return self._get_obs()`

			`def _get_obs(self):`
			`theta, thetadot = self.state`
Fix dtypes to be consistent with observation_space (#2340) * Changed the dtypes of classic control envs to float32 * Fixed formatting via black * Added dtype tests * Formatting, and test error message * Only test dtypes for Box space * Fix Bipedal Walker and Car Racing * Undo the car racing dtype change * Redo the car racing dtype change - set to np.float32, and updated observation_space to reflect it 2021-08-22 00:11:19 +02:00			`return np.array([np.cos(theta), np.sin(theta), thetadot], dtype=np.float32)`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00
Blacken the codebase (#2265) 2021-07-29 02:26:34 +02:00			`def render(self, mode="human"):`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`if self.viewer is None:`
Move rendering.py to utils (#2551) * Move rendering.py to utils * Rename rendering.py to pyglet_rendering.py 2022-01-06 19:01:29 +01:00			`from gym.utils import pyglet_rendering`
Blacken the codebase (#2265) 2021-07-29 02:26:34 +02:00
Move rendering.py to utils (#2551) * Move rendering.py to utils * Rename rendering.py to pyglet_rendering.py 2022-01-06 19:01:29 +01:00			`self.viewer = pyglet_rendering.Viewer(500, 500)`
Clean up pendulum environment. (#1877) 2020-04-24 23:56:04 +02:00			`self.viewer.set_bounds(-2.2, 2.2, -2.2, 2.2)`
Move rendering.py to utils (#2551) * Move rendering.py to utils * Rename rendering.py to pyglet_rendering.py 2022-01-06 19:01:29 +01:00			`rod = pyglet_rendering.make_capsule(1, 0.2)`
Blacken the codebase (#2265) 2021-07-29 02:26:34 +02:00			`rod.set_color(0.8, 0.3, 0.3)`
Move rendering.py to utils (#2551) * Move rendering.py to utils * Rename rendering.py to pyglet_rendering.py 2022-01-06 19:01:29 +01:00			`self.pole_transform = pyglet_rendering.Transform()`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`rod.add_attr(self.pole_transform)`
			`self.viewer.add_geom(rod)`
Move rendering.py to utils (#2551) * Move rendering.py to utils * Rename rendering.py to pyglet_rendering.py 2022-01-06 19:01:29 +01:00			`axle = pyglet_rendering.make_circle(0.05)`
Clean up pendulum environment. (#1877) 2020-04-24 23:56:04 +02:00			`axle.set_color(0, 0, 0)`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`self.viewer.add_geom(axle)`
			`fname = path.join(path.dirname(__file__), "assets/clockwise.png")`
Move rendering.py to utils (#2551) * Move rendering.py to utils * Rename rendering.py to pyglet_rendering.py 2022-01-06 19:01:29 +01:00			`self.img = pyglet_rendering.Image(fname, 1.0, 1.0)`
			`self.imgtrans = pyglet_rendering.Transform()`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`self.img.add_attr(self.imgtrans)`

			`self.viewer.add_onetime(self.img)`
Clean up pendulum environment. (#1877) 2020-04-24 23:56:04 +02:00			`self.pole_transform.set_rotation(self.state[0] + np.pi / 2)`
Fix pendulum rendering when sending zero control (#2307) 2021-08-20 22:22:47 -04:00			`if self.last_u is not None:`
Clean up pendulum environment. (#1877) 2020-04-24 23:56:04 +02:00			`self.imgtrans.scale = (-self.last_u / 2, np.abs(self.last_u) / 2)`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00
Blacken the codebase (#2265) 2021-07-29 02:26:34 +02:00			`return self.viewer.render(return_rgb_array=mode == "rgb_array")`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00
Cleanup, removal of unmaintained code (#836) * add dtype to Box * remove board_game, debugging, safety, parameter_tuning environments * massive set of breaking changes - remove python logging module - _step, _reset, _seed, _close => non underscored method - remove benchmark and scoring folder * Improve render("human"), now resizable, closable window. * get rid of default step and reset in wrappers, so it doesn’t silently fail for people with underscore methods * CubeCrash unit test environment * followup fixes * MemorizeDigits unit test envrionment * refactored spaces a bit fixed indentation disabled test_env_semantics * fix unit tests * fixes * CubeCrash, MemorizeDigits tested * gym backwards compatibility patch * gym backwards compatibility, followup fixes * changelist, add spaces to main namespaces * undo_logger_setup for backwards compat * remove configuration.py 2018-01-25 18:20:14 -08:00			`def close(self):`
Fix env.close() to allow re-opening window (#1155) 2018-09-14 13:36:57 -07:00			`if self.viewer:`
			`self.viewer.close()`
			`self.viewer = None`
Cleanup, removal of unmaintained code (#836) * add dtype to Box * remove board_game, debugging, safety, parameter_tuning environments * massive set of breaking changes - remove python logging module - _step, _reset, _seed, _close => non underscored method - remove benchmark and scoring folder * Improve render("human"), now resizable, closable window. * get rid of default step and reset in wrappers, so it doesn’t silently fail for people with underscore methods * CubeCrash unit test environment * followup fixes * MemorizeDigits unit test envrionment * refactored spaces a bit fixed indentation disabled test_env_semantics * fix unit tests * fixes * CubeCrash, MemorizeDigits tested * gym backwards compatibility patch * gym backwards compatibility, followup fixes * changelist, add spaces to main namespaces * undo_logger_setup for backwards compat * remove configuration.py 2018-01-25 18:20:14 -08:00
Clean up pendulum environment. (#1877) 2020-04-24 23:56:04 +02:00
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`def angle_normalize(x):`
Blacken the codebase (#2265) 2021-07-29 02:26:34 +02:00			`return ((x + np.pi) % (2 * np.pi)) - np.pi`