Gymnasium/tests/functional/test_functional.py

"""Tests the functional api."""

from __future__ import annotations

from typing import Any

import numpy as np

from gymnasium.experimental.functional import FuncEnv


class GenericTestFuncEnv(FuncEnv):
    """Generic testing functional environment."""

    def __init__(self, options: dict[str, Any] | None = None):
        """Constructor that allows generic options to be set on the environment."""
        super().__init__(options)

    def initial(self, rng: Any, params=None) -> np.ndarray:
        """Testing initial function."""
        return np.array([0, 0], dtype=np.float32)

    def observation(self, state: np.ndarray, rng: Any, params=None) -> np.ndarray:
        """Testing observation function."""
        return state

    def transition(
        self, state: np.ndarray, action: int, rng: None, params=None
    ) -> np.ndarray:
        """Testing transition function."""
        return state + np.array([0, action], dtype=np.float32)

    def reward(
        self,
        state: np.ndarray,
        action: int,
        next_state: np.ndarray,
        rng: Any,
        params=None,
    ) -> float:
        """Testing reward function."""
        return 1.0 if next_state[1] > 0 else 0.0

    def terminal(self, state: np.ndarray, rng: Any, params=None) -> bool:
        """Testing terminal function."""
        return state[1] > 0


def test_functional_api():
    """Tests the core functional api specification using a generic testing environment."""
    env = GenericTestFuncEnv()

    state = env.initial(None)

    obs = env.observation(state, None)

    assert state.shape == (2,)
    assert state.dtype == np.float32
    assert obs.shape == (2,)
    assert obs.dtype == np.float32
    assert np.allclose(obs, state)

    actions = [-1, -2, -5, 3, 5, 2]
    for i, action in enumerate(actions):
        next_state = env.transition(state, action, None)
        assert next_state.shape == (2,)
        assert next_state.dtype == np.float32
        assert np.allclose(next_state, state + np.array([0, action]))

        observation = env.observation(next_state, None)
        assert observation.shape == (2,)
        assert observation.dtype == np.float32
        assert np.allclose(observation, next_state)

        reward = env.reward(state, action, next_state, None)
        assert reward == (1.0 if next_state[1] > 0 else 0.0)

        terminal = env.terminal(next_state, None)
        assert terminal == (i == 5)  # terminal state is in the final action

        state = next_state
Add wrappers to experimental (#201) 2022-12-10 22:04:14 +00:00			`"""Tests the functional api."""`
Pre commit autoupdate (#1082) 2024-06-10 17:07:47 +01:00
Add wrappers to experimental (#201) 2022-12-10 22:04:14 +00:00			`from __future__ import annotations`

			`from typing import Any`
Functional API and proof-of-concept jax classic-control envs (#25) (#145) 2022-11-18 22:25:33 +01:00
			`import numpy as np`

Re-add functional api to experimental (#1145) 2024-08-15 14:49:05 +01:00			`from gymnasium.experimental.functional import FuncEnv`
Functional API and proof-of-concept jax classic-control envs (#25) (#145) 2022-11-18 22:25:33 +01:00

Move dev_wrappers and functional to experimental (#159) 2022-11-29 23:37:53 +00:00			`class GenericTestFuncEnv(FuncEnv):`
Add wrappers to experimental (#201) 2022-12-10 22:04:14 +00:00			`"""Generic testing functional environment."""`

			`def __init__(self, options: dict[str, Any] \| None = None):`
			`"""Constructor that allows generic options to be set on the environment."""`
Functional API and proof-of-concept jax classic-control envs (#25) (#145) 2022-11-18 22:25:33 +01:00			`super().__init__(options)`

Re-add functional api to experimental (#1145) 2024-08-15 14:49:05 +01:00			`def initial(self, rng: Any, params=None) -> np.ndarray:`
Add wrappers to experimental (#201) 2022-12-10 22:04:14 +00:00			`"""Testing initial function."""`
Functional API and proof-of-concept jax classic-control envs (#25) (#145) 2022-11-18 22:25:33 +01:00			`return np.array([0, 0], dtype=np.float32)`

Re-add functional api to experimental (#1145) 2024-08-15 14:49:05 +01:00			`def observation(self, state: np.ndarray, rng: Any, params=None) -> np.ndarray:`
Add wrappers to experimental (#201) 2022-12-10 22:04:14 +00:00			`"""Testing observation function."""`
Functional API and proof-of-concept jax classic-control envs (#25) (#145) 2022-11-18 22:25:33 +01:00			`return state`

Re-add functional api to experimental (#1145) 2024-08-15 14:49:05 +01:00			`def transition(`
			`self, state: np.ndarray, action: int, rng: None, params=None`
			`) -> np.ndarray:`
Add wrappers to experimental (#201) 2022-12-10 22:04:14 +00:00			`"""Testing transition function."""`
Functional API and proof-of-concept jax classic-control envs (#25) (#145) 2022-11-18 22:25:33 +01:00			`return state + np.array([0, action], dtype=np.float32)`

Add `params` and `rng` argument to all `FuncEnv` member functions (#900) Co-authored-by: Mark Towers <mark.m.towers@gmail.com> Co-authored-by: Pratik Ingle <prin@itu.dk> Co-authored-by: Jose Antonio Martin H <ja.martin.h@repsol.com> Co-authored-by: Oli <ollihaus@t-online.de> Co-authored-by: Jared Swift <j.w.swift@outlook.com> Co-authored-by: Tim Schneider <mail@tim-schneider.me> Co-authored-by: Tim Schneider <tim@robot-learning.de> Co-authored-by: Tim Schneider <tim.schneider94@t-online.de> Co-authored-by: Manuel Goulão <msilvagoulao@gmail.com> Co-authored-by: Michael Panchenko <35432522+MischaPanch@users.noreply.github.com> Co-authored-by: TobiasKallehauge <tkal@es.aau.dk> Co-authored-by: Ariel Kwiatkowski <ariel.j.kwiatkowski@gmail.com> Co-authored-by: James Mochizuki-Freeman <jameymmf@gmail.com> 2024-06-07 20:16:38 +00:00			`def reward(`
Re-add functional api to experimental (#1145) 2024-08-15 14:49:05 +01:00			`self,`
			`state: np.ndarray,`
			`action: int,`
			`next_state: np.ndarray,`
			`rng: Any,`
			`params=None,`
Add `params` and `rng` argument to all `FuncEnv` member functions (#900) Co-authored-by: Mark Towers <mark.m.towers@gmail.com> Co-authored-by: Pratik Ingle <prin@itu.dk> Co-authored-by: Jose Antonio Martin H <ja.martin.h@repsol.com> Co-authored-by: Oli <ollihaus@t-online.de> Co-authored-by: Jared Swift <j.w.swift@outlook.com> Co-authored-by: Tim Schneider <mail@tim-schneider.me> Co-authored-by: Tim Schneider <tim@robot-learning.de> Co-authored-by: Tim Schneider <tim.schneider94@t-online.de> Co-authored-by: Manuel Goulão <msilvagoulao@gmail.com> Co-authored-by: Michael Panchenko <35432522+MischaPanch@users.noreply.github.com> Co-authored-by: TobiasKallehauge <tkal@es.aau.dk> Co-authored-by: Ariel Kwiatkowski <ariel.j.kwiatkowski@gmail.com> Co-authored-by: James Mochizuki-Freeman <jameymmf@gmail.com> 2024-06-07 20:16:38 +00:00			`) -> float:`
Add wrappers to experimental (#201) 2022-12-10 22:04:14 +00:00			`"""Testing reward function."""`
Functional API and proof-of-concept jax classic-control envs (#25) (#145) 2022-11-18 22:25:33 +01:00			`return 1.0 if next_state[1] > 0 else 0.0`

Re-add functional api to experimental (#1145) 2024-08-15 14:49:05 +01:00			`def terminal(self, state: np.ndarray, rng: Any, params=None) -> bool:`
Add wrappers to experimental (#201) 2022-12-10 22:04:14 +00:00			`"""Testing terminal function."""`
Functional API and proof-of-concept jax classic-control envs (#25) (#145) 2022-11-18 22:25:33 +01:00			`return state[1] > 0`


Add wrappers to experimental (#201) 2022-12-10 22:04:14 +00:00			`def test_functional_api():`
			`"""Tests the core functional api specification using a generic testing environment."""`
Move dev_wrappers and functional to experimental (#159) 2022-11-29 23:37:53 +00:00			`env = GenericTestFuncEnv()`
Add wrappers to experimental (#201) 2022-12-10 22:04:14 +00:00
Functional API and proof-of-concept jax classic-control envs (#25) (#145) 2022-11-18 22:25:33 +01:00			`state = env.initial(None)`
Add wrappers to experimental (#201) 2022-12-10 22:04:14 +00:00
Add `params` and `rng` argument to all `FuncEnv` member functions (#900) Co-authored-by: Mark Towers <mark.m.towers@gmail.com> Co-authored-by: Pratik Ingle <prin@itu.dk> Co-authored-by: Jose Antonio Martin H <ja.martin.h@repsol.com> Co-authored-by: Oli <ollihaus@t-online.de> Co-authored-by: Jared Swift <j.w.swift@outlook.com> Co-authored-by: Tim Schneider <mail@tim-schneider.me> Co-authored-by: Tim Schneider <tim@robot-learning.de> Co-authored-by: Tim Schneider <tim.schneider94@t-online.de> Co-authored-by: Manuel Goulão <msilvagoulao@gmail.com> Co-authored-by: Michael Panchenko <35432522+MischaPanch@users.noreply.github.com> Co-authored-by: TobiasKallehauge <tkal@es.aau.dk> Co-authored-by: Ariel Kwiatkowski <ariel.j.kwiatkowski@gmail.com> Co-authored-by: James Mochizuki-Freeman <jameymmf@gmail.com> 2024-06-07 20:16:38 +00:00			`obs = env.observation(state, None)`
Add wrappers to experimental (#201) 2022-12-10 22:04:14 +00:00
Functional API and proof-of-concept jax classic-control envs (#25) (#145) 2022-11-18 22:25:33 +01:00			`assert state.shape == (2,)`
			`assert state.dtype == np.float32`
			`assert obs.shape == (2,)`
			`assert obs.dtype == np.float32`
			`assert np.allclose(obs, state)`

			`actions = [-1, -2, -5, 3, 5, 2]`
			`for i, action in enumerate(actions):`
			`next_state = env.transition(state, action, None)`
			`assert next_state.shape == (2,)`
			`assert next_state.dtype == np.float32`
			`assert np.allclose(next_state, state + np.array([0, action]))`

Add `params` and `rng` argument to all `FuncEnv` member functions (#900) Co-authored-by: Mark Towers <mark.m.towers@gmail.com> Co-authored-by: Pratik Ingle <prin@itu.dk> Co-authored-by: Jose Antonio Martin H <ja.martin.h@repsol.com> Co-authored-by: Oli <ollihaus@t-online.de> Co-authored-by: Jared Swift <j.w.swift@outlook.com> Co-authored-by: Tim Schneider <mail@tim-schneider.me> Co-authored-by: Tim Schneider <tim@robot-learning.de> Co-authored-by: Tim Schneider <tim.schneider94@t-online.de> Co-authored-by: Manuel Goulão <msilvagoulao@gmail.com> Co-authored-by: Michael Panchenko <35432522+MischaPanch@users.noreply.github.com> Co-authored-by: TobiasKallehauge <tkal@es.aau.dk> Co-authored-by: Ariel Kwiatkowski <ariel.j.kwiatkowski@gmail.com> Co-authored-by: James Mochizuki-Freeman <jameymmf@gmail.com> 2024-06-07 20:16:38 +00:00			`observation = env.observation(next_state, None)`
Functional API and proof-of-concept jax classic-control envs (#25) (#145) 2022-11-18 22:25:33 +01:00			`assert observation.shape == (2,)`
			`assert observation.dtype == np.float32`
			`assert np.allclose(observation, next_state)`

Add `params` and `rng` argument to all `FuncEnv` member functions (#900) Co-authored-by: Mark Towers <mark.m.towers@gmail.com> Co-authored-by: Pratik Ingle <prin@itu.dk> Co-authored-by: Jose Antonio Martin H <ja.martin.h@repsol.com> Co-authored-by: Oli <ollihaus@t-online.de> Co-authored-by: Jared Swift <j.w.swift@outlook.com> Co-authored-by: Tim Schneider <mail@tim-schneider.me> Co-authored-by: Tim Schneider <tim@robot-learning.de> Co-authored-by: Tim Schneider <tim.schneider94@t-online.de> Co-authored-by: Manuel Goulão <msilvagoulao@gmail.com> Co-authored-by: Michael Panchenko <35432522+MischaPanch@users.noreply.github.com> Co-authored-by: TobiasKallehauge <tkal@es.aau.dk> Co-authored-by: Ariel Kwiatkowski <ariel.j.kwiatkowski@gmail.com> Co-authored-by: James Mochizuki-Freeman <jameymmf@gmail.com> 2024-06-07 20:16:38 +00:00			`reward = env.reward(state, action, next_state, None)`
Functional API and proof-of-concept jax classic-control envs (#25) (#145) 2022-11-18 22:25:33 +01:00			`assert reward == (1.0 if next_state[1] > 0 else 0.0)`

Add `params` and `rng` argument to all `FuncEnv` member functions (#900) Co-authored-by: Mark Towers <mark.m.towers@gmail.com> Co-authored-by: Pratik Ingle <prin@itu.dk> Co-authored-by: Jose Antonio Martin H <ja.martin.h@repsol.com> Co-authored-by: Oli <ollihaus@t-online.de> Co-authored-by: Jared Swift <j.w.swift@outlook.com> Co-authored-by: Tim Schneider <mail@tim-schneider.me> Co-authored-by: Tim Schneider <tim@robot-learning.de> Co-authored-by: Tim Schneider <tim.schneider94@t-online.de> Co-authored-by: Manuel Goulão <msilvagoulao@gmail.com> Co-authored-by: Michael Panchenko <35432522+MischaPanch@users.noreply.github.com> Co-authored-by: TobiasKallehauge <tkal@es.aau.dk> Co-authored-by: Ariel Kwiatkowski <ariel.j.kwiatkowski@gmail.com> Co-authored-by: James Mochizuki-Freeman <jameymmf@gmail.com> 2024-06-07 20:16:38 +00:00			`terminal = env.terminal(next_state, None)`
Functional API and proof-of-concept jax classic-control envs (#25) (#145) 2022-11-18 22:25:33 +01:00			`assert terminal == (i == 5) # terminal state is in the final action`

			`state = next_state`