Gymnasium/README.md

[![Python](https://img.shields.io/pypi/pyversions/gymnasium.svg)](https://badge.fury.io/py/gymnasium)
[![PyPI](https://badge.fury.io/py/gymnasium.svg)](https://badge.fury.io/py/gymnasium)
[![arXiv](https://img.shields.io/badge/arXiv-2407.17032-b31b1b.svg)](https://arxiv.org/abs/2407.17032)
[![pre-commit](https://img.shields.io/badge/pre--commit-enabled-brightgreen?logo=pre-commit&logoColor=white)](https://pre-commit.com/)
[![License](https://img.shields.io/github/license/Farama-Foundation/Gymnasium)](https://github.com/Farama-Foundation/Gymnasium/blob/main/LICENSE)
[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)

<p align="center">
    <a href="https://gymnasium.farama.org/" target = "_blank">
    <img src="https://raw.githubusercontent.com/Farama-Foundation/Gymnasium/main/gymnasium-text.png" width="500px" />
</a>

</p>

Gymnasium is an open source Python library for developing and comparing reinforcement learning algorithms by providing a standard API to communicate between learning algorithms and environments, as well as a standard set of environments compliant with that API. This is a fork of OpenAI's [Gym](https://github.com/openai/gym) library by its maintainers (OpenAI handed over maintenance a few years ago to an outside team), and is where future maintenance will occur going forward.

The documentation website is at [gymnasium.farama.org](https://gymnasium.farama.org), and we have a public discord server (which we also use to coordinate development work) that you can join here: https://discord.gg/bnJ6kubTg6

## Environments

Gymnasium includes the following families of environments along with a wide variety of third-party environments
* [Classic Control](https://gymnasium.farama.org/environments/classic_control/) - These are classic reinforcement learning based on real-world problems and physics.
* [Box2D](https://gymnasium.farama.org/environments/box2d/) - These environments all involve toy games based around physics control, using box2d based physics and PyGame-based rendering
* [Toy Text](https://gymnasium.farama.org/environments/toy_text/) - These environments are designed to be extremely simple, with small discrete state and action spaces, and hence easy to learn. As a result, they are suitable for debugging implementations of reinforcement learning algorithms.
* [MuJoCo](https://gymnasium.farama.org/environments/mujoco/) - A physics engine based environments with multi-joint control which are more complex than the Box2D environments.
* [Atari](https://ale.farama.org/) - Emulator of Atari 2600 ROMs simulated that have a high range of complexity for agents to learn.
* [Third-party](https://gymnasium.farama.org/environments/third_party_environments/) - A number of environments have been created that are compatible with the Gymnasium API. Be aware of the version that the software was created for and use the `apply_env_compatibility` in `gymnasium.make` if necessary.

## Installation

To install the base Gymnasium library, use `pip install gymnasium`

This does not include dependencies for all families of environments (there's a massive number, and some can be problematic to install on certain systems). You can install these dependencies for one family like `pip install "gymnasium[atari]"` or use `pip install "gymnasium[all]"` to install all dependencies.

We support and test for Python 3.10, 3.11 and 3.12 on Linux and macOS. We will accept PRs related to Windows, but do not officially support it.

## API

The Gymnasium API models environments as simple Python `env` classes. Creating environment instances and interacting with them is very simple- here's an example using the "CartPole-v1" environment:

```python
import gymnasium as gym
env = gym.make("CartPole-v1")

observation, info = env.reset(seed=42)
for _ in range(1000):
    action = env.action_space.sample()
    observation, reward, terminated, truncated, info = env.step(action)

    if terminated or truncated:
        observation, info = env.reset()
env.close()
```

## Notable Related Libraries

Please note that this is an incomplete list, and just includes libraries that the maintainers most commonly point newcomers to when asked for recommendations.

* [CleanRL](https://github.com/vwxyzjn/cleanrl) is a learning library based on the Gymnasium API. It is designed to cater to newer people in the field and provides very good reference implementations.
* [PettingZoo](https://github.com/Farama-Foundation/PettingZoo) is a multi-agent version of Gymnasium with a number of implemented environments, i.e. multi-agent Atari environments.
* The Farama Foundation also has a collection of many other [environments](https://farama.org/projects) that are maintained by the same team as Gymnasium and use the Gymnasium API.

## Environment Versioning

Gymnasium keeps strict versioning for reproducibility reasons. All environments end in a suffix like "-v0".  When changes are made to environments that might impact learning results, the number is increased by one to prevent potential confusion. These were inherited from Gym.

## Support Gymnasium's Development

If you are financially able to do so and would like to support the development of Gymnasium, please join others in the community in [donating to us](https://github.com/sponsors/Farama-Foundation).

## Citation

You can cite Gymnasium using our related paper (https://arxiv.org/abs/2407.17032) as:

```
@article{towers2024gymnasium,
  title={Gymnasium: A Standard Interface for Reinforcement Learning Environments},
  author={Towers, Mark and Kwiatkowski, Ariel and Terry, Jordan and Balis, John U and De Cola, Gianluca and Deleu, Tristan and Goul{\~a}o, Manuel and Kallinteris, Andreas and Krimmel, Markus and KG, Arjun and others},
  journal={arXiv preprint arXiv:2407.17032},
  year={2024}
}
```
Add Python versions and PyPI badges to Readme (#897) 2024-01-29 14:42:30 +00:00			`[![Python](https://img.shields.io/pypi/pyversions/gymnasium.svg)](https://badge.fury.io/py/gymnasium)`
			`[![PyPI](https://badge.fury.io/py/gymnasium.svg)](https://badge.fury.io/py/gymnasium)`
Update citation to Gymnasium Arxiv paper (#1135) 2024-08-08 11:57:26 +01:00			`[![arXiv](https://img.shields.io/badge/arXiv-2407.17032-b31b1b.svg)](https://arxiv.org/abs/2407.17032)`
Add Python versions and PyPI badges to Readme (#897) 2024-01-29 14:42:30 +00:00			`[![pre-commit](https://img.shields.io/badge/pre--commit-enabled-brightgreen?logo=pre-commit&logoColor=white)](https://pre-commit.com/)`
docs: added link and license badge (#1359) 2025-04-15 20:19:34 +03:00			`[![License](https://img.shields.io/github/license/Farama-Foundation/Gymnasium)](https://github.com/Farama-Foundation/Gymnasium/blob/main/LICENSE)`
Add Python versions and PyPI badges to Readme (#897) 2024-01-29 14:42:30 +00:00			`[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)`
Improve `pre-commit` workflow (#2602) * feat: add `isort` to `pre-commit` * ci: skip `__init__.py` file for `isort` * ci: make `isort` mandatory in lint pipeline * docs: add a section on Git hooks * ci: check isort diff * fix: isort from master branch * docs: add pre-commit badge * ci: update black + bandit versions * feat: add PR template * refactor: PR template * ci: remove bandit * docs: add Black badge * ci: try to remove all `\|\| true` statements * ci: remove lint_python job - Remove `lint_python` CI job - Move `pyupgrade` job to `pre-commit` workflow * fix: avoid messing with typing * docs: add a note on running `pre-cpmmit` manually * ci: apply `pre-commit` to the whole codebase 2022-03-31 12:50:38 -07:00
Update README.md 2022-09-12 11:54:31 -04:00			`<p align="center">`
Update README.md (#1286) 2025-01-04 02:25:35 +05:30			`<a href="https://gymnasium.farama.org/" target = "_blank">`
			`<img src="https://raw.githubusercontent.com/Farama-Foundation/Gymnasium/main/gymnasium-text.png" width="500px" />`
			`</a>`

Update README.md 2022-09-12 11:54:31 -04:00			`</p>`

Fixed typo on homepage, "it's" to "its" (#738) 2023-10-13 21:28:08 +08:00			`Gymnasium is an open source Python library for developing and comparing reinforcement learning algorithms by providing a standard API to communicate between learning algorithms and environments, as well as a standard set of environments compliant with that API. This is a fork of OpenAI's [Gym](https://github.com/openai/gym) library by its maintainers (OpenAI handed over maintenance a few years ago to an outside team), and is where future maintenance will occur going forward.`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
Change discord invite link to be different from gym (#51) 2022-10-11 17:12:54 +01:00			`The documentation website is at [gymnasium.farama.org](https://gymnasium.farama.org), and we have a public discord server (which we also use to coordinate development work) that you can join here: https://discord.gg/bnJ6kubTg6`
Update README.md 2022-04-07 19:36:05 -04:00
Update README.md 2022-10-16 22:48:04 -04:00			`## Environments`

			`Gymnasium includes the following families of environments along with a wide variety of third-party environments`
			`* [Classic Control](https://gymnasium.farama.org/environments/classic_control/) - These are classic reinforcement learning based on real-world problems and physics.`
			`* [Box2D](https://gymnasium.farama.org/environments/box2d/) - These environments all involve toy games based around physics control, using box2d based physics and PyGame-based rendering`
			`* [Toy Text](https://gymnasium.farama.org/environments/toy_text/) - These environments are designed to be extremely simple, with small discrete state and action spaces, and hence easy to learn. As a result, they are suitable for debugging implementations of reinforcement learning algorithms.`
			`* [MuJoCo](https://gymnasium.farama.org/environments/mujoco/) - A physics engine based environments with multi-joint control which are more complex than the Box2D environments.`
Fix the Atari redirect documentation (#1171) 2024-09-20 13:56:26 +01:00			`* [Atari](https://ale.farama.org/) - Emulator of Atari 2600 ROMs simulated that have a high range of complexity for agents to learn.`
Update README.md 2022-10-16 22:48:04 -04:00			* [Third-party](https://gymnasium.farama.org/environments/third_party_environments/) - A number of environments have been created that are compatible with the Gymnasium API. Be aware of the version that the software was created for and use the `apply_env_compatibility` in `gymnasium.make` if necessary.

New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00			`## Installation`

Update README.md 2022-10-13 20:21:59 -04:00			To install the base Gymnasium library, use `pip install gymnasium`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
Transfer to gymnasium https://github.com/openai/gym/pull/3126 2022-11-11 16:38:31 +00:00			This does not include dependencies for all families of environments (there's a massive number, and some can be problematic to install on certain systems). You can install these dependencies for one family like `pip install "gymnasium[atari]"` or use `pip install "gymnasium[all]"` to install all dependencies.
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
Add generic conversion wrapper between Array API compatible frameworks (#1333) 2025-05-12 00:10:06 +02:00			`We support and test for Python 3.10, 3.11 and 3.12 on Linux and macOS. We will accept PRs related to Windows, but do not officially support it.`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
			`## API`

Update README.md 2022-09-12 11:54:03 -04:00			The Gymnasium API models environments as simple Python `env` classes. Creating environment instances and interacting with them is very simple- here's an example using the "CartPole-v1" environment:
Update README.md (#2269) Add basic API section. Fix a couple typos toward beginning. 2021-07-29 12:14:44 -04:00
			```python
Rename to gymnasium 2022-09-08 10:10:07 +01:00			`import gymnasium as gym`
Change snippet in README to show off new API (#2643) * Changed snippet in README.md to show off new API * Added new line in snippet 2022-02-26 19:18:06 +01:00			`env = gym.make("CartPole-v1")`

Upstream v26.1 changes (#19) 2022-09-16 23:16:59 +01:00			`observation, info = env.reset(seed=42)`
Change snippet in README to show off new API (#2643) * Changed snippet in README.md to show off new API * Added new line in snippet 2022-02-26 19:18:06 +01:00			`for _ in range(1000):`
			`action = env.action_space.sample()`
Rename to gymnasium 2022-09-08 10:10:07 +01:00			`observation, reward, terminated, truncated, info = env.step(action)`
Change snippet in README to show off new API (#2643) * Changed snippet in README.md to show off new API * Added new line in snippet 2022-02-26 19:18:06 +01:00
Support only new step API (while retaining compatibility functions) (#3019) 2022-08-30 19:41:59 +05:30			`if terminated or truncated:`
Removing return_info argument to env.reset() and deprecated env.seed() function (reset now always returns info) (#2962) * removed return_info, made info dict mandatory in reset * tenatively removed deprecated seed api for environments * added more info type checks to wrapper tests * formatting/style compliance * addressed some comments * polish to address review * fixed tests after merge, and added a test of the return_info deprecation assertion if found in reset signature * some organization of env_checker tests, reverted a probably merge error * added deprecation check for seed function in env * updated docstring * removed debug prints, tweaked test_check_seed_deprecation * changed return_info deprecation check from assertion to warning * fixes to vector envs, now should be correctly structured * added some explanation and typehints for mockup depcreated return info reset function * re-removed seed function from vector envs * added explanation to _reset_return_info_type and changed the return statement 2022-08-23 11:09:54 -04:00			`observation, info = env.reset()`
Change snippet in README to show off new API (#2643) * Changed snippet in README.md to show off new API * Added new line in snippet 2022-02-26 19:18:06 +01:00			`env.close()`
Update README.md (#2269) Add basic API section. Fix a couple typos toward beginning. 2021-07-29 12:14:44 -04:00			```

New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00			`## Notable Related Libraries`

Update citation to Gymnasium Arxiv paper (#1135) 2024-08-08 11:57:26 +01:00			`Please note that this is an incomplete list, and just includes libraries that the maintainers most commonly point newcomers to when asked for recommendations.`
Update README.md 2022-10-03 15:46:30 -04:00
Update README.md 2022-10-27 15:48:16 +01:00			`* [CleanRL](https://github.com/vwxyzjn/cleanrl) is a learning library based on the Gymnasium API. It is designed to cater to newer people in the field and provides very good reference implementations.`
Fix typos in README.md (#184) 2022-12-05 16:40:31 +02:00			`* [PettingZoo](https://github.com/Farama-Foundation/PettingZoo) is a multi-agent version of Gymnasium with a number of implemented environments, i.e. multi-agent Atari environments.`
Update README.md 2023-01-10 13:50:34 -05:00			`* The Farama Foundation also has a collection of many other [environments](https://farama.org/projects) that are maintained by the same team as Gymnasium and use the Gymnasium API.`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
			`## Environment Versioning`

Update citation to Gymnasium Arxiv paper (#1135) 2024-08-08 11:57:26 +01:00			`Gymnasium keeps strict versioning for reproducibility reasons. All environments end in a suffix like "-v0". When changes are made to environments that might impact learning results, the number is increased by one to prevent potential confusion. These were inherited from Gym.`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
Update README.md 2023-01-22 19:49:14 -05:00			`## Support Gymnasium's Development`
Update README.md 2023-01-22 19:48:51 -05:00
			`If you are financially able to do so and would like to support the development of Gymnasium, please join others in the community in [donating to us](https://github.com/sponsors/Farama-Foundation).`
Update information about citing Gymnasium (#591) 2023-07-09 23:49:00 +02:00
			`## Citation`

Update citation to Gymnasium Arxiv paper (#1135) 2024-08-08 11:57:26 +01:00			`You can cite Gymnasium using our related paper (https://arxiv.org/abs/2407.17032) as:`
Update information about citing Gymnasium (#591) 2023-07-09 23:49:00 +02:00
			```
Update ArXiV citation 2024-10-21 09:45:38 +01:00			`@article{towers2024gymnasium,`
			`title={Gymnasium: A Standard Interface for Reinforcement Learning Environments},`
			`author={Towers, Mark and Kwiatkowski, Ariel and Terry, Jordan and Balis, John U and De Cola, Gianluca and Deleu, Tristan and Goul{\~a}o, Manuel and Kallinteris, Andreas and Krimmel, Markus and KG, Arjun and others},`
			`journal={arXiv preprint arXiv:2407.17032},`
			`year={2024}`
Update information about citing Gymnasium (#591) 2023-07-09 23:49:00 +02:00			`}`
			```