Gymnasium/README.md

[![pre-commit](https://img.shields.io/badge/pre--commit-enabled-brightgreen?logo=pre-commit&logoColor=white)](https://pre-commit.com/) [![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)

<p align="center">
    <img src="gymnasium-text.png" width="500px"/>
</p>


Gymnasium is an open source Python library for developing and comparing reinforcement learning algorithms by providing a standard API to communicate between learning algorithms and environments, as well as a standard set of environments compliant with that API. This is a fork of OpenAI's [Gym](https://github.com/openai/gym) library by the maintainers (OpenAI handed over maintenance a few years ago to an outside team), and is where future maintenance will occur going forward

The documentation website is at [gymnasium.farama.org](https://gymnasium.farama.org), and we have a public discord server (which we also use to coordinate development work) that you can join here: https://discord.gg/bnJ6kubTg6

## Installation

To install the base Gymnasium library, use `pip install gymnasium`

This does not include dependencies for all families of environments (there's a massive number, and some can be problematic to install on certain systems). You can install these dependencies for one family like `pip install gymnasium[atari]` or use `pip install gymnasium[all]` to install all dependencies.

We support and test for Python 3.7, 3.8, 3.9 and 3.10 on Linux and macOS. We will accept PRs related to Windows, but do not officially support it.

## API

The Gymnasium API models environments as simple Python `env` classes. Creating environment instances and interacting with them is very simple- here's an example using the "CartPole-v1" environment:

```python
import gymnasium as gym
env = gym.make("CartPole-v1")

observation, info = env.reset(seed=42)
for _ in range(1000):
    action = env.action_space.sample()
    observation, reward, terminated, truncated, info = env.step(action)

    if terminated or truncated:
        observation, info = env.reset()
env.close()
```

## Notable Related Libraries

Please note that this is an incomplete list, and just includes libraries that the maintainers most commonly point newcommers to when asked for recommendations.

* [CleanRL](https://github.com/vwxyzjn/cleanrl) is a learning library based on the Gym API. It is designed to cater to newer people in the field and provides very good reference implementations.
* [Tianshou](https://github.com/thu-ml/tianshou) is a learning library that's geared towards very experienced users and is design to allow for ease in complex algorithm modifications.
* [RLlib](https://docs.ray.io/en/latest/rllib/index.html) is a learning library that allows for distributed training and inferencing and supports an extraordinarily large number of features throughout the reinforcement learning space.
* [PettingZoo](https://github.com/Farama-Foundation/PettingZoo) is like Gym, but for environments with multiple agents.


## Environment Versioning

Gymnasium keeps strict versioning for reproducibility reasons. All environments end in a suffix like "\_v0".  When changes are made to environments that might impact learning results, the number is increased by one to prevent potential confusion. These inherent from Gym.


## Development Roadmap

We have a roadmap for future development work for Gymnasium available here: https://github.com/Farama-Foundation/Gymnasium/issues/12
Improve `pre-commit` workflow (#2602) * feat: add `isort` to `pre-commit` * ci: skip `__init__.py` file for `isort` * ci: make `isort` mandatory in lint pipeline * docs: add a section on Git hooks * ci: check isort diff * fix: isort from master branch * docs: add pre-commit badge * ci: update black + bandit versions * feat: add PR template * refactor: PR template * ci: remove bandit * docs: add Black badge * ci: try to remove all `\|\| true` statements * ci: remove lint_python job - Remove `lint_python` CI job - Move `pyupgrade` job to `pre-commit` workflow * fix: avoid messing with typing * docs: add a note on running `pre-cpmmit` manually * ci: apply `pre-commit` to the whole codebase 2022-03-31 12:50:38 -07:00			`[![pre-commit](https://img.shields.io/badge/pre--commit-enabled-brightgreen?logo=pre-commit&logoColor=white)](https://pre-commit.com/) [![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)`

Update README.md 2022-09-12 11:54:31 -04:00			`<p align="center">`
Update README.md 2022-10-07 11:00:25 -04:00			`<img src="gymnasium-text.png" width="500px"/>`
Update README.md 2022-09-12 11:54:31 -04:00			`</p>`

Update README.md 2022-10-07 11:00:25 -04:00
Update README.md 2022-10-13 20:23:24 -04:00			`Gymnasium is an open source Python library for developing and comparing reinforcement learning algorithms by providing a standard API to communicate between learning algorithms and environments, as well as a standard set of environments compliant with that API. This is a fork of OpenAI's [Gym](https://github.com/openai/gym) library by the maintainers (OpenAI handed over maintenance a few years ago to an outside team), and is where future maintenance will occur going forward`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
Change discord invite link to be different from gym (#51) 2022-10-11 17:12:54 +01:00			`The documentation website is at [gymnasium.farama.org](https://gymnasium.farama.org), and we have a public discord server (which we also use to coordinate development work) that you can join here: https://discord.gg/bnJ6kubTg6`
Update README.md 2022-04-07 19:36:05 -04:00
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00			`## Installation`

Update README.md 2022-10-13 20:21:59 -04:00			To install the base Gymnasium library, use `pip install gymnasium`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
Update README.md 2022-09-13 19:20:44 -04:00			This does not include dependencies for all families of environments (there's a massive number, and some can be problematic to install on certain systems). You can install these dependencies for one family like `pip install gymnasium[atari]` or use `pip install gymnasium[all]` to install all dependencies.
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
Update README.md 2022-09-12 11:54:03 -04:00			`We support and test for Python 3.7, 3.8, 3.9 and 3.10 on Linux and macOS. We will accept PRs related to Windows, but do not officially support it.`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
			`## API`

Update README.md 2022-09-12 11:54:03 -04:00			The Gymnasium API models environments as simple Python `env` classes. Creating environment instances and interacting with them is very simple- here's an example using the "CartPole-v1" environment:
Update README.md (#2269) Add basic API section. Fix a couple typos toward beginning. 2021-07-29 12:14:44 -04:00
			```python
Rename to gymnasium 2022-09-08 10:10:07 +01:00			`import gymnasium as gym`
Change snippet in README to show off new API (#2643) * Changed snippet in README.md to show off new API * Added new line in snippet 2022-02-26 19:18:06 +01:00			`env = gym.make("CartPole-v1")`

Upstream v26.1 changes (#19) 2022-09-16 23:16:59 +01:00			`observation, info = env.reset(seed=42)`
Change snippet in README to show off new API (#2643) * Changed snippet in README.md to show off new API * Added new line in snippet 2022-02-26 19:18:06 +01:00			`for _ in range(1000):`
			`action = env.action_space.sample()`
Rename to gymnasium 2022-09-08 10:10:07 +01:00			`observation, reward, terminated, truncated, info = env.step(action)`
Change snippet in README to show off new API (#2643) * Changed snippet in README.md to show off new API * Added new line in snippet 2022-02-26 19:18:06 +01:00
Support only new step API (while retaining compatibility functions) (#3019) 2022-08-30 19:41:59 +05:30			`if terminated or truncated:`
Removing return_info argument to env.reset() and deprecated env.seed() function (reset now always returns info) (#2962) * removed return_info, made info dict mandatory in reset * tenatively removed deprecated seed api for environments * added more info type checks to wrapper tests * formatting/style compliance * addressed some comments * polish to address review * fixed tests after merge, and added a test of the return_info deprecation assertion if found in reset signature * some organization of env_checker tests, reverted a probably merge error * added deprecation check for seed function in env * updated docstring * removed debug prints, tweaked test_check_seed_deprecation * changed return_info deprecation check from assertion to warning * fixes to vector envs, now should be correctly structured * added some explanation and typehints for mockup depcreated return info reset function * re-removed seed function from vector envs * added explanation to _reset_return_info_type and changed the return statement 2022-08-23 11:09:54 -04:00			`observation, info = env.reset()`
Change snippet in README to show off new API (#2643) * Changed snippet in README.md to show off new API * Added new line in snippet 2022-02-26 19:18:06 +01:00			`env.close()`
Update README.md (#2269) Add basic API section. Fix a couple typos toward beginning. 2021-07-29 12:14:44 -04:00			```

New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00			`## Notable Related Libraries`

Update README.md 2022-10-03 15:46:30 -04:00			`Please note that this is an incomplete list, and just includes libraries that the maintainers most commonly point newcommers to when asked for recommendations.`

			`* [CleanRL](https://github.com/vwxyzjn/cleanrl) is a learning library based on the Gym API. It is designed to cater to newer people in the field and provides very good reference implementations.`
refresh notable related libraries 2022-03-16 19:52:57 -04:00			`* [Tianshou](https://github.com/thu-ml/tianshou) is a learning library that's geared towards very experienced users and is design to allow for ease in complex algorithm modifications.`
Update README.md 2022-10-03 15:46:30 -04:00			`* [RLlib](https://docs.ray.io/en/latest/rllib/index.html) is a learning library that allows for distributed training and inferencing and supports an extraordinarily large number of features throughout the reinforcement learning space.`
			`* [PettingZoo](https://github.com/Farama-Foundation/PettingZoo) is like Gym, but for environments with multiple agents.`

New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
			`## Environment Versioning`

Update README.md 2022-09-12 11:54:03 -04:00			`Gymnasium keeps strict versioning for reproducibility reasons. All environments end in a suffix like "\_v0". When changes are made to environments that might impact learning results, the number is increased by one to prevent potential confusion. These inherent from Gym.`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
Add new MuJoCo bindings (#2762) 2022-05-24 08:47:51 -04:00
Update README.md 2022-09-12 11:54:03 -04:00			`## Development Roadmap`
Add new MuJoCo bindings (#2762) 2022-05-24 08:47:51 -04:00
Update README.md 2022-10-13 20:20:03 -04:00			`We have a roadmap for future development work for Gymnasium available here: https://github.com/Farama-Foundation/Gymnasium/issues/12`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00