Gymnasium/README.md

[![pre-commit](https://img.shields.io/badge/pre--commit-enabled-brightgreen?logo=pre-commit&logoColor=white)](https://pre-commit.com/) [![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)

## Important notice

### Due to issues with the domain registration, the documentation has been moved to [https://www.gymlibrary.dev/](https://www.gymlibrary.dev/) as opposed to the old .ml address.

## Gymnasium

Gymnasium is an open source Python library for developing and comparing reinforcement learning algorithms by providing a standard API to communicate between learning algorithms and environments, as well as a standard set of environments compliant with that API. Since its release, Gym's API has become the field standard for doing this.

Gym documentation website is at [https://www.gymlibrary.dev/](https://www.gymlibrary.dev/), and you can propose fixes and changes to it [here](https://github.com/Farama-Foundation/gym-docs).

Gym also has a discord server for development purposes that you can join here: https://discord.gg/nHg2JRN489

## Installation

To install the base Gym library, use `pip install gym`.

This does not include dependencies for all families of environments (there's a massive number, and some can be problematic to install on certain systems). You can install these dependencies for one family like `pip install gym[atari]` or use `pip install gym[all]` to install all dependencies.

We support Python 3.7, 3.8, 3.9 and 3.10 on Linux and macOS. We will accept PRs related to Windows, but do not officially support it.

## API

The Gym API's API models environments as simple Python `env` classes. Creating environment instances and interacting with them is very simple- here's an example using the "CartPole-v1" environment:

```python
import gymnasium as gym
env = gym.make("CartPole-v1")
observation, info = env.reset(seed=42)

for _ in range(1000):
    action = env.action_space.sample()
    observation, reward, terminated, truncated, info = env.step(action)

    if terminated or truncated:
        observation, info = env.reset()
env.close()
```

## Notable Related Libraries

* [Stable Baselines 3](https://github.com/DLR-RM/stable-baselines3) is a learning library based on the Gym API. It is designed to cater to complete beginners in the field who want to start learning things quickly.
* [RL Baselines3 Zoo](https://github.com/DLR-RM/rl-baselines3-zoo) builds upon SB3, containing optimal hyperparameters for Gym environments as well as code to easily find new ones.
* [Tianshou](https://github.com/thu-ml/tianshou) is a learning library that's geared towards very experienced users and is design to allow for ease in complex algorithm modifications.
* [RLlib](https://docs.ray.io/en/latest/rllib/index.html) is a learning library that allows for distributed training and inference and supports an extraordinarily large number of features throughout the reinforcement learning space.
* [PettingZoo](https://github.com/Farama-Foundation/PettingZoo) is like Gym, but for environments with multiple agents.

## Environment Versioning

Gym keeps strict versioning for reproducibility reasons. All environments end in a suffix like "\_v0".  When changes are made to environments that might impact learning results, the number is increased by one to prevent potential confusion.

## MuJoCo Environments

The latest "\_v4" and future versions of the MuJoCo environments will no longer depend on `mujoco-py`. Instead `mujoco` will be the required dependency for future gymnasiumMuJoCo environment versions. Old gymnasiumMuJoCo environment versions that depend on `mujoco-py` will still be kept but unmaintained.
To install the dependencies for the latest gymnasium MuJoCo environments use `pip install gym[mujoco]`. Dependencies for old MuJoCo environments can still be installed by `pip install gym[mujoco_py]`. 

## Citation

A whitepaper from when Gym just came out is available https://arxiv.org/pdf/1606.01540, and can be cited with the following bibtex entry:

```
@misc{1606.01540,
  Author = {Greg Brockman and Vicki Cheung and Ludwig Pettersson and Jonas Schneider and John Schulman and Jie Tang and Wojciech Zaremba},
  Title = {OpenAI Gym},
  Year = {2016},
  Eprint = {arXiv:1606.01540},
}
```

## Release Notes

There used to be release notes for all the new Gym versions here. New release notes are being moved to [releases page](https://github.com/Farama-Foundation/Gymnasium/releases) on GitHub, like most other libraries do.
Improve `pre-commit` workflow (#2602) * feat: add `isort` to `pre-commit` * ci: skip `__init__.py` file for `isort` * ci: make `isort` mandatory in lint pipeline * docs: add a section on Git hooks * ci: check isort diff * fix: isort from master branch * docs: add pre-commit badge * ci: update black + bandit versions * feat: add PR template * refactor: PR template * ci: remove bandit * docs: add Black badge * ci: try to remove all `\|\| true` statements * ci: remove lint_python job - Remove `lint_python` CI job - Move `pyupgrade` job to `pre-commit` workflow * fix: avoid messing with typing * docs: add a note on running `pre-cpmmit` manually * ci: apply `pre-commit` to the whole codebase 2022-03-31 12:50:38 -07:00			`[![pre-commit](https://img.shields.io/badge/pre--commit-enabled-brightgreen?logo=pre-commit&logoColor=white)](https://pre-commit.com/) [![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)`

Update README.md (#3039) 2022-08-23 17:10:15 +02:00			`## Important notice`

			`### Due to issues with the domain registration, the documentation has been moved to [https://www.gymlibrary.dev/](https://www.gymlibrary.dev/) as opposed to the old .ml address.`

Control f chaneg of gym to gymnasium 2022-09-08 10:58:14 +01:00			`## Gymnasium`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
Control f chaneg of gym to gymnasium 2022-09-08 10:58:14 +01:00			`Gymnasium is an open source Python library for developing and comparing reinforcement learning algorithms by providing a standard API to communicate between learning algorithms and environments, as well as a standard set of environments compliant with that API. Since its release, Gym's API has become the field standard for doing this.`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
Update README.md 2022-08-22 11:05:00 -04:00			`Gym documentation website is at [https://www.gymlibrary.dev/](https://www.gymlibrary.dev/), and you can propose fixes and changes to it [here](https://github.com/Farama-Foundation/gym-docs).`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
Update README.md 2022-04-07 19:36:05 -04:00			`Gym also has a discord server for development purposes that you can join here: https://discord.gg/nHg2JRN489`

New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00			`## Installation`

			To install the base Gym library, use `pip install gym`.

typo 2021-09-14 22:13:46 -04:00			This does not include dependencies for all families of environments (there's a massive number, and some can be problematic to install on certain systems). You can install these dependencies for one family like `pip install gym[atari]` or use `pip install gym[all]` to install all dependencies.
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
3.10 support (#2493) * test again * typo 2021-11-20 11:41:27 -05:00			`We support Python 3.7, 3.8, 3.9 and 3.10 on Linux and macOS. We will accept PRs related to Windows, but do not officially support it.`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
			`## API`

typo 2021-07-29 14:07:03 -04:00			The Gym API's API models environments as simple Python `env` classes. Creating environment instances and interacting with them is very simple- here's an example using the "CartPole-v1" environment:
Update README.md (#2269) Add basic API section. Fix a couple typos toward beginning. 2021-07-29 12:14:44 -04:00
			```python
Rename to gymnasium 2022-09-08 10:10:07 +01:00			`import gymnasium as gym`
Change snippet in README to show off new API (#2643) * Changed snippet in README.md to show off new API * Added new line in snippet 2022-02-26 19:18:06 +01:00			`env = gym.make("CartPole-v1")`
Removing return_info argument to env.reset() and deprecated env.seed() function (reset now always returns info) (#2962) * removed return_info, made info dict mandatory in reset * tenatively removed deprecated seed api for environments * added more info type checks to wrapper tests * formatting/style compliance * addressed some comments * polish to address review * fixed tests after merge, and added a test of the return_info deprecation assertion if found in reset signature * some organization of env_checker tests, reverted a probably merge error * added deprecation check for seed function in env * updated docstring * removed debug prints, tweaked test_check_seed_deprecation * changed return_info deprecation check from assertion to warning * fixes to vector envs, now should be correctly structured * added some explanation and typehints for mockup depcreated return info reset function * re-removed seed function from vector envs * added explanation to _reset_return_info_type and changed the return statement 2022-08-23 11:09:54 -04:00			`observation, info = env.reset(seed=42)`
Change snippet in README to show off new API (#2643) * Changed snippet in README.md to show off new API * Added new line in snippet 2022-02-26 19:18:06 +01:00
			`for _ in range(1000):`
			`action = env.action_space.sample()`
Rename to gymnasium 2022-09-08 10:10:07 +01:00			`observation, reward, terminated, truncated, info = env.step(action)`
Change snippet in README to show off new API (#2643) * Changed snippet in README.md to show off new API * Added new line in snippet 2022-02-26 19:18:06 +01:00
Support only new step API (while retaining compatibility functions) (#3019) 2022-08-30 19:41:59 +05:30			`if terminated or truncated:`
Removing return_info argument to env.reset() and deprecated env.seed() function (reset now always returns info) (#2962) * removed return_info, made info dict mandatory in reset * tenatively removed deprecated seed api for environments * added more info type checks to wrapper tests * formatting/style compliance * addressed some comments * polish to address review * fixed tests after merge, and added a test of the return_info deprecation assertion if found in reset signature * some organization of env_checker tests, reverted a probably merge error * added deprecation check for seed function in env * updated docstring * removed debug prints, tweaked test_check_seed_deprecation * changed return_info deprecation check from assertion to warning * fixes to vector envs, now should be correctly structured * added some explanation and typehints for mockup depcreated return info reset function * re-removed seed function from vector envs * added explanation to _reset_return_info_type and changed the return statement 2022-08-23 11:09:54 -04:00			`observation, info = env.reset()`
Change snippet in README to show off new API (#2643) * Changed snippet in README.md to show off new API * Added new line in snippet 2022-02-26 19:18:06 +01:00			`env.close()`
Update README.md (#2269) Add basic API section. Fix a couple typos toward beginning. 2021-07-29 12:14:44 -04:00			```

New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00			`## Notable Related Libraries`

minor update 2022-03-16 19:55:42 -04:00			`* [Stable Baselines 3](https://github.com/DLR-RM/stable-baselines3) is a learning library based on the Gym API. It is designed to cater to complete beginners in the field who want to start learning things quickly.`
refresh notable related libraries 2022-03-16 19:52:57 -04:00			`* [RL Baselines3 Zoo](https://github.com/DLR-RM/rl-baselines3-zoo) builds upon SB3, containing optimal hyperparameters for Gym environments as well as code to easily find new ones.`
			`* [Tianshou](https://github.com/thu-ml/tianshou) is a learning library that's geared towards very experienced users and is design to allow for ease in complex algorithm modifications.`
Rename to gymnasium 2022-09-08 10:10:07 +01:00			`* [RLlib](https://docs.ray.io/en/latest/rllib/index.html) is a learning library that allows for distributed training and inference and supports an extraordinarily large number of features throughout the reinforcement learning space.`
typo 2021-10-13 23:12:46 -04:00			`* [PettingZoo](https://github.com/Farama-Foundation/PettingZoo) is like Gym, but for environments with multiple agents.`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
			`## Environment Versioning`

			`Gym keeps strict versioning for reproducibility reasons. All environments end in a suffix like "\_v0". When changes are made to environments that might impact learning results, the number is increased by one to prevent potential confusion.`

Add new MuJoCo bindings (#2762) 2022-05-24 08:47:51 -04:00			`## MuJoCo Environments`

Control f chaneg of gym to gymnasium 2022-09-08 10:58:14 +01:00			The latest "\_v4" and future versions of the MuJoCo environments will no longer depend on `mujoco-py`. Instead `mujoco` will be the required dependency for future gymnasiumMuJoCo environment versions. Old gymnasiumMuJoCo environment versions that depend on `mujoco-py` will still be kept but unmaintained.
			To install the dependencies for the latest gymnasium MuJoCo environments use `pip install gym[mujoco]`. Dependencies for old MuJoCo environments can still be installed by `pip install gym[mujoco_py]`.
Add new MuJoCo bindings (#2762) 2022-05-24 08:47:51 -04:00
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00			`## Citation`

3.10 support (#2493) * test again * typo 2021-11-20 11:41:27 -05:00			`A whitepaper from when Gym just came out is available https://arxiv.org/pdf/1606.01540, and can be cited with the following bibtex entry:`
New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
			```
			`@misc{1606.01540,`
			`Author = {Greg Brockman and Vicki Cheung and Ludwig Pettersson and Jonas Schneider and John Schulman and Jie Tang and Wojciech Zaremba},`
			`Title = {OpenAI Gym},`
			`Year = {2016},`
			`Eprint = {arXiv:1606.01540},`
			`}`
			```

			`## Release Notes`

Rename to gymnasium 2022-09-08 10:10:07 +01:00			`There used to be release notes for all the new Gym versions here. New release notes are being moved to [releases page](https://github.com/Farama-Foundation/Gymnasium/releases) on GitHub, like most other libraries do.`