Files
Gymnasium/docs/third_party_environments.md

325 lines
15 KiB
Markdown
Raw Normal View History

2021-09-23 17:02:09 -04:00
# Third Party Environments
2021-09-23 17:02:09 -04:00
# Video Games environments
## Procgen
https://github.com/openai/procgen
2019-12-13 22:32:01 -08:00
16 simple-to-use procedurally-generated gym environments which provide a direct measure of how quickly a reinforcement learning agent learns generalizable skills. The environments run at high speed (thousands of steps per second) on a single core.
2021-09-23 17:02:09 -04:00
## SlimeVolleyGym: A simple environment for single and multi-agent reinforcement learning
2019-12-13 22:32:01 -08:00
2021-09-23 17:02:09 -04:00
https://github.com/hardmaru/slimevolleygym
2021-09-23 17:02:09 -04:00
A simple environment for benchmarking single and multi-agent reinforcement learning algorithms on a clone of Slime Volleyball game. Only dependencies are gym and numpy. Both state and pixel observation environments are available. The motivation of this environment is to easily enable trained agents to play against each other, and also facilitate the training of agents directly in a multi-agent setting, thus adding an extra dimension for evaluating an agent's performance.
2021-09-23 17:02:09 -04:00
## stable-retro
2021-09-23 17:02:09 -04:00
https://github.com/MatPoliquin/stable-retro
2021-09-23 17:02:09 -04:00
Supported fork of gym-retro with additional games, states, scenarios, etc. Open to PRs of additional games, features and plateforms since gym-retro is no longer maintained
2021-09-23 17:02:09 -04:00
## gym-derk: GPU accelerated MOBA environment
2021-09-23 17:02:09 -04:00
https://gym.derkgame.com
2021-09-23 17:02:09 -04:00
This is a 3v3 MOBA environment where you train creatures to fight each other. It runs entirely on the GPU so you can easily have hundreds of instances running in parallel. There are around 15 items for the creatures, 60 "senses", 5 actions, and roughly 23 tweakable rewards. It's also possible to benchmark an agent against other agents online. It's available for free for training for personal use, and otherwise costs money; see licensing details on the website
2021-09-23 17:02:09 -04:00
## Unity ML Agents
2021-09-23 17:02:09 -04:00
https://github.com/Unity-Technologies/ml-agents
Gym wrappers for arbitrary and premade environments with the Unity game engine.
2021-09-23 17:02:09 -04:00
# Robotics Environments
2021-09-23 17:02:09 -04:00
## PyBullet Robotics Environments
2021-09-23 17:02:09 -04:00
Docs: https://docs.google.com/document/d/10sXEhzFRSnvFcl3XxNGhnD4N2SedqwdAvK3dsihxVUA/edit#heading=h.wz5to0x8kqmr
2019-04-26 15:11:40 -07:00
3D physics environments like the Mujoco environments but uses the Bullet physics engine and does not require a commercial license. Works on Mac/Linux/Windows.
2021-09-23 17:02:09 -04:00
## gym-gazebo
2019-04-26 15:11:40 -07:00
2021-09-23 17:02:09 -04:00
https://github.com/erlerobot/gym-gazebo/
2021-09-23 17:02:09 -04:00
gym-gazebo presents an extension of the initial OpenAI gym for robotics using ROS and Gazebo, an advanced 3D modeling and
rendering tool.
2021-09-23 17:02:09 -04:00
## GymFC: A flight control tuning and training framework
2021-09-23 17:02:09 -04:00
https://github.com/wil3/gymfc/
2021-09-23 17:02:09 -04:00
GymFC is a modular framework for synthesizing neuro-flight controllers. The
architecture integrates digital twinning concepts to provide seamless transfer
of trained policies to hardware. The OpenAI environment has been used to
generate policies for the worlds first open source neural network flight
control firmware [Neuroflight](https://github.com/wil3/neuroflight).
2016-05-03 16:42:52 -04:00
2021-09-23 17:02:09 -04:00
## gym-jiminy: training Robots in Jiminy
2021-09-23 17:02:09 -04:00
https://github.com/Wandercraft/jiminy
2021-09-23 17:02:09 -04:00
gym-jiminy presents an extension of the initial OpenAI gym for robotics using Jiminy, an extremely fast and light weight simulator for poly-articulated systems using Pinocchio for physics evaluation and Meshcat for web-based 3D rendering.
2021-09-23 17:02:09 -04:00
## gym-goddard: Goddard's Rocket Problem
2021-09-23 17:02:09 -04:00
https://github.com/osannolik/gym-goddard
2016-09-23 10:59:41 +02:00
2021-09-23 17:02:09 -04:00
An environment for simulating the classical optimal control problem where the thrust of a vertically ascending rocket shall be determined such that it reaches the maximum possible altitude, while being subject to varying aerodynamic drag, gravity and mass.
2016-09-23 10:59:41 +02:00
2021-09-23 17:02:09 -04:00
## gym-pybullet-drones
2016-09-23 10:59:41 +02:00
2021-09-23 17:02:09 -04:00
https://github.com/JacopoPan/gym-pybullet-drones
2021-09-23 17:02:09 -04:00
A simple environment using [PyBullet](https://github.com/bulletphysics/bullet3) to simulate the dynamics of a [Bitcraze Crazyflie 2.x](https://www.bitcraze.io/documentation/hardware/crazyflie_2_1/crazyflie_2_1-datasheet.pdf) nanoquadrotor
2021-09-23 17:02:09 -04:00
## Offworld-gym
2017-12-25 15:08:38 -05:00
2021-09-23 17:02:09 -04:00
https://github.com/offworld-projects/offworld-gym
2021-09-23 17:02:09 -04:00
Gym environments that let you control physics robotics in a laboratory via the internet.
2021-09-23 17:02:09 -04:00
## MarsExplorer: Deep Reinforcement Learning for Extraterrestrial Exploration
2021-09-23 17:02:09 -04:00
https://github.com/dimikout3/MarsExplorer
2017-12-25 15:08:38 -05:00
2021-09-23 17:02:09 -04:00
Mars Explorer is an openai-gym compatible environment designed and developed as an initial endeavor to bridge the gap between powerful Deep Reinforcement Learning methodologies and the problem of exploration/coverage of an unknown terrain.
2017-12-25 15:08:38 -05:00
2021-09-23 17:02:09 -04:00
## robo-gym: Environments for Real and Simulated Robots
2021-09-23 17:02:09 -04:00
https://github.com/jr-robotics/robo-gym
2021-09-23 17:02:09 -04:00
robo-gym provides a collection of reinforcement learning environments involving robotic tasks applicable in both simulation and real world robotics.
2021-09-23 17:02:09 -04:00
# Classic Environments (board, card, etc. games)
2021-09-23 17:02:09 -04:00
## gym-spoof
2021-09-23 17:02:09 -04:00
https://github.com/MouseAndKeyboard/gym-spoof
2021-09-23 17:02:09 -04:00
Spoof, otherwise known as "The 3-coin game", is a multi-agent (2 player), imperfect-information, zero-sum game.
2018-05-09 11:21:49 -04:00
2021-09-23 17:02:09 -04:00
## gym-xiangqi: Xiangqi - The Chinese Chess Game
2018-05-09 11:21:49 -04:00
2021-09-23 17:02:09 -04:00
https://github.com/tanliyon/gym-xiangqi
2018-05-09 11:21:49 -04:00
2021-09-23 17:02:09 -04:00
A reinforcement learning environment of Xiangqi, the Chinese Chess game.
2019-07-26 12:44:33 -04:00
2021-09-23 17:02:09 -04:00
## gym-abalone: A two-player abstract strategy board game
2019-07-26 12:44:33 -04:00
2021-09-23 17:02:09 -04:00
https://github.com/towzeur/gym-abalone
2019-07-26 12:44:33 -04:00
2021-09-23 17:02:09 -04:00
An implementation of the board game Abalone.
2019-09-23 01:07:51 +03:30
2021-09-23 17:02:09 -04:00
## RubiksCubeGym
2019-09-23 01:07:51 +03:30
2021-09-23 17:02:09 -04:00
https://github.com/DoubleGremlin181/RubiksCubeGym
2021-09-23 17:02:09 -04:00
The RubiksCubeGym package provides environments for twisty puzzles with multiple reward functions to help simluate the methods used by humans.
2021-09-08 01:56:29 +04:30
2021-09-23 17:02:09 -04:00
# Autonomous Driving and Traffic Control Environments
2021-09-08 01:56:29 +04:30
2021-09-23 17:02:09 -04:00
## gym-duckietown
2021-09-08 01:56:29 +04:30
2021-09-23 17:02:09 -04:00
https://github.com/duckietown/gym-duckietown
2021-09-23 17:02:09 -04:00
A lane-following simulator built for the [Duckietown](http://duckietown.org/) project (small-scale self-driving car course).
2021-09-23 17:02:09 -04:00
## gym-electric-motor: Intelligent control of electric drives
2021-09-23 17:02:09 -04:00
https://github.com/upb-lea/gym-electric-motor
An environment for simulating a wide variety of electric drives taking into account different types of electric motors and converters. Control schemes can be continuous, yielding a voltage duty cycle, or discrete, determining converter switching states directly.
2021-09-23 17:02:09 -04:00
## highway-env: Tactical Decision-Making for Autonomous Driving
2021-09-23 17:02:09 -04:00
https://github.com/eleurent/highway-env
2021-09-23 17:02:09 -04:00
An environment for behavioural planning in autonomous driving, with an emphasis on high-level perception and decision rather than low-level sensing and control. The difficulty of the task lies in understanding the social interactions with other drivers, whose behaviours are uncertain. Several scenes are proposed, such as highway, merge, intersection and roundabout.
2021-09-23 17:02:09 -04:00
## gym-carla: Gym Wrapper for CARLA Driving Simulator
2021-09-23 17:02:09 -04:00
https://github.com/cjy1992/gym-carla
2021-09-23 17:02:09 -04:00
gym-carla provides a gym wrapper for the [CARLA simulator](http://carla.org/), which is a realistic 3D simulator for autonomous driving research. The environment includes a virtual city with several surrounding vehicles running around. Multiple source of observations are provided for the ego vehicle, such as front-view camera image, lidar point cloud image, and birdeye view semantic mask. Several applications have been developed based on this wrapper, such as deep reinforcement learning for end-to-end autonomous driving.
2021-09-23 17:02:09 -04:00
## sumo-rl
2021-09-23 17:02:09 -04:00
https://github.com/LucasAlegre/sumo-rl
2021-09-23 17:02:09 -04:00
Gym wrapper for various environments in the Sumo traffic simulator
2021-09-23 17:02:09 -04:00
# Other Environments
2021-09-23 17:02:09 -04:00
## gym-algorithmic
2021-09-23 17:02:09 -04:00
https://github.com/Rohan138/gym-algorithmic
2021-09-23 17:02:09 -04:00
These are a variety of algorithmic tasks, such as learning to copy a sequence, present in Gym prior to Gym 0.20.0.
2020-04-24 15:01:13 -07:00
2021-09-23 17:02:09 -04:00
## gym-legacy-toytext
2020-04-24 15:01:13 -07:00
2021-09-23 17:02:09 -04:00
https://github.com/Rohan138/gym-legacy-toytext
2020-04-24 15:01:13 -07:00
2021-09-23 17:02:09 -04:00
These are the unused toy-text environments present in Gym prior to Gym 0.20.0.
2020-05-30 02:38:01 +05:30
2021-09-23 17:02:09 -04:00
## Obstacle Tower
2020-05-30 02:38:01 +05:30
2021-09-23 17:02:09 -04:00
https://github.com/Unity-Technologies/obstacle-tower-env
2020-05-30 02:38:01 +05:30
2021-09-23 17:02:09 -04:00
3D procedurally generated tower where you have to climb to the highest level possible
2021-09-23 17:02:09 -04:00
## PGE: Parallel Game Engine
2021-09-23 17:02:09 -04:00
https://github.com/222464/PGE
2021-09-23 17:02:09 -04:00
PGE is a FOSS 3D engine for AI simulations, and can interoperate with the Gym. Contains environments with modern 3D graphics, and uses Bullet for physics.
2020-06-24 05:30:45 +05:30
2021-09-23 17:02:09 -04:00
## gym-inventory: Inventory Control Environments
2020-06-24 05:30:45 +05:30
2021-09-23 17:02:09 -04:00
https://github.com/paulhendricks/gym-inventory
2020-06-24 05:30:45 +05:30
2021-09-23 17:02:09 -04:00
gym-inventory is a single agent domain featuring discrete state and action spaces that an AI agent might encounter in inventory control problems.
2021-09-23 17:02:09 -04:00
## gym-maze
2021-09-23 17:02:09 -04:00
https://github.com/tuzzer/gym-maze/
2021-09-23 17:02:09 -04:00
A simple 2D maze environment where an agent finds its way from the start position to the goal.
2021-09-23 17:02:09 -04:00
## osim-rl: Musculoskeletal Models in OpenSim
2021-09-23 17:02:09 -04:00
https://github.com/stanfordnmbl/osim-rl
2021-09-23 17:02:09 -04:00
A human musculoskeletal model and a physics-based simulation environment where you can synthesize physically and physiologically accurate motion. One of the environments built in this framework is a competition environment for a NIPS 2017 challenge.
2021-09-23 17:02:09 -04:00
## gym-miniworld: Minimalistic 3D Interior Environment Simulator
2021-09-23 17:02:09 -04:00
https://github.com/maximecb/gym-miniworld
2021-09-23 17:02:09 -04:00
MiniWorld is a minimalistic 3D interior environment simulator for reinforcement learning & robotics research. It can be used to simulate environments with rooms, doors, hallways and various objects (eg: office and home environments, mazes). MiniWorld can be seen as an alternative to VizDoom or DMLab. It is written 100% in Python and designed to be easily modified or extended.
2021-09-23 17:02:09 -04:00
## gym-sokoban: 2D Transportation Puzzles
2021-09-23 17:02:09 -04:00
https://github.com/mpSchrader/gym-sokoban
2021-09-23 17:02:09 -04:00
The environment consists of transportation puzzles in which the player's goal is to push all boxes on the warehouse's storage locations.
The advantage of the environment is that it generates a new random level every time it is initialized or reset, which prevents over fitting to predefined levels.
2020-09-26 00:49:37 +02:00
2021-09-23 17:02:09 -04:00
## gym-anytrading: Environments for trading markets
2020-09-26 00:49:37 +02:00
2021-09-23 17:02:09 -04:00
https://github.com/AminHP/gym-anytrading
2020-09-26 00:49:37 +02:00
2021-09-23 17:02:09 -04:00
AnyTrading is a collection of OpenAI Gym environments for reinforcement learning-based trading algorithms with a great focus on simplicity, flexibility, and comprehensiveness.
2021-09-23 17:02:09 -04:00
## gym-mtsim: Real-world simulator/environment for trading markets
2021-09-23 17:02:09 -04:00
https://github.com/AminHP/gym-mtsim
2021-09-23 17:02:09 -04:00
MtSim is a general-purpose, flexible, and easy-to-use simulator alongside an OpenAI Gym trading environment for MetaTrader 5 trading platform.
2021-09-23 17:02:09 -04:00
## NASGym: gym environment for Neural Architecture Search (NAS)
2021-09-23 17:02:09 -04:00
https://github.com/gomerudo/nas-env
2021-09-23 17:02:09 -04:00
The environment is fully-compatible with the OpenAI baselines and exposes a NAS environment following the Neural Structure Code of [BlockQNN: Efficient Block-wise Neural Network Architecture Generation](https://arxiv.org/abs/1808.05584). Under this setting, a Neural Network (i.e. the state for the reinforcement learning agent) is modeled as a list of NSCs, an action is the addition of a layer to the network, and the reward is the accuracy after the early-stop training. The datasets considered so far are the CIFAR-10 dataset (available by default) and the meta-dataset (has to be manually downloaded as specified in [this repository](https://github.com/gomerudo/meta-dataset)).
2021-09-23 17:02:09 -04:00
## openmodelica-microgrid-gym: Intelligent control of microgrids
2021-09-23 17:02:09 -04:00
https://github.com/upb-lea/openmodelica-microgrid-gym
2021-09-23 17:02:09 -04:00
The OpenModelica Microgrid Gym (OMG) package is a software toolbox for the simulation and control optimization of microgrids based on energy conversion by power electronic converters.
2021-09-23 17:02:09 -04:00
## Gridworld: A simple 2D grid environment
2021-09-23 17:02:09 -04:00
https://github.com/addy1997/Gridworld
2021-09-23 17:02:09 -04:00
The Gridworld package provides grid-based environments to help simulate the results for model-based reinforcement learning algorithms. Initial release supports single agent system only. Some features in this version of software have become obsolete. New features are being added in the software like windygrid environment.
2021-09-23 17:02:09 -04:00
## gym-adserver: Environment for online advertising
2021-09-23 17:02:09 -04:00
https://github.com/falox/gym-adserver
2021-09-23 17:02:09 -04:00
An environment that implements a typical [multi-armed bandit scenario](https://en.wikipedia.org/wiki/Multi-armed_bandit) where an [ad server](https://en.wikipedia.org/wiki/Ad_serving) must select the best advertisement to be displayed in a web page. Some example agents are included: Random, epsilon-Greedy, Softmax, and UCB1.
2021-09-23 17:02:09 -04:00
## gym-autokey: Automated rule-based deductive program verification
2021-09-23 17:02:09 -04:00
https://github.com/Flunzmas/gym-autokey
2021-09-23 17:02:09 -04:00
An environment for automated rule-based deductive program verification in the KeY verification system.
2021-09-23 17:02:09 -04:00
## gym-riverswim: A hard-exploration environment
2021-09-23 17:02:09 -04:00
https://github.com/erfanMhi/gym-riverswim
2021-08-11 12:22:58 +09:00
2021-09-23 17:02:09 -04:00
A simple environment for benchmarking reinforcement learning exploration techniques in a simplified setting.
2021-08-11 12:22:58 +09:00
2021-09-23 17:02:09 -04:00
## gym-ccc: Continuous classic control environments
2021-08-11 12:22:58 +09:00
2021-09-23 17:02:09 -04:00
https://github.com/acxz/gym-ccc
2021-09-23 17:02:09 -04:00
Environments that extend gym's classic control and add many new features.
2021-09-23 17:02:09 -04:00
## NLPGym: A toolkit to develop RL agents to solve NLP tasks
2021-09-23 17:02:09 -04:00
https://github.com/rajcscw/nlp-gym
2021-09-23 17:02:09 -04:00
[NLPGym](https://arxiv.org/pdf/2011.08272v1.pdf) provides interactive environments for standard NLP tasks such as sequence tagging, question answering, and sequence classification. Users can easily customize the tasks with their own datasets, observations, featurizers and reward functions.
2021-09-23 17:02:09 -04:00
## math-prog-synth-env
2021-09-23 17:02:09 -04:00
https://github.com/JohnnyYeeee/math_prog_synth_env
2021-09-23 17:02:09 -04:00
In our paper "A Reinforcement Learning Environment for Mathematical Reasoning via Program Synthesis" we convert the DeepMind Mathematics Dataset into an RL environment based around program synthesis.https://arxiv.org/abs/2107.07373
2021-09-23 17:02:09 -04:00
## VirtualTaobao: Environment of online recommendation
2021-09-23 17:02:09 -04:00
https://github.com/eyounx/VirtualTaobao/
2021-09-23 17:02:09 -04:00
An environment for online recommendation, where customers are learned from Taobao.com, one of the world's largest e-commerce platform.
2021-09-23 17:02:09 -04:00
## gym-recsys: Customizable RecSys Simulator
2021-09-23 17:02:09 -04:00
https://github.com/zuoxingdong/gym-recsys
2021-09-23 17:02:09 -04:00
This package describes an OpenAI Gym interface for creating a simulation environment of reinforcement learning-based recommender systems (RL-RecSys). The design strives for simple and flexible APIs to support novel research.
2021-09-23 17:02:09 -04:00
## QASGym: gym environment for Quantum Architecture Search (QAS)
https://github.com/qdevpsi3/quantum-arch-search
This a list of environments for quantum architecture search following the description in [Quantum Architecture Search via Deep Reinforcement Learning](https://arxiv.org/abs/2104.07715). The agent design the quantum circuit by taking actions in the environment. Each action corresponds to a gate applied on some wires. The goal is to build a circuit U such that generates the target n-qubit quantum state that belongs to the environment and hidden from the agent. The circuits are built using [Google QuantumAI Cirq](https://quantumai.google/cirq).
## anomalous_rl_envs: Gym environments with anomaly injection
2021-09-23 17:02:09 -04:00
https://github.com/modanesh/anomalous_rl_envs
A set of environments from control tasks: Acrobot, CartPole, and LunarLander with various types of anomalies injected into them. It could be very useful to study the behavior and robustness of a policy.
## CompilerGym
https://github.com/facebookresearch/CompilerGym
Reinforcement learning environments for compiler optimization tasks, such as LLVM phase ordering, GCC flag tuning, and CUDA loop nest code generation.
2021-09-23 17:02:09 -04:00
## LongiControl
2021-08-20 22:21:01 -04:00
2021-09-23 17:02:09 -04:00
https://github.com/dynamik1703/gym_longicontrol
An environment for the stochastic longitudinal control of an electric vehicle.
It is intended to be a descriptive and comprehensible example for a continuous real-world problem within the field of autonomous driving.
2021-09-23 17:02:09 -04:00
## safe-control-gym
2021-09-23 17:02:09 -04:00
https://github.com/utiasDSL/safe-control-gym
2021-08-20 22:21:01 -04:00
2021-08-21 16:09:36 -04:00
PyBullet-based CartPole and Quadrotor environments—with [CasADi](https://web.casadi.org) (symbolic) *a priori* dynamics and constraints—for learning-based control and model-based reinforcement learning.
2021-08-20 22:21:01 -04:00
2021-09-23 17:02:09 -04:00
## gym-games
2021-09-23 17:02:09 -04:00
https://github.com/qlan3/gym-games
2021-09-23 17:02:09 -04:00
Gym implementations of the MinAtar games, various PyGame Learning Environment games, and various custom exploration games