Files
Gymnasium/docs/environments/atari/breakout.md

87 lines
4.6 KiB
Markdown
Raw Normal View History

2022-09-13 20:27:34 +01:00
---
title: Breakout
---
2022-09-13 20:27:34 +01:00
# Breakout
```{figure} ../../_static/videos/atari/breakout.gif
2022-09-13 20:27:34 +01:00
:width: 120px
:name: Breakout
```
This environment is part of the <a href='..'>Atari environments</a>. Please read that page first for general information.
## Description
2023-02-15 01:30:47 +00:00
Another famous Atari game. The dynamics are similar to pong: You move a paddle and hit the ball in a brick wall at the top of the screen. Your goal is to destroy the brick wall. You can try to break through the wall and let the ball wreak havoc on the other side, all on its own! You have five lives.
For a more detailed documentation, see [the AtariAge page](https://atariage.com/manual_html_page.php?SoftwareID=889)
2022-09-13 20:27:34 +01:00
## Actions
2023-02-15 01:30:47 +00:00
Breakout has the action space `Discrete(4)` with the table below lists the meaning of each action's meanings.
As Breakout uses a reduced set of actions for `v0`, `v4` and `v5` versions of the environment.
To enable all 18 possible actions that can be performed on an Atari 2600, specify `full_action_space=True` during
initialization or by passing `full_action_space=True` to `gymnasium.make`.
2022-09-13 20:27:34 +01:00
2023-02-15 01:30:47 +00:00
| Value | Meaning |
|---------|-----------|
| `0` | `NOOP` |
| `1` | `FIRE` |
| `2` | `RIGHT` |
| `3` | `LEFT` |
## Observations
2023-02-15 01:30:47 +00:00
Atari environment have two possible observation types, the observation space is listed below.
See variants section for the type of observation used by each environment id.
2023-02-15 01:30:47 +00:00
- `obs_type="rgb" -> observation_space=Box(0, 255, (210, 160, 3), np.uint8)`
- `obs_type="ram" -> observation_space=Box(0, 255, (128,), np.uint8)`
2022-09-13 20:27:34 +01:00
2023-02-15 01:30:47 +00:00
Additionally, `obs_type="grayscale"` cause the environment return a grayscale version of the rgb array for observations with the observation space being `Box(0, 255, (210, 160), np.uint8)`
2022-09-13 20:27:34 +01:00
### Rewards
2022-09-13 20:27:34 +01:00
You score points by destroying bricks in the wall. The reward for destroying a brick depends on the color of the brick.
For a more detailed documentation, see [the AtariAge page](https://atariage.com/manual_html_page.php?SoftwareID=889).
2023-02-15 01:30:47 +00:00
## Variants
Breakout has the following variants of the environment id which have the following differences in observation,
the number of frame-skips and the repeat action probability.
| Env-id | obs_type= | frameskip= | repeat_action_probability= |
|------------------------------|-------------|--------------|------------------------------|
| Breakout-v0 | `"rgb"` | `(2, 5)` | `0.25` |
| Breakout-ram-v0 | `"ram"` | `(2, 5)` | `0.25` |
| Breakout-ramDeterministic-v0 | `"ram"` | `4` | `0.25` |
| Breakout-ramNoFrameskip-v0 | `"ram"` | `1` | `0.25` |
| BreakoutDeterministic-v0 | `"rgb"` | `4` | `0.25` |
| BreakoutNoFrameskip-v0 | `"rgb"` | `1` | `0.25` |
| Breakout-v4 | `"rgb"` | `(2, 5)` | `0.0` |
| Breakout-ram-v4 | `"ram"` | `(2, 5)` | `0.0` |
| Breakout-ramDeterministic-v4 | `"ram"` | `4` | `0.0` |
| Breakout-ramNoFrameskip-v4 | `"ram"` | `1` | `0.0` |
| BreakoutDeterministic-v4 | `"rgb"` | `4` | `0.0` |
| BreakoutNoFrameskip-v4 | `"rgb"` | `1` | `0.0` |
| ALE/Breakout-v5 | `"rgb"` | `4` | `0.25` |
| ALE/Breakout-ram-v5 | `"ram"` | `4` | `0.25` |
## Difficulty and modes
2022-09-13 20:27:34 +01:00
It is possible to specify various flavors of the environment via the keyword arguments `difficulty` and `mode`.
2023-02-15 01:30:47 +00:00
A flavor is a combination of a game mode and a difficulty setting. The table below lists the possible difficulty and mode values
along with the default values.
2022-09-13 20:27:34 +01:00
2023-02-15 01:30:47 +00:00
| Available Modes | Default Mode | Available Difficulties | Default Difficulty |
|-------------------------------------------------|----------------|--------------------------|----------------------|
| `[0, 4, 8, 12, 16, 20, 24, 28, 32, 36, 40, 44]` | `0` | `[0, 1]` | `0` |
2022-09-13 20:27:34 +01:00
## Version History
2023-02-15 01:30:47 +00:00
A thorough discussion of the intricate differences between the versions and configurations can be found in the general article on Atari environments.
2022-09-13 20:27:34 +01:00
2023-02-15 01:30:47 +00:00
* v5: Stickiness was added back and stochastic frameskipping was removed. The environments are now in the "ALE" namespace.
2022-09-13 20:27:34 +01:00
* v4: Stickiness of actions was removed
2023-02-15 01:30:47 +00:00
* v0: Initial versions release