mirror of
https://github.com/Farama-Foundation/Gymnasium.git
synced 2025-08-01 22:11:25 +00:00
typos
This commit is contained in:
@@ -28,10 +28,10 @@ The output should look something like this
|
||||
The commonly used methods are:
|
||||
|
||||
`reset()` resets the environment to its initial state and returns the observation corresponding to the initial state
|
||||
`step(action)` takes an action as an input and implements that action in the environement. This method returns a set of four values
|
||||
`step(action)` takes an action as an input and implements that action in the environment. This method returns a set of four values
|
||||
`render()` renders the environment
|
||||
|
||||
- `observation` (**object**) : an environment specific object repesentation your observation of the environment after the step is taken. Its often aliased as the next state after the action has been taken
|
||||
- `observation` (**object**) : an environment specific object representation your observation of the environment after the step is taken. Its often aliased as the next state after the action has been taken
|
||||
- `reward`(**float**) : immediate reward achieved by the previous action. Actual value and range will varies between environments, but the final goal is always to increase your total reward
|
||||
- `done`(**boolean**): whether it’s time to `reset` the environment again. Most (but not all) tasks are divided up into well-defined episodes, and `done` being `True` indicates the episode has terminated. (For example, perhaps the pole tipped too far, or you lost your last life.)
|
||||
- `info`(**dict**) : This provides general information helpful for debugging or additional information depending on the environment, such as the raw probabilities behind the environment’s last state change
|
||||
|
Reference in New Issue
Block a user