Update train_agent.md (#1257)

This commit is contained in:
worthy7
2024-11-19 18:48:11 +09:00
committed by GitHub
parent d2dc19f771
commit 79a2306dba

View File

@@ -105,7 +105,7 @@ class BlackjackAgent:
## Training the agent
To train the agent, we will let the agent play one episode (one complete game is called an episode) at a time and then update it's Q-values after each episode. The agent will have to experience a lot of episodes to explore the environment sufficiently.
To train the agent, we will let the agent play one episode (one complete game is called an episode) at a time and update it's Q-values after each action taken during the episode. The agent will have to experience a lot of episodes to explore the environment sufficiently.
```python
# hyperparameters