Fix explanation of training agent (#650)

2025-07-31 05:44:31 +00:00 · 2023-08-05 12:35:20 +02:00
parent ba348890af
commit 933d481189
1 changed files with 4 additions and 2 deletions
--- a/docs/tutorials/training_agents/blackjack_tutorial.py
+++ b/docs/tutorials/training_agents/blackjack_tutorial.py
@@ -242,8 +242,10 @@ class BlackjackAgent:
 # %%
 # To train the agent, we will let the agent play one episode (one complete
 # game is called an episode) at a time and then update it’s Q-values after
-# each episode. The agent will have to experience a lot of episodes to
-# explore the environment sufficiently.
+# each step (one single action in a game is called a step).
+#
+# The agent will have to experience a lot of episodes to explore the
+# environment sufficiently.
 #
 # Now we should be ready to build the training loop.
 #