---
id: 5e8f2f13c4cdbe86b5c72da5
title: 'Reinforcement Learning With Q-Learning: Example'
challengeType: 11
videoId: RBBSNta234s
bilibiliIds:
  aid: 848073871
  bvid: BV1uL4y187Eq
  cid: 409139471
dashedName: reinforcement-learning-with-q-learning-example
---

# --question--

## --text--

Fill in the blanks to complete the following Q-Learning equation:

```py
Q[__A__, __B__] = Q[__A__, __B__] + LEARNING_RATE * (reward + GAMMA * np.max(Q[__C__, :]) - Q[__A__, __B__])
```

## --answers--

A: `state`

B: `action`

C: `next_state`

---

A: `state`

B: `action`

C: `prev_state`

---

A: `state`

B: `reaction`

C: `next_state`

## --video-solution--

1
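
The update rule in the question can be checked with a small worked example. The sketch below is illustrative only: the table size, the pre-filled Q-values, the hyperparameter values, and the helper name `q_update` are all assumptions made for this example and do not come from the lesson.

```python
import numpy as np

# Hypothetical sizes and hyperparameters, chosen only for illustration.
STATES, ACTIONS = 3, 2
LEARNING_RATE = 0.5
GAMMA = 0.9

# Q-table with one row per state and one column per action.
Q = np.zeros((STATES, ACTIONS))
Q[1, :] = [2.0, 4.0]  # assume the agent already has estimates for state 1


def q_update(Q, state, action, reward, next_state):
    """Apply one Q-Learning update in place and return the new Q[state, action]."""
    # Target: immediate reward plus the discounted best value reachable from next_state.
    target = reward + GAMMA * np.max(Q[next_state, :])
    # Move the current estimate a fraction LEARNING_RATE of the way toward the target.
    Q[state, action] = Q[state, action] + LEARNING_RATE * (target - Q[state, action])
    return Q[state, action]


# Taking action 1 in state 0 yields reward 1.0 and leads to state 1.
new_value = q_update(Q, state=0, action=1, reward=1.0, next_state=1)
print(new_value)
```

With these assumed numbers the target is `1.0 + 0.9 * 4.0 = 4.6`, so the entry moves halfway from `0.0` toward it and ends at approximately `2.3`. Only the entry for the visited state and action changes; the row for the next state is read but not written.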