freeCodeCamp/curriculum/challenges/english/11-machine-learning-with-python/tensorflow/reinforcement-learning-with-q-learning-part-2.md

---
id: 5e8f2f13c4cdbe86b5c72da4
title: 'Reinforcement Learning With Q-Learning: Part 2'
challengeType: 11
videoId: DX7hJuaUZ7o
dashedName: reinforcement-learning-with-q-learning-part-2
---

# --question--

## --text--

What can happen if the agent does not have a good balance of taking random actions and using learned actions?

## --answers--

The agent will always try to minimize its reward for the current state/action, leading to local minima.

---

The agent will always try to maximize its reward for the current state/action, leading to local maxima.

## --video-solution--

2
add tensorflow course without questions (#38525) 2020-04-21 11:19:42 -04:00			`---`
			`id: 5e8f2f13c4cdbe86b5c72da4`
fix: rename tensorflow lessons (#38617) 2020-04-24 05:52:42 -05:00			`title: 'Reinforcement Learning With Q-Learning: Part 2'`
add tensorflow course without questions (#38525) 2020-04-21 11:19:42 -04:00			`challengeType: 11`
			`videoId: DX7hJuaUZ7o`
feat(curriculum): restore seed + solution to Chinese (#40683) * feat(tools): add seed/solution restore script * chore(curriculum): remove empty sections' markers * chore(curriculum): add seed + solution to Chinese * chore: remove old formatter * fix: update getChallenges parse translated challenges separately, without reference to the source * chore(curriculum): add dashedName to English * chore(curriculum): add dashedName to Chinese * refactor: remove unused challenge property 'name' * fix: relax dashedName requirement * fix: stray tag Remove stray `pre` tag from challenge file. Signed-off-by: nhcarrigan <nhcarrigan@gmail.com> Co-authored-by: nhcarrigan <nhcarrigan@gmail.com> 2021-01-13 03:31:00 +01:00			`dashedName: reinforcement-learning-with-q-learning-part-2`
add tensorflow course without questions (#38525) 2020-04-21 11:19:42 -04:00			`---`

Feat: add new Markdown parser (#39800) and change all the challenges to new `md` format. 2020-11-27 19:02:05 +01:00			`# --question--`
fix(curriculum): convert all video challenges to markdown (#39189) 2020-08-04 20:56:41 +01:00
Feat: add new Markdown parser (#39800) and change all the challenges to new `md` format. 2020-11-27 19:02:05 +01:00			`## --text--`
add tensorflow course without questions (#38525) 2020-04-21 11:19:42 -04:00
Feat: add new Markdown parser (#39800) and change all the challenges to new `md` format. 2020-11-27 19:02:05 +01:00			`What can happen if the agent does not have a good balance of taking random actions and using learned actions?`
fix(curriculum): convert all video challenges to markdown (#39189) 2020-08-04 20:56:41 +01:00
Feat: add new Markdown parser (#39800) and change all the challenges to new `md` format. 2020-11-27 19:02:05 +01:00			`## --answers--`
add tensorflow course without questions (#38525) 2020-04-21 11:19:42 -04:00
Feat: add new Markdown parser (#39800) and change all the challenges to new `md` format. 2020-11-27 19:02:05 +01:00			`The agent will always try to minimize its reward for the current state/action, leading to local minima.`

			`---`

			`The agent will always try to maximize its reward for the current state/action, leading to local maxima.`

			`## --video-solution--`

			`2`
add tensorflow course without questions (#38525) 2020-04-21 11:19:42 -04:00