freeCodeCamp/curriculum/challenges/chinese/11-machine-learning-with-python/tensorflow/reinforcement-learning-with-q-learning-part-2.md

---
id: 5e8f2f13c4cdbe86b5c72da4
title: '使用 Q-Learning 进行强化学习：第 2 部分'
challengeType: 11
videoId: DX7hJuaUZ7o
bilibiliIds:
  aid: 420570359
  bvid: BV1G341127zr
  cid: 409139190
dashedName: reinforcement-learning-with-q-learning-part-2
---

# --question--

## --text--

如果智能体在采取随机动作和使用学习动作之间没有很好的平衡，会发生什么？

## --answers--

智能体将始终尝试将其对当前状态/动作的奖励最小化，从而导致局部最小值。

---

智能体将始终尝试将其对当前状态/动作的奖励最大化，从而导致局部最大值。

## --video-solution--

2
fix: QA/Infosec update and python to chinese 2020-08-13 12:00:20 +02:00			`---`
			`id: 5e8f2f13c4cdbe86b5c72da4`
chore(i18n,chn): manually downloaded curriculum (#42858) 2021-07-15 13:04:11 +05:30			`title: '使用 Q-Learning 进行强化学习：第 2 部分'`
fix: QA/Infosec update and python to chinese 2020-08-13 12:00:20 +02:00			`challengeType: 11`
			`videoId: DX7hJuaUZ7o`
chore(i18n,curriculum): update translations (#43661) 2021-10-03 12:24:27 -07:00			`bilibiliIds:`
			`aid: 420570359`
			`bvid: BV1G341127zr`
			`cid: 409139190`
feat(curriculum): restore seed + solution to Chinese (#40683) * feat(tools): add seed/solution restore script * chore(curriculum): remove empty sections' markers * chore(curriculum): add seed + solution to Chinese * chore: remove old formatter * fix: update getChallenges parse translated challenges separately, without reference to the source * chore(curriculum): add dashedName to English * chore(curriculum): add dashedName to Chinese * refactor: remove unused challenge property 'name' * fix: relax dashedName requirement * fix: stray tag Remove stray `pre` tag from challenge file. Signed-off-by: nhcarrigan <nhcarrigan@gmail.com> Co-authored-by: nhcarrigan <nhcarrigan@gmail.com> 2021-01-13 03:31:00 +01:00			`dashedName: reinforcement-learning-with-q-learning-part-2`
fix: QA/Infosec update and python to chinese 2020-08-13 12:00:20 +02:00			`---`

chore(learn): Applied MDX format to Chinese curriculum files (#40462) 2020-12-16 00:37:30 -07:00			`# --question--`
fix: QA/Infosec update and python to chinese 2020-08-13 12:00:20 +02:00
chore(learn): Applied MDX format to Chinese curriculum files (#40462) 2020-12-16 00:37:30 -07:00			`## --text--`
fix: QA/Infosec update and python to chinese 2020-08-13 12:00:20 +02:00
chore(i18n,chn): manually downloaded curriculum (#42858) 2021-07-15 13:04:11 +05:30			`如果智能体在采取随机动作和使用学习动作之间没有很好的平衡，会发生什么？`
fix: QA/Infosec update and python to chinese 2020-08-13 12:00:20 +02:00
chore(learn): Applied MDX format to Chinese curriculum files (#40462) 2020-12-16 00:37:30 -07:00			`## --answers--`
fix: QA/Infosec update and python to chinese 2020-08-13 12:00:20 +02:00
chore(i18n,chn): manually downloaded curriculum (#42858) 2021-07-15 13:04:11 +05:30			`智能体将始终尝试将其对当前状态/动作的奖励最小化，从而导致局部最小值。`
chore(learn): Applied MDX format to Chinese curriculum files (#40462) 2020-12-16 00:37:30 -07:00
			`---`

chore(i18n,chn): manually downloaded curriculum (#42858) 2021-07-15 13:04:11 +05:30			`智能体将始终尝试将其对当前状态/动作的奖励最大化，从而导致局部最大值。`
chore(learn): Applied MDX format to Chinese curriculum files (#40462) 2020-12-16 00:37:30 -07:00
			`## --video-solution--`

			`2`