Files

camperbot 504ed3a917 chore(i18n,curriculum): update translations (#43661 )

2021-10-03 20:24:27 +01:00

id, title, challengeType, videoId, bilibiliIds, dashedName

title

challengeType

videoId

bilibiliIds

dashedName

5e8f2f13c4cdbe86b5c72da4

使用 Q-Learning 进行强化学习：第 2 部分

DX7hJuaUZ7o

aid	bvid	cid
420570359	BV1G341127zr	409139190

reinforcement-learning-with-q-learning-part-2

--question--

--text--

如果智能体在采取随机动作和使用学习动作之间没有很好的平衡，会发生什么？

智能体将始终尝试将其对当前状态/动作的奖励最小化，从而导致局部最小值。

智能体将始终尝试将其对当前状态/动作的奖励最大化，从而导致局部最大值。