2020-08-13 12:00:20 +02:00
|
|
|
---
|
|
|
|
id: 5e8f2f13c4cdbe86b5c72da3
|
2021-07-15 13:04:11 +05:30
|
|
|
title: 使用 Q-Learning 进行强化学习
|
2020-08-13 12:00:20 +02:00
|
|
|
challengeType: 11
|
|
|
|
videoId: Cf7DSU0gVb4
|
2021-01-13 03:31:00 +01:00
|
|
|
dashedName: reinforcement-learning-with-q-learning
|
2020-08-13 12:00:20 +02:00
|
|
|
---
|
|
|
|
|
2020-12-16 00:37:30 -07:00
|
|
|
# --question--
|
|
|
|
|
|
|
|
## --text--
|
|
|
|
|
2021-07-15 13:04:11 +05:30
|
|
|
强化学习的关键组成部分是......
|
2020-12-16 00:37:30 -07:00
|
|
|
|
|
|
|
## --answers--
|
|
|
|
|
2021-07-15 13:04:11 +05:30
|
|
|
环境、代表、状态、反应和奖励。
|
2020-12-16 00:37:30 -07:00
|
|
|
|
|
|
|
---
|
|
|
|
|
2021-07-15 13:04:11 +05:30
|
|
|
环境、代理、状态、动作和奖励。
|
2020-12-16 00:37:30 -07:00
|
|
|
|
|
|
|
---
|
|
|
|
|
2021-07-15 13:04:11 +05:30
|
|
|
环境、代理、状态、动作和惩罚。
|
2020-12-16 00:37:30 -07:00
|
|
|
|
|
|
|
## --video-solution--
|
|
|
|
|
|
|
|
2
|
|
|
|
|