2020-04-21 11:19:42 -04:00
|
|
|
---
|
|
|
|
id: 5e8f2f13c4cdbe86b5c72da4
|
2020-04-24 05:52:42 -05:00
|
|
|
title: 'Reinforcement Learning With Q-Learning: Part 2'
|
2020-04-21 11:19:42 -04:00
|
|
|
challengeType: 11
|
2020-06-23 17:36:39 +05:30
|
|
|
isHidden: false
|
2020-04-21 11:19:42 -04:00
|
|
|
videoId: DX7hJuaUZ7o
|
|
|
|
---
|
|
|
|
|
|
|
|
## Description
|
|
|
|
<section id='description'>
|
|
|
|
</section>
|
|
|
|
|
|
|
|
## Tests
|
|
|
|
<section id='tests'>
|
|
|
|
|
|
|
|
```yml
|
|
|
|
question:
|
2020-05-28 22:40:36 +09:00
|
|
|
text: What can happen if the agent does not have a good balance of taking random actions and using learned actions?
|
2020-04-21 11:19:42 -04:00
|
|
|
answers:
|
2020-05-28 22:40:36 +09:00
|
|
|
- The agent will always try to minimize its reward for the current state/action, leading to local minima.
|
|
|
|
- The agent will always try to maximize its reward for the current state/action, leading to local maxima.
|
|
|
|
solution: 2
|
2020-04-21 11:19:42 -04:00
|
|
|
```
|
|
|
|
|
|
|
|
</section>
|
|
|
|
|