27 lines
		
	
	
		
			588 B
		
	
	
	
		
			Markdown
		
	
	
	
	
	
		
		
			
		
	
	
			27 lines
		
	
	
		
			588 B
		
	
	
	
		
			Markdown
		
	
	
	
	
	
|   | --- | ||
|  | id: 5e8f2f13c4cdbe86b5c72da4 | ||
|  | title: 'Reinforcement Learning With Q-Learning: Part 2' | ||
|  | challengeType: 11 | ||
|  | videoId: DX7hJuaUZ7o | ||
|  | dashedName: reinforcement-learning-with-q-learning-part-2 | ||
|  | --- | ||
|  | 
 | ||
|  | # --question--
 | ||
|  | 
 | ||
|  | ## --text--
 | ||
|  | 
 | ||
|  | What can happen if the agent does not have a good balance of taking random actions and using learned actions? | ||
|  | 
 | ||
|  | ## --answers--
 | ||
|  | 
 | ||
|  | The agent will always try to minimize its reward for the current state/action, leading to local minima. | ||
|  | 
 | ||
|  | --- | ||
|  | 
 | ||
|  | The agent will always try to maximize its reward for the current state/action, leading to local maxima. | ||
|  | 
 | ||
|  | ## --video-solution--
 | ||
|  | 
 | ||
|  | 2 | ||
|  | 
 |