From 3dad9329732eafdedf0f67c360d48b299eb90427 Mon Sep 17 00:00:00 2001 From: Mudit Choraria <32506556+procode-dd@users.noreply.github.com> Date: Sat, 17 Nov 2018 23:49:52 +0530 Subject: [PATCH] Added description for Reinforcement Learning (#21889) Described the similarity of Reinforcement Learning with Supervised and Unsupervised Learning. --- guide/english/machine-learning/reinforcement-learning/index.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/guide/english/machine-learning/reinforcement-learning/index.md b/guide/english/machine-learning/reinforcement-learning/index.md index 347f440f51..f5fe355709 100644 --- a/guide/english/machine-learning/reinforcement-learning/index.md +++ b/guide/english/machine-learning/reinforcement-learning/index.md @@ -11,6 +11,8 @@ title: Reinforcement Learning Reinforcement Learning refers to a field of Machine Learning that applies to agents that you reinforce by giving them reward and punishment. It gives a nice gradual learning and can simplify the learning of agent in tasks where you cannot determine a proper error value. +Reinforcement learning lies *in between* supervised and unsupervised learning. While the former has a label and the latter does not, reinforcement learning has *sparse time delay labels* (rewards). It revolves around creating a mathematical framework that encapsulates the idea of an AI interacting with an environment and *time* acting as a dimension and learning through **trial and error**. + Example: A bot is given a task to play Space Invaders, it tries to learn to play it by interacting with game and in return getting a reward for the points that it scored at end of the game. Greater the reward, greater are its chances of doing the similar gameplay. In that way, it learns how to play the game and perform in the best possible way.