From 720f8562bfbf3f25aab15b5727b2addd666b0fc8 Mon Sep 17 00:00:00 2001 From: Alvin Tan <43322095+KidoVin01@users.noreply.github.com> Date: Fri, 30 Nov 2018 10:14:27 -0500 Subject: [PATCH] Added "overfitting and underfitting" (#25528) --- guide/english/machine-learning/index.md | 3 +++ 1 file changed, 3 insertions(+) diff --git a/guide/english/machine-learning/index.md b/guide/english/machine-learning/index.md index c13349e7cc..2c428dff90 100644 --- a/guide/english/machine-learning/index.md +++ b/guide/english/machine-learning/index.md @@ -27,6 +27,9 @@ Supervised learning is the machine learning task of inferring a function from su data consist of a set of training examples. In supervised learning, each example is a pair consisting of an input object (typically a vector) and a desired output value (also called the supervisory signal). Further, the supervised learning can be taken as 2 paradigm, classification and regression. +### Overfitting and Underfitting +The idea of overfitting in machine learning is making the machine model the data too well. This essentially making the machine to recognize a situation with only a specific characteristics that the data shows, or in other words picking up too much noise in the data. This can be problematic in terms of the lack of flexibility in adapting different situations. On the other hand, underfitting generalizes too much. It doesn't recognizes patterns as well and would not be able to differentiate much about different situations. + #### Basic flowchart/steps for supervised learning 1. Collect training set. 2. Divide training set into input object (features) and output object (classes or value)