---
title: Data Alone Is not Enough
---
## Data Alone Is not Enough

Data alone is not enough to make your learner do better. Unless you also
consider the machine learning algorithm and other aspects of how the model is
trained, adding data may not improve results.

> Every learner must embody some knowledge or assumptions beyond the data it's
> given in order to generalize beyond it (Domingos, 2012).

In other words, if you choose a learner blindly, just because you've heard it
does well, collecting more data won't necessarily bring you closer to your
machine learning goals.

For example, say you have data that depends on time (e.g. time series data)
and you want to use a binary classifier (e.g. logistic regression). Collecting
more time series data might not be the best way to help your learner, because
a standard binary classifier treats each observation as independent and has no
notion of the order in which the observations arrive.

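To make this concrete, here is a minimal sketch in Python. Everything in it is an assumption made for illustration: the data is a synthetic, noise-free sine wave, the task is to predict whether the next value goes up, and a simple majority-vote lookup table stands in for the binary classifier (it is not logistic regression). The names `make_series`, `build_rows`, `fit_majority`, and `evaluate` are hypothetical helpers. One learner sees only the current value; the other also gets a lag feature, which encodes the knowledge that order matters.

```python
import math
from collections import Counter, defaultdict


def make_series(n, period=48):
    """Synthetic data: a noise-free sine wave sampled at integer time steps."""
    return [math.sin(2 * math.pi * t / period) for t in range(n)]


def build_rows(series, use_lag):
    """Turn a series into (features, label) rows.

    label:    1 if the next value is higher than the current one, else 0.
    features: the current value rounded to one decimal, plus (optionally)
              whether the series rose since the previous step.
    """
    rows = []
    for t in range(1, len(series) - 1):
        level = round(series[t], 1)
        rose = series[t] > series[t - 1]  # lag feature: needs the ordering
        label = int(series[t + 1] > series[t])
        features = (level, rose) if use_lag else (level,)
        rows.append((features, label))
    return rows


def fit_majority(rows):
    """Memorise the most common label seen for each feature tuple."""
    votes = defaultdict(Counter)
    for features, label in rows:
        votes[features][label] += 1
    return {f: c.most_common(1)[0][0] for f, c in votes.items()}


def accuracy(model, rows, default=0):
    """Fraction of rows whose label matches the model's prediction."""
    hits = sum(model.get(features, default) == label for features, label in rows)
    return hits / len(rows)


def evaluate(n, use_lag):
    """Train on the first three quarters of the series, test on the rest."""
    rows = build_rows(make_series(n), use_lag)
    split = len(rows) * 3 // 4
    model = fit_majority(rows[:split])
    return accuracy(model, rows[split:])


# Grow the dataset tenfold each time and compare the two learners.
for n in (200, 2000, 20000):
    print(n, round(evaluate(n, use_lag=False), 3), round(evaluate(n, use_lag=True), 3))
```

In this setup the learner that sees only the current value stays close to chance however long the series gets, because the same value occurs both on the way up and on the way down. The learner given the lag feature scores far higher even on the shortest series. The extra knowledge is what makes the difference, not the extra data.
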
This is not to say that adding more data does you no good once you've chosen a
machine learning algorithm that suits your problem. In that case, more data
will generally help.

> Machine learning is not magic; it can't get something from nothing. What it
> does is get more from less... Learners combine knowledge with data to grow
> programs (Domingos, 2012).

#### More Information:

- <a href='https://homes.cs.washington.edu/~pedrod/papers/cacm12.pdf' target='_blank' rel='nofollow'>A Few Useful Things to Know about Machine Learning</a>
- <a href='http://www.kdnuggets.com/2015/06/machine-learning-more-data-better-algorithms.html' target='_blank' rel='nofollow'>In Machine Learning, What is Better: More Data or better Algorithms?</a>
- <a href='https://www.quora.com/In-machine-learning-is-more-data-always-better-than-better-algorithms/answer/Xavier-Amatriain?srid=Tds3' target='_blank' rel='nofollow'>In machine learning, is more data always better than better algorithms?</a>