Togaware DATA MINING
Desktop Survival Guide
by Graham Williams
Google


Glossary



With the rather unruly explosion of interest in data mining and its well documented commercial successes, and the fact that data mining is the fusion of many disciplines, each with their own heritage, the terminology used in the data mining community is at times quite confusing and often redundant. The beginnings of a Glossary began here but has ceased. http://en.wikipedia.orgWikipedia is now the canonical source.

Bias: The error in a model due to systematic inadequacies in the learning process. That is, those instances consistently incorrectly classified by models built by the learning algorithm. Modelling error due to bias can be reduced using http://en.wikipedia.org/wiki/BoostingBoosting. Compare with http://en.wikipedia.org/wiki/variancevariance.



Subsections

Copyright © 2004-2006 [email protected]
Support further development through the purchase of the PDF version of the book.
Brought to you by Togaware.