2001 DataMiningAtTheIntOfCompSciAndStats

From GM-RKB
Jump to navigation Jump to search

Subject Headings: Data Mining, statistics, pattern recognition, transaction data, correlation.

Notes

Cited By

~12 http://scholar.google.com/scholar?cites=2039945908716953323

Quotes

Abstract

2. Is Data Mining Different from Statistics?

  • Is data mining as currently practiced substantially different from conventional applied statistics? Certainly if one looks at the published commercial applications of data mining, such as the case studies presented in [BL00], one sees a heavy reliance on techniques that have their lineage in applied statistics. For example, decision trees are perhaps the single most widely-used modeling technique in commercial predictive data mining applications [Joh99, Koh00]. They are particularly popular because of their ability to both deal with heterogenous data types (they can easily handle both categorical and real-valued variables) and to find relatively low-dimensional parsimonious predictors for high-dimensional problems.

References

  • M. J. A. Berry, and G. Linoff. (2000). “Mastering Data Mining: The Art and Science of Customer Relationship Management.” John Wiley and Sons,


 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2001 DataMiningAtTheIntOfCompSciAndStatsPadhraic SmythData Mining at the Interface of Computer Science and Statisticshttp://www.datalab.uci.edu/papers/dmchap.pdf2001