2012 DifferentSlopesforDifferentFolk
- (Duivesteijn et al., 2012) ⇒ Wouter Duivesteijn, Ad Feelders, and Arno Knobbe. (2012). “Different Slopes for Different Folks: Mining for Exceptional Regression Models with Cook's Distance.” In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2012). ISBN:978-1-4503-1462-6 doi:10.1145/2339530.2339668
Subject Headings:
Notes
Cited By
- http://scholar.google.com/scholar?q=%222012%22+Different+Slopes+for+Different+Folks%3A+Mining+for+Exceptional+Regression+Models+with+Cook%27s+Distance
- http://dl.acm.org/citation.cfm?id=2339530.2339668&preflayout=flat#citedby
Quotes
Author Keywords
Abstract
Exceptional Model Mining (EMM) is an exploratory data analysis technique that can be regarded as a generalization of subgroup discovery. In EMM we look for subgroups of the data for which a model fitted to the subgroup differs substantially from the same model fitted to the entire dataset. In this paper we develop methods to mine for exceptional regression models. We propose a measure for the exceptionality of regression models (Cook's distance), and explore the possibilities to avoid having to fit the regression model to each candidate subgroup. The algorithm is evaluated on a number of real life datasets. These datasets are also used to illustrate the results of the algorithm. We find interesting subgroups with deviating models on datasets from several different domains. We also show that under certain circumstances one can forego fitting regression models on up to 40% of the subgroups, and these 40% are the relatively expensive regression models to compute.
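The abstract does not spell out how Cook's distance is applied to subgroups, so the following is only a minimal sketch under stated assumptions. It assumes a simple linear model y = a + b·x, fits it once to the whole dataset and once to a candidate subgroup, and scores the subgroup with the classical Cook's-distance form D = (β_G − β)ᵀ XᵀX (β_G − β) / (p·s²), where β is the full-data estimate, β_G the subgroup estimate, p the number of coefficients, and s² the residual variance of the full model. The function names `fit_line` and `cooks_distance` and the toy data are hypothetical and are not taken from the paper.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x; returns (a, b).

    Assumes at least two distinct x values (otherwise the slope is undefined).
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def cooks_distance(xs, ys, in_subgroup):
    """Score a subgroup by how far its fitted line lies from the global line.

    in_subgroup is a list of booleans marking subgroup membership.
    Assumption: coefficients fitted on the subgroup are compared with
    coefficients fitted on the whole dataset.
    """
    n, p = len(xs), 2  # p = intercept + slope
    a, b = fit_line(xs, ys)

    sub_x = [x for x, g in zip(xs, in_subgroup) if g]
    sub_y = [y for y, g in zip(ys, in_subgroup) if g]
    a_g, b_g = fit_line(sub_x, sub_y)

    # Residual variance of the full-data model.
    s2 = sum((y - a - b * x) ** 2 for x, y in zip(xs, ys)) / (n - p)

    # Quadratic form d^T (X^T X) d for the design matrix with rows [1, x],
    # where X^T X = [[n, sum x], [sum x, sum x^2]].
    da, db = a_g - a, b_g - b
    sum_x = sum(xs)
    sum_x2 = sum(x * x for x in xs)
    quad = n * da * da + 2 * sum_x * da * db + sum_x2 * db * db

    return quad / (p * s2)


# Toy data: the first four records follow y = x, the last four y = -x.
xs = [0, 1, 2, 3, 0, 1, 2, 3]
ys = [0, 1, 2, 3, 0, -1, -2, -3]
rising = [True] * 4 + [False] * 4
everything = [True] * 8

score_rising = cooks_distance(xs, ys, rising)
score_everything = cooks_distance(xs, ys, everything)
```

In this toy example the subgroup with the rising slope gets a clearly positive score, while the subgroup that equals the whole dataset scores zero because its coefficients coincide with the global ones. A subgroup-discovery search would rank candidate subgroups by such a score; the pruning of regression fits mentioned in the abstract is not covered by this sketch.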
References
| | Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year |
|---|---|---|---|---|---|---|---|---|---|---|
| 2012 DifferentSlopesforDifferentFolk | Arno Knobbe, Wouter Duivesteijn, Ad Feelders | | | Different Slopes for Different Folks: Mining for Exceptional Regression Models with Cook's Distance | | | | 10.1145/2339530.2339668 | | 2012 |