2013 SubsamplingforEfficientandEffec: Difference between revisions

From GM-RKB
Jump to navigation Jump to search
m (Text replacement - " Jing Gao" to " Jing Gao")
m (Text replacement - " ↵↵" to " ")
 
Line 1: Line 1:
* ([[2013_SubsamplingforEfficientandEffec|Zimek et al., 2013]]) ⇒ [[author::Arthur Zimek]], [[author::Matthew Gaudet]], [[author::Ricardo J.G.B. Campello]], and [[author::Jörg Sander]]. ([[year::2013]]). “Subsampling for Efficient and Effective Unsupervised Outlier Detection Ensembles.” In: [[proceedings::Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining]]. ISBN:978-1-4503-2174-7 [http://dx.doi.org/10.1145/2487575.2487676 doi:10.1145/2487575.2487676]  
* ([[2013_SubsamplingforEfficientandEffec|Zimek et al., 2013]]) ⇒ [[author::Arthur Zimek]], [[author::Matthew Gaudet]], [[author::Ricardo J.G.B. Campello]], and [[author::Jörg Sander]]. ([[year::2013]]). “Subsampling for Efficient and Effective Unsupervised Outlier Detection Ensembles.” In: [[proceedings::Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining]]. ISBN:978-1-4503-2174-7 [http://dx.doi.org/10.1145/2487575.2487676 doi:10.1145/2487575.2487676]


<B>Subject Headings:</B>  
<B>Subject Headings:</B>


== Notes ==
== Notes ==

Latest revision as of 19:33, 20 December 2023

Subject Headings:

Notes

Cited By

Quotes

Author Keywords

Abstract

Outlier detection and ensemble learning are well established research directions in data mining yet the application of ensemble techniques to outlier detection has been rarely studied. Here, we propose and study subsampling as a technique to induce diversity among individual outlier detectors. We show analytically and experimentally that an outlier detector based on a subsample per se, besides inducing diversity, can, under certain conditions, already improve upon the results of the same outlier detector on the complete dataset. Building an ensemble on top of several subsamples is further improving the results. While in the literature so far the intuition that ensembles improve over single outlier detectors has just been transferred from the classification literature, here we also justify analytically why ensembles are also expected to work in the unsupervised area of outlier detection. As a side effect, running an ensemble of several outlier detectors on subsamples of the dataset is more efficient than ensembles based on other means of introducing diversity and, depending on the sample rate and the size of the ensemble, can be even more efficient than just the single outlier detector on the complete data.

References

;

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2013 SubsamplingforEfficientandEffecJörg Sander
Arthur Zimek
Matthew Gaudet
Ricardo J.G.B. Campello
Subsampling for Efficient and Effective Unsupervised Outlier Detection Ensembles10.1145/2487575.24876762013