2011 AnEffectiveEvaluationMeasurefor

From GM-RKB
Jump to navigation Jump to search

Subject Headings:

Notes

Cited By

Quotes

Author Keywords

Abstract

Due to the ever growing presence of data streams, there has been a considerable amount of research on stream mining algorithms. While many algorithms have been introduced that tackle the problem of clustering on evolving data streams, hardly any attention has been paid to appropriate evaluation measures. Measures developed for static scenarios, namely structural measures and ground-truth-based measures, cannot correctly reflect errors attributable to emerging, splitting, or moving clusters. These situations are inherent to the streaming context due to the dynamic changes in the data distribution. In this paper we develop a novel evaluation measure for stream clustering called Cluster Mapping Measure (CMM). CMM effectively indicates different types of errors by taking the important properties of evolving data streams into account. We show in extensive experiments on real and synthetic data that CMM is a robust measure for stream clustering evaluation.

References

;

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2011 AnEffectiveEvaluationMeasureforAlbert Bifet
Geoff Holmes
Bernhard Pfahringer
Philipp Kranen
Thomas Seidl
Timm Jansen
Hardy Kremer
An Effective Evaluation Measure for Clustering on Evolving Data Streams10.1145/2020408.20205552011