2011 TellMeWhatINeedtoKnowSuccinctly
- (Mampaey et al., 2011) ⇒ Michael Mampaey, Nikolaj Tatti, and Jilles Vreeken. (2011). “Tell Me What I Need to Know: Succinctly Summarizing Data with Itemsets.” In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2011). ISBN:978-1-4503-0813-7 doi:10.1145/2020408.2020499
Subject Headings:
Notes
Cited By
- http://scholar.google.com/scholar?q=%222011%22+Tell+Me+What+I+Need+to+Know%3A+Succinctly+Summarizing+Data+with+Itemsets
- http://dl.acm.org/citation.cfm?id=2020408.2020499&preflayout=flat#citedby
Quotes
Author Keywords
Abstract
Data analysis is an inherently iterative process. That is, what we know about the data greatly determines our expectations, and hence, what result we would find the most interesting. With this in mind, we introduce a well-founded approach for succinctly summarizing data with a collection of itemsets; using a probabilistic maximum entropy model, we iteratively find the most interesting itemset, and in turn update our model of the data accordingly. As we only include itemsets that are surprising with regard to the current model, the summary is guaranteed to be both descriptive and non-redundant. The algorithm that we present can either mine the top-k most interesting itemsets, or use the Bayesian Information Criterion to automatically identify the model containing only the itemsets most important for describing the data. Or, in other words, it will 'tell you what you need to know'. Experiments on synthetic and benchmark data show that the discovered summaries are succinct, and correctly identify the key patterns in the data. The models they form attain high likelihoods, and inspection shows that they summarize the data well with increasingly specific, yet non-redundant itemsets.
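The iterative loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`fit_maxent`, `summarize`, and so on) are hypothetical, the maximum entropy model is fitted by brute-force iterative scaling over all 2^n possible transactions (feasible only for a handful of items), the "surprise" of an itemset is assumed to be the KL divergence between its observed and model-estimated frequency, and the stopping rule assumes a BIC of the form negative log-likelihood plus (k/2)·log|D|.

```python
import itertools
import math

EPS = 1e-9  # assumption: clamp target frequencies strictly inside (0, 1)

def covers(itemset, transaction):
    """True if every item index in `itemset` is set in the 0/1 transaction."""
    return all(transaction[i] for i in itemset)

def fit_maxent(n_items, itemsets, freqs, sweeps=100):
    """Brute-force iterative scaling over all 2^n transactions (toy sizes only)."""
    space = list(itertools.product((0, 1), repeat=n_items))
    p = [1.0 / len(space)] * len(space)  # no constraints: uniform distribution
    for _ in range(sweeps):
        for X, f in zip(itemsets, freqs):
            f = min(max(f, EPS), 1 - EPS)
            hit = [covers(X, t) for t in space]
            q = sum(pi for pi, h in zip(p, hit) if h)  # current model frequency of X
            # Rescale so that the model frequency of X equals f exactly.
            p = [pi * (f / q if h else (1 - f) / (1 - q)) for pi, h in zip(p, hit)]
    return dict(zip(space, p))

def kl_bernoulli(f, q):
    """KL divergence between Bernoulli(f) and Bernoulli(q)."""
    def term(a, b):
        return 0.0 if a == 0 else a * math.log(a / b)
    return term(f, q) + term(1 - f, 1 - q)

def bic(model, data, n_itemsets):
    """Assumed score: negative log-likelihood plus (k/2) * log|D|; lower is better."""
    loglik = sum(math.log(model[tuple(t)]) for t in data)
    return -loglik + 0.5 * n_itemsets * math.log(len(data))

def summarize(data, n_items, max_size=2, k=5):
    """Greedily add the most surprising itemset until BIC stops improving or k is hit."""
    candidates = [c for r in range(1, max_size + 1)
                  for c in itertools.combinations(range(n_items), r)]
    freq = {X: sum(covers(X, t) for t in data) / len(data) for X in candidates}
    chosen = []
    model = fit_maxent(n_items, chosen, [])
    score = bic(model, data, 0)
    while len(chosen) < k:
        rest = [X for X in candidates if X not in chosen]
        if not rest:
            break

        def surprise(X):
            estimate = sum(p for t, p in model.items() if covers(X, t))
            return kl_bernoulli(freq[X], estimate)

        best = max(rest, key=surprise)
        trial = chosen + [best]
        trial_model = fit_maxent(n_items, trial, [freq[X] for X in trial])
        trial_score = bic(trial_model, data, len(trial))
        if trial_score >= score:  # the extra itemset does not pay for its complexity
            break
        chosen, model, score = trial, trial_model, trial_score
    return chosen

# Toy data: items 0 and 1 always co-occur, item 2 is independent of both.
toy = [(1, 1, 0)] * 20 + [(1, 1, 1)] * 20 + [(0, 0, 0)] * 20 + [(0, 0, 1)] * 20
print(summarize(toy, n_items=3))
```

On this toy data the first itemset selected is `(0, 1)`: under the initial uniform model every single item and every other pair already has its observed frequency, so the co-occurrence of items 0 and 1 is the only surprising pattern. Passing a small `k` mimics the top-k mode, while a large `k` leaves the stopping decision to the BIC check.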
References
Page | Author | title | doi | year |
---|---|---|---|---|
2011 TellMeWhatINeedtoKnowSuccinctly | Nikolaj Tatti, Jilles Vreeken, Michael Mampaey | Tell Me What I Need to Know: Succinctly Summarizing Data with Itemsets | 10.1145/2020408.2020499 | 2011 |