2008 ModelbasedDocumentClusteringwit

From GM-RKB
Jump to navigation Jump to search

Subject Headings:

Notes

Cited By

Quotes

Author Keywords

Abstract

Model-based algorithms are emerging as a preferred method for document clustering. As computing resources improve, methods such as Gibbs sampling have become more common for parameter estimation in these models. Gibbs sampling is well understood for many applications, but has not been extensively studied for use in document clustering. We explore the convergence rate, the possibility of label switching, and chain summarization methodologies for document clustering on a particular model, namely a mixture of multinomials model, and show that fairly simple methods can be employed, while still producing clusterings of superior quality compared to those produced with the EM algorithm.

References

,

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2008 ModelbasedDocumentClusteringwitDaniel David Walker
Eric K. Ringger
Model-based Document Clustering with a Collapsed Gibbs Sampler10.1145/1401890.1401975