2004 WordnetBasedTextDocClustring
- (Sedding and Kazakov, 2004) ⇒ Julian Sedding, Dimitar Kazakov. (2004). “Wordnet-based Text Document Clustering.” In: COLING-2004 Workshop on Robust Methods in Analysis of Natural Language Data (ROMAND).
Subject Headings:
Notes
- It presents a Text Clustering Algorithm.
- It extends the work of (Hotho et al., 2003).
- It analyzes the benefits of partial disambiguation of words by their PoS and the inclusion of WordNet concepts.
- It relies only on PoS tags to aid in WSD. This leads to noisy vectors, which significantly hurt performance.
- A possible solution would be to use word-by-word disambiguation in order to choose the correct sense of each word.
- Only the hypernyms for the correct sense would be considered.
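
The contrast described in the notes above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the sense inventory `TOY_SENSES`, the function names, the `depth` parameter, and the `chosen` sense mapping are all hypothetical stand-ins for WordNet lookups and for a real disambiguation step.

```python
from collections import Counter

# Hypothetical toy sense inventory standing in for WordNet:
# (word, pos) -> {sense id: hypernym chain, nearest hypernym first}
TOY_SENSES = {
    ("bank", "n"): {
        "bank.n.01": ["slope", "geological_formation"],
        "bank.n.02": ["financial_institution", "organization"],
    },
    ("loan", "n"): {
        "loan.n.01": ["debt", "liability"],
    },
}


def pos_only_concepts(tokens, depth=2):
    """Add hypernyms of every sense that matches the PoS tag (no WSD)."""
    vec = Counter()
    for word, pos in tokens:
        vec[word] += 1
        for chain in TOY_SENSES.get((word, pos), {}).values():
            vec.update(chain[:depth])
    return vec


def disambiguated_concepts(tokens, chosen, depth=2):
    """Add hypernyms only for the sense selected for each token."""
    vec = Counter()
    for word, pos in tokens:
        vec[word] += 1
        sense = chosen.get((word, pos))
        chain = TOY_SENSES.get((word, pos), {}).get(sense, [])
        vec.update(chain[:depth])
    return vec


tokens = [("bank", "n"), ("loan", "n")]
# Assumed output of some word-by-word disambiguation step.
chosen = {("bank", "n"): "bank.n.02", ("loan", "n"): "loan.n.01"}

noisy = pos_only_concepts(tokens)
clean = disambiguated_concepts(tokens, chosen)
```

In this toy case the PoS-only vector also picks up the river-bank concepts ("slope", "geological_formation"), while the disambiguated vector keeps only the hypernyms of the financial sense.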
References
- (Hotho et al., 2003) ⇒ Andreas Hotho, Steffen Staab, and Gerd Stumme. (2003). “Wordnet improves text document clustering.” In: Proceedings of the Semantic Web Workshop at SIGIR-2003, 26th ACM SIGIR Conference.