2003 OntologiesImprovTextDocClustering
Jump to navigation
Jump to search
- (Hotho et al., 2003b) ⇒ Andreas Hotho, Steffen Staab, Gerd Stumme. (2003). “Ontologies Improve Text Document Clustering.” In: Proceedings of the Third IEEE International Conference on Data Mining (ICDM 2003). http://doi.ieeecomputersociety.org/10.1109/ICDM.2003.1250972
Subject Headings:
Notes
Cited By
~ 118 http://scholar.google.com/scholar?cites=1566765002042881945
Quotes
Abstract
- Text document clustering plays an important role in providing intuitive navigation and browsing mechanisms by organizing large sets of documents into a small number of meaningful clusters. The bag of words representation used for these clustering methods is often unsatisfactory as it ignores relationships between important terms that do not co-occur literally. In order to deal with the problem, we integrate core ontologies as background knowledge into the process of clustering text documents. Our experimental evaluations compare clustering techniques based on pre-categorizations of texts from Reuters newsfeeds and on a smaller domain of an eLearning course about Java. In the experiments, improvements of results by background knowledge compared to a baseline without background knowledge can be shown in many interesting combinations.
References
,
Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year | |
---|---|---|---|---|---|---|---|---|---|---|
2003 OntologiesImprovTextDocClustering | Steffen Staab Andreas Hotho Gerd Stumme | Ontologies Improve Text Document Clustering | http://people.aifb.kit.edu/aho/pub/hothoa icdm poster03.pdf | 10.1109/ICDM.2003.1250972 |