2009 CollectiveAnnotationofWikipedia
- (Kulkarni et al., 2009) ⇒ Sayali Kulkarni, Amit Singh, Ganesh Ramakrishnan, and Soumen Chakrabarti. (2009). “Collective Annotation of Wikipedia Entities in Web Text.” In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2009). doi:10.1145/1557019.1557073
Subject Headings: Named Entity Disambiguation, Collective Entity Linking System.
Notes
- Categories and Subject Descriptors: H.3.3 Information Search and Retrieval: Information Systems – Information Storage And Retrieval.
- General Terms: Algorithms, Experimentation.
- PDF: https://www.cc.gatech.edu/~zha/CSE8801/query-annotation/p457-kulkarni.pdf
Cited By
- http://scholar.google.com/scholar?q=%22Collective+annotation+of+Wikipedia+entities+in+web+text%22+2009
- http://portal.acm.org/citation.cfm?doid=1557019.1557073&preflayout=flat#citedby
Quotes
Author Keywords
Entity Annotation, Entity Disambiguation, Wikipedia, Collective Inference.
Abstract
To take the first step beyond keyword-based search toward entity-based search, suitable token spans (""spots"") on documents must be identified references to real-world entities from an entity catalog. Several systems have been proposed to link spots on Web pages to entities in Wikipedia. They are largely based on local compatibility between the text around the spot and textual metadata associated with the entity. Two recent systems exploit inter-label dependencies, but in limited ways. We propose a general collective disambiguation approach. Our premise is that coherent documents refer to entities from one or a few related topics or domains. We give formulations for the trade-off between local spot-to-entity compatibility and measures of global coherence between entities. Optimizing the overall entity assignment is NP-hard. We investigate practical solutions based on local hill-climbing, rounding integer linear programs, and pre-clustering entities followed by local optimization within clusters. In experiments involving over a hundred manually-annotated Web pages and tens of thousands of spots, our approaches significantly outperform recently-proposed algorithms.
References
,
Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year | |
---|---|---|---|---|---|---|---|---|---|---|
2009 CollectiveAnnotationofWikipedia | Soumen Chakrabarti Ganesh Ramakrishnan Amit Singh Sayali Kulkarni | Collective Annotation of Wikipedia Entities in Web Text | KDD-2009 Proceedings | 10.1145/1557019.1557073 | 2009 |