2003 UnsupPersonalNameDisambig
Jump to navigation
Jump to search
- (Mann & Yarowsky, 2003) ⇒ Gideon S. Mann, David Yarowsky. (2003). “Unsupervised Personal Name Disambiguation.” In: Proceedings of HLT-NAACL (2003). doi:10.3115/1119176.1119181
Subject Headings: Entity Mention Coreference Resolution, Person Mention Coreference Resolution.
Notes
- It proposes the use of a Hierarchical Agglomerative Clustering Algorithm to the Multi-Document Coreference Resolution Task.
- It proposes the use of Semantic Information such as the date of birth, professional career or education.
Cited By
~204 http://scholar.google.com/scholar?cites=13562376193064410392
- (Bekkerman & McCallum, 2005) ⇒ Ron Bekkerman, and Andrew McCallum. (2005). “Disambiguating Web Appearance of People in a Social Network.” In: Proceedings of the 14th International World Wide Web Conference. (WWW 2005).
- Mann and Yarowsky (2003) addressed the task of clustering the Web search results for a set of ambiguous personal names by employing a rich feature space of biographic facts obtained via bootstrapped extraction patterns. They reported 88% precision and 73% recall in a three-way classification (most common, secondary, and other uses).
Quotes
Abstract
- This paper presents a set of algorithms for distinguishing personal names with multiple real referents in text, based on little or no supervision. The approach utilizes an unsupervised clustering technique over a rich feature space of biographic facts, which are automatically extracted via a language-independent bootstrapping process. The induced clustering of named entities are then partitioned and linked to their real referents via the automatically extracted biographic data. Performance is evaluated based on both a test set of handlabeled multi-referent personal names and via automatically generated pseudonames.
References
- Amit Bagga, Breck Baldwin, Entity-based cross-document coreferencing using the Vector Space Model, Proceedings of the 17th International Conference on Computational linguistics, August 10-14, 1998, Montreal, Quebec, Canada
- S. Brin. (1998). Extracting patterns and relations from the world wide web. In WebDB Workshop at 6th International Conference on Extending Database Technology, EDBT'98.
- M. E. Califf and Raymond Mooney. (1998). Relational learning of pattern-match rules for information extraction. In Working Notes, of AAAI Spring Symposium on Applying Machine Learning to Discourse Processing, pages 6--11, Menlo Park, CA. AAAI Press.
- Dayne Freitag and Andrew McCallum. (1999). Information extraction with hmms and shrinkage. In: Proceedings of the AAAI-99 Workshop on Machine Learning for Information Extraction.
- B. Gale, Kenneth W. Church, and David Yarowsky. (1992). Work on statistical methods for word sense disambiguation. In: Proceedings of AAAIFall Symposium on Probabilistic Approaches to Natural Language Processing, pages 54--60, Cambridge, MA.
- Scott B. Huffman, Learning information extraction patterns from examples, Connectionist, Statistical, and Symbolic Approaches to Learning for Natural Language Processing, p.246-260, January 1996
- Deepak Ravichandran, Eduard Hovy, Learning surface text patterns for a Question Answering system, Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, July 07-12, 2002, Philadelphia, Pennsylvania doi:10.3115/1073083.1073092
- Barry Schiffman, Inderjeet Mani, Kristian J. Concepcion, Producing biographical summaries: combining linguistic knowledge with corpus statistics, Proceedings of the 39th Annual Meeting on Association for Computational Linguistics, p.458-465, July 06-11, 2001, Toulouse, France doi:10.3115/1073012.1073071
- David A. Smith, Gregory Crane, Disambiguating Geographic Names in a Historical Digital Library, Proceedings of the 5th European Conference on Research and Advanced Technology for Digital Libraries, p.127-136, September 04-09, 2001
- Nina Wacholder, Yael Ravin, Misook Choi, Disambiguation of proper names in text, Proceedings of the fifth Conference on Applied Natural Language Processing, p.202-208, March 31-April 03, 1997, Washington, DC doi:10.3115/974557.974587
- Roman Yangarber, Ralph Grishman, Pasi Tapanainen, Silja Huttunen, Unsupervised discovery of scenario-level patterns for Information Extraction, Proceedings of the sixth Conference on Applied Natural Language Processing, p.282-289, April 29-May 04, 2000, Seattle, Washington doi:10.3115/974147.974186,