1992 AutomaticHyponymAcquisition
- (Hearst, 1992) ⇒ Marti Hearst. (1992). “Automatic Acquisition of Hyponyms from Large Text Corpora.” In: Proceedings of the 14th International Conference on Computational Linguistics (COLING-1992). doi:10.3115/992133.992154.
Subject Headings: Relation Recognition, Lexical Acquisition, Hearst Lexical Pattern.
Notes
- It proposes the use of English lexico-syntactic patterns, now known as Hearst patterns (e.g. “X such as Y”), to recognize hyponymy relations in text.
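The “X such as Y” pattern noted above can be illustrated with a minimal sketch. It is not the paper's implementation: the names `WORD`, `SUCH_AS`, and `extract_hyponyms` are hypothetical, and a noun phrase is approximated by a single word, which is an assumption made only to keep the example short.

```python
import re

# Hypothetical, deliberately naive stand-in for a noun phrase: a single word.
WORD = r"[A-Za-z][A-Za-z-]*"

# "X such as Y1, Y2, and Y3": X is read as the hypernym, each Y as a hyponym.
SUCH_AS = re.compile(
    rf"(?P<hyper>{WORD}),? such as "
    rf"(?P<hypos>{WORD}(?:(?:, (?:and |or )?| and | or ){WORD})*)"
)


def extract_hyponyms(text):
    """Return (hyponym, hypernym) pairs found by the 'such as' pattern."""
    pairs = []
    for match in SUCH_AS.finditer(text):
        # Split the matched list on commas and on the final "and"/"or".
        hyponyms = re.split(r",? (?:and|or) |, ", match.group("hypos"))
        pairs.extend((h, match.group("hyper")) for h in hyponyms if h)
    return pairs


if __name__ == "__main__":
    sentence = "She plays instruments such as violin, cello, and flute."
    for hyponym, hypernym in extract_hyponyms(sentence):
        print(f"hyponym({hyponym}, {hypernym})")
```

Because the sketch matches single words rather than parsed noun phrases, it will both miss multi-word terms and over-capture in sentences where the list is followed by an unrelated clause. A practical extractor would likely need part-of-speech tagging or noun-phrase chunking on top of the pattern.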
Cited By
2005
- (Etzioni et al., 2005) ⇒ Oren Etzioni, Michael J. Cafarella, Doug Downey, Ana-Maria Popescu, Tal Shaked, Stephen Soderland, Daniel S. Weld, and Alexander Yates. (2005). “Unsupervised Named-Entity Extraction from the Web: An Experimental Study.” In: Artificial Intelligence, 165(1).
Quotes
Abstract
We describe a method for the automatic acquisition of the hyponymy lexical relation from unrestricted text. Two goals motivate the approach: (i) avoidance of the need for pre-encoded knowledge and (ii) applicability across a wide range of text. We identify a set of lexicosyntactic patterns that are easily recognizable, that occur frequently and across text genre boundaries, and that indisputably indicate the lexical relation of interest. We describe a method for discovering these patterns and suggest that other lexical relations will also be acquirable in this way. A subset of the acquisition algorithm is implemented and the results are used to augment and critique the structure of a large hand-built thesaurus. Extensions and applications to areas such as information retrieval are suggested.
References
- Ahlswede, T. & M. Evens (1988). Parsing vs. text processing in the analysis of dictionary definitions. Proceedings of the 26th Annual Meeting of the Association for Computational Linguistics, pages 217–224.
- Alshawi, H. (1987). Processing dictionary definitions with phrasal pattern hierarchies. American Journal of Computational Linguistics, 13(3):195–202.
- Batali, J. (1991). Automatic Acquisition and Use of Some of the Knowledge in Physics Texts. PhD thesis, Massachusetts Institute of Technology, Artificial Intelligence Laboratory.
- Brent, M. R. (1991). Automatic acquisition of subcategorization frames from untagged, free-text corpora. In: Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics.
- Calzolari, N. & R. Bindi (1990). Acquisition of lexical information from a large textual Italian corpus. In: Proceedings of the Thirteenth International Conference on Computational Linguistics, Helsinki.
- Coates-Stephens, S. (1991). Coping with lexical inadequacy – the automatic acquisition of proper nouns from news text. In The Proceedings of the 7th Annual Conference of the UW Centre for the New OED and Text Research: Using Corpora, pages 154–169, Oxford.
- Cutting, D., J. Kupiec, J. Pedersen, & P. Sibun (1991). A practical part-of-speech tagger. Submitted to The 3rd Conference on Applied Natural Language Processing.
- Grolier (1990). Academic American Encyclopedia. Grolier Electronic Publishing, Danbury, Connecticut.
- Hearst, M. A. (1991). Noun homograph disambiguation using local context in large text corpora. In The Proceedings of the 7th Annual Conference of the UW Centre for the New OED and Text Research: Using Corpora, Oxford.
- Hindle, D. (1990). Noun classification from predicate-argument structures. Proceedings of the 28th Annual Meeting of the Association for Computational Linguistics, pages 268–275.
- Jacobs, P. & U. Zernik (1988). Acquiring lexical knowledge from text: A case study. In: Proceedings of AAAI88, pages 739–744.
- Jensen, K. & J.-L. Binot (1987). Disambiguating prepositional phrase attachments by using on-line dictionary definitions. American Journal of Computational Linguistics, 13(3):251–260.
- Markowitz, J., T. Ahlswede, & M. Evens (1986). Semantically significant patterns in dictionary definitions. Proceedings of the 24th Annual Meeting of the Association for Computational Linguistics, pages 112–119.
- George A. Miller, R. Beckwith, C. Fellbaum, D. Gross, & K. J. Miller (1990). Introduction to WordNet: An on-line lexical database. Journal of Lexicography, 3(4):235–244.
- Morris, J. & Graeme Hirst (1991). Lexical cohesion computed by thesaural relations as an indicator of the structure of text. Computational Linguistics, 17(1):21–48.
- Nakamura, J. & M. Nagao (1988). Extraction of semantic information from an ordinary English dictionary and its evaluation. In: Proceedings of the Twelfth International Conference on Computational Linguistics, pages 459–464, Budapest.
- Smadja, F. A. & Kathleen R. McKeown (1990). Automatically extracting and representing collocations for language generation. Proceedings of the 28th Annual Meeting of the Association for Computational Linguistics, pages 252–259.
- Paola Velardi & M. T. Pazienza (1989). Computer aided interpretation of lexical cooccurrences. Proceedings of the 27th Annual Meeting of the Association for Computational Linguistics, pages 185–192.
- Wilks, Y. A., D. C. Fass, C. ming Guo, J. E. McDonald, T. Plate, & B. M. Slator (1990). Providing machine tractable dictionary tools. Journal of Machine Translation, 2.
| | Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year |
|---|---|---|---|---|---|---|---|---|---|---|
| 1992 AutomaticHyponymAcquisition | Marti Hearst | | | Automatic Acquisition of Hyponyms from Large Text Corpora | | Proceedings of the 14th International Conference on Computational Linguistics | http://people.ischool.berkeley.edu/~hearst/papers/coling92.pdf | 10.3115/992133.992154 | | 1992 |