1992 SurfaceGramAnForTheExtrOfTermNPs
- (Bourigault, 1992) ⇒ Didier Bourigault. (1992). “Surface Grammatical Analysis for the Extraction of Terminological Noun Phrases.” In: Proceedings of the Fifteenth International Conference on Computational Linguistics. doi:10.3115/993079.993111
Subject Headings: Terminological Unit Recognition Task, Terminological Noun Phrase, NP Chunking Task, LEXTER
Notes
Cited By
2000
- (Frantzi et al., 2000) ⇒ Katerina Frantzi, Sophia Ananiadou, and Hideki Mima. (2000). “Automatic Recognition of Multi-Word Terms: The Cvalue/NC-value method.” In: International Journal on Digital Libraries, 3(2). doi:10.1007/s007999900023
1995
- (Ramshaw & Marcus, 1995) ⇒ Lance A. Ramshaw, and Mitch P. Marcus. (1995). “Text Chunking Using Transformation-based Learning.” In: Proceedings of the Third ACL Workshop on Very Large Corpora (WVLC 1995).
- On the grammar-based side, [[1992_SurfaceGramAnForTheExtrOfTermNPs|Bourigault (1992)]] describes a system for extracting “terminological noun phrases” from French text. This system first uses heuristics to find “maximal length noun phrases", and then uses a grammar to extract “terminological units.” For example, from the maximal NP le disque dur de la station de travail it extracts the two terminological phrases disque dur, and station de travail. Bourigault claims that the grammar can parse "around 95% of the maximal length noun phrases" in a test corpus into possible terminological phrases, which then require manual validation. However, because its goal is terminological phrases, it appears that this system ignores NP chunk-initial determiners and other initial prenominal modifiers, somewhat simplifying the parsing task.
Quotes
Abstract
LEXTER is a software package for extracting terminology. A corpus of French language texts on any subject field is fed in, and LEXTER produces a list of likely terminological units to be submitted to an expert to be validated. To identify the terminological units, LEXTER takes their form into account and proceeds in two main stages: analysis, parsing. In the first stage, LEXTER uses a base of rules designed to indentify frontier markers in view to analysing the texts and extracting maximal-length noun phrases. In the second stage, LEXTER parses these maximal-length noun phrases to extract subgroups which by virtue of their grammatical structure and their place in the maximal-length noun phrases are likely to be terminological units. In this article, the type of analysis used (surface grammatical analysis) is highlighted, as the methodological approach adopted to adapt the rules (experimental approach).
,
Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year | |
---|---|---|---|---|---|---|---|---|---|---|
1992 SurfaceGramAnForTheExtrOfTermNPs | Didier Bourigault | Surface Grammatical Analysis for the Extraction of Terminological Noun Phrases | Proceedings of the Fifteenth International Conference on Computational Linguistics | 10.3115/993079.993111 | 1992 |