2002 TowardsBestPracticeForMWEinCompLex
- (Calzolari et al., 2002) ⇒ Nicoletta Calzolari, Charles J. Fillmore, Ralph Grishman, Nancy Ide, Alessandro Lenci, Catherine MacLeod, Antonio Zampolli. (2002). “Towards Best Practice for Multiword Expressions in Computational Lexicons.” In: Proceedings of the 3rd International Conference on Language Resources and Evaluation (LREC 2002).
Subject Headings: Compound Word
Notes
- It uses the term Multiword Expression.
- It defines MWE as a sequence of words that acts as a single unit at some level of linguistic analysis. In addition, they exhibit some or all of the following behaviors:
- 1. reduced syntactic and semantic transparency;
- 2. reduced or lack of compositionality;
- 3. more or less frozen or fixed status;
- 4. possible violation of some otherwise general syntactic patterns or rules;
- 5. a high degree of lexicalization (depending on pragmatic factors);
- 6. a high degree of conventionality.
Cited By
2005
- (Villavicencio et al., 2005) ⇒ Aline Villavicencio, Francis Bonda, Anna Korhonena, and Diana McCarthya. (2005). “Introduction to the Special Issue on Multiword Expressions: Having a crack at a hard nut.” In: Special issue on Multiword Expression, Computer Speech & Language, 19(4). doi:10.1016/j.csl.2005.05.001
Quotes
Abstract
The importance and role of multi-word expressions (MWE) in the description and processing of natural language has been long recognized. However, multi-word information has often been relegated to the marginal role of idiosyncratic lexical information. The need for MWE lexicons grows even more acute for multi-lingual applications, for which (sometimes complex) correspondences must be identified, classified, and recorded. Within the XMELLT and ISLE projects we have started to investigate the potential to develop multi-lingual, multi-word expression lexicons incorporating both syntactic and semantic information. We aim at specifying means to acquire and represent multi-word lexical entries for multiple languages, and establishing uniform (or inter-translatable) standards for describing multi-word lexical entries. We explored theoretical approaches used in large lexicon-building projects, in particular FrameNet and SIMPLE. They constitute interesting frameworks for the explicit syntactic and semantic representation of MWEs, due mainly to their ability to capture semantic multidimensionality, through frame elements and qualia relations respectively. We also developed an abstract data model for lexical information together with a representation in XML for it. Our goal is to define a set of minimal lexicon “objects”, which can serve not only as a model for MWEs but also for lexical data in general.
2. What is an MWE?
In different theoretical or practical contexts the term multiword expression (MWE) is used to describe different but related phenomena, including fixed or semi-fixed phrases, compounds, support verbs, idioms, phrasal verbs, collocations, etc. At the level of greatest generality, all of these phenomena can be described as a sequence of words that acts as a single unit at some level of linguistic analysis. In addition, they exhibit some or all of the following behaviors:
- . reduced syntactic and semantic transparency;
- . reduced or lack of compositionality;
- . more or less frozen or fixed status;
- . possible violation of some otherwise general syntactic patterns or rules;
- . a high degree of lexicalization (depending on pragmatic factors);
- . a high degree of conventionality.
,