2008 NounPhraseBracketingGuidelinesV1
Jump to navigation
Jump to search
- (Vadas, 2008) ⇒ David Vadas. (2008). “Noun Phrase Bracketing Guidelines, Version 1.0.” The University of Sydney, School of Information Technologies.
Subject Headings: NP Bracketing Task.
Notes
- Version 0.9 Guidelines: http://www.cs.usyd.edu.au/~dvadas1/?download=guidelines_0.9.pdf
- The paper that used v0.9 of this data: (Vadas & Curran, 2007) ⇒ David Vadas, and James R. Curran. (2007). “Adding Noun Phrase Structure to the Penn Treebank.” In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL-2007).
Quotes
1. Introduction
- This document describes guidelines for bracketing noun phrase (NP) structure in the Penn Treebank. These guidelines are in addition to the Treebank II Guidelines (Bies et al., 1995). They are also based on, and overlap with the Addendum for BioMedical Annotation (Warner et al., 2004).
An earlier version (0.9) of these guidelines were used in the annotation described in Vadas and Curran (2007), while this version was used in a subsequent pass over the data.
2 Bracketing NPs
- The goal of our annotation is to identify and bracket multi-token premodifiers in NPs. Quirk et al. (1985, p. 1321) describes such premodifiers, which include adjectives, participles, nouns, genitives and adverbs. All of these items are modifiable themselves, and this is precisely the behaviour that we have annotated.
- “dark red car”
- current ⇒ (NP (JJ dark) (JJ red) (NN car) )
- proposed ⇒ (NP (JJP (JJ dark) (JJ red) ) (NN car) )
- “the fastest developing trends”
- current ⇒ (NP (DT the) (JJS fastest) (VBG developing) (NNS trends))
- proposed ⇒ (NP (DT the) (JJP (JJS fastest) (VBG developing) ) (NNS trends))
- “relatively higher rates”
- current ⇒ (NP (RB relatively) (JJR higher) (NNS rates))
- proposed ⇒ (NP (JJP (RB relatively) (JJR higher) ) (NNS rates))
- Coordinated Adjectival
When two or three adjectives of the same class are used, they are called "coordinated adjectives".
(EX) Her [simple but expensive] long silk dress (EX) The [red, white and blue] French flag.
bright and incredibly sharp
References
- Ann Bies, Mark Ferguson, Karen Katz, and Robert MacIntyre. (1995). Bracketing guidelines for Treebank II style: Penn Treebank project. Technical report.
- Michael Collins. (1999). Head-Driven Statistical Models for Natural Language Parsing. Ph.D. thesis, University of Pennsylvania.
- Randolph Quirk, Sidney Greenbaum, Geoffrey Leech, and Jan Svartvik. (1985). A Comprehensive Grammar of the English Language. Longman, London.
- David Vadas and James R. Curran. (2007). Adding noun phrase structure to the Penn Treebank. In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL-07). (to appear).
- Colin Warner, Ann Bies, Christine Brisson, and Justin Mott. (2004). Addendum to the Penn Tree-bank II style bracketing guidelines: BioMedical Treebank annotation. Technical report.
,
Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year | |
---|---|---|---|---|---|---|---|---|---|---|
2008 NounPhraseBracketingGuidelinesV1 | David Vadas | Noun Phrase Bracketing Guidelines, Version 1.0 | http://www.cs.usyd.edu.au/~dvadas1/?download=guidelines1.0.pdf | 2008 |