Linguistic Annotation System
A Linguistic Annotation System is a Annotation System that is based on Natural Language Processing and Natural Language Understanding Systems.
- AKA: Natural Language Annotation System.
- Context:
- It usually integrates Natural Language Processing System-Modules such as:
- an Entity Mention Coreference Resolution System for linking references to same entities;
- a Named Entity Recognition System for indentifying and labeling named entities;
- a Semantic Analysis System for labeling predicate-argument relations ;
- a Syntactic Parsing System for analyzing constituent phrases in a sentence;
- a Part-of-Speech Tagging System for labeling words with word categories;
- a Tokenization System for segmenting text into words;
- a Sentence Boundary Detection System for segmantic text into words,
- It can range from being a Ortographic-Linguistic Annotation System (written language) to being a Phonetic-Linguistic Annotation System (spoken language).
- It can range from being a Morphological Linguistic Annotation System to being Syntactic-Linguistic Annotation System to being a Semantic-Linguistic Annotation System.
- It can range from being a Discourse Linguistic Annotation System to being a Pragmatics-Linguistic Annotation System.
- It usually integrates Natural Language Processing System-Modules such as:
- Example(s):
- Counter-Examples(s):
- See: UIMA, CoNLL File Format, Annotation System, Document Annotation System, Natural Language Processing System, Natural Language Understanding System.
References
2009
- (Wilcock, 2009) ⇒ Graham Wilcock. (2009). “Introduction to Linguistic Annotation and Text Analytics.” In: Synthesis Lectures on Human Language Technologies. Morgan & Claypool. doi:10.2200/S00194ED1V01Y200905HLT003 ISBN:1598297384
- QUOTE:
The different levels of linguistic description can be thought of as layers, as shown in Figure 2.1. Phonology and orthography deal with the smallest units (individual sounds and letters) at the bottom. Morphology, syntax and semantics deal with the medium-sized units (words, phrases and sentences). Discourse and pragmatics deal with the largest units (whole paragraphs and dialogues) at the top.
Fig 2.1 : Levels of Linguistic descriptions.
The current state of the art in linguistic annotation also divides the different annotation tasks into different levels, which can be arranged into a similar set of layers as shown in Figure 2.2. However, there is only an approximate correspondence between the levels of the tasks performed in practical corpus annotation work and the levels of description in linguistic theory.
File:2009 IntroToLinguisticAnnotation Chap2 Fig2.png----Fig 2.2 : Levels of Linguistic annotations.
- QUOTE: