Parsing Task
A Parsing Task is a Information Processing Task that maps a generated string sequence to a structured representation.
- AKA: Syntax Analysis Task, Syntactic Analysis Task.
- Context:
- Task Input: a String Sequence in Natural Language, Formal Language, or Data Structures.
- optional: a Formal Model.
- Task Output: a Populated Data Structure.
- Task Requirement(s): Parsing Method.
- It can be solved by Parsing System that implements a Parsing Algorithm.
- It can range from being a Shallow Parsing Task to being a Deep Parsing Task.
- It can range from being a Syntactic Parsing Task to being a Semantic Parsing Task.
- It can range from being a Top-Down Parsing Task to being a Bottom-Up Parsing Task.
- It can range from being a Table-Driven Parsing Task to being an Adaptive Parsing Task.
- Task Input: a String Sequence in Natural Language, Formal Language, or Data Structures.
- Example(s):
- a Bottom-Up Parsing Task such as:
- a Data Structures Parsing Task,
- a Discourse Parsing Task,
- a Formal Language Parsing Task such as:
- an Image Parsing Task,
- a Linguistic Syntactic Parsing Task,
- a Lookahead Parsing Task,
- a Morphological Parsing Task,
- a Natural Language Parsing Task,
- a Speech Parsing Task,
- a Text Parsing Task,
- a Top-Down Parsing Task as:
- …
- Counter-Example(s):
- an Automatic Summarization Task,
- a Chunking Task,
- a Coreference Resolution Task,
- a Grammar Induction Task,
- a Lemmatization Task,
- a Linguistic Structural Disambiguation Task,
- a Machine Translation Task,
- a Morphological Segmentation Task,
- a Named Entity Recognition (NER) Task,
- a Part-Of-Speech (POS) Tagging Task,
- a Question Answering Task,
- a Relationship Extraction Task,
- a Sentiment Analysis Task,
- a Stemming Task,
- a Text Segmentation Task,
- a Textual Entailment Recognition Task,
- a Terminology Extraction Task.
- See: Natural Language Processing Task, Natural Language Generation Task, Natural Language Understanding Task, Text Processing Task, Text Error Correction Task, Reverse Engineering Task, Reading Task, Optical Character Recognition (OCR) Task, Topic Segmentation and Recognition Task, Word Sense Disambiguation Task.
References
2020
- (Wikipedia, 2020) ⇒ https://en.wikipedia.org/wiki/Parsing Retrieved:2020-2-29.
- Parsing, syntax analysis, or syntactic analysis is the process of analyzing a string of symbols, either in natural language, computer languages or data structures, conforming to the rules of a formal grammar. The term parsing comes from Latin pars (orationis), meaning part (of speech).[1]
The term has slightly different meanings in different branches of linguistics and computer science. Traditional sentence parsing is often performed as a method of understanding the exact meaning of a sentence or word, sometimes with the aid of devices such as sentence diagrams. It usually emphasizes the importance of grammatical divisions such as subject and predicate.
Within computational linguistics the term is used to refer to the formal analysis by a computer of a sentence or other string of words into its constituents, resulting in a parse tree showing their syntactic relation to each other, which may also contain semantic and other information. Some parsing algorithms may generate a parse forest or list of parse trees for a syntactically ambiguous input.[2]
The term is also used in psycholinguistics when describing language comprehension. In this context, parsing refers to the way that human beings analyze a sentence or phrase (in spoken language or text) "in terms of grammatical constituents, identifying the parts of speech, syntactic relations, etc." This term is especially common when discussing what linguistic cues help speakers to interpret garden-path sentences.
Within computer science, the term is used in the analysis of computer languages, referring to the syntactic analysis of the input code into its component parts in order to facilitate the writing of compilers and interpreters. The term may also be used to describe a split or separation.
- Parsing, syntax analysis, or syntactic analysis is the process of analyzing a string of symbols, either in natural language, computer languages or data structures, conforming to the rules of a formal grammar. The term parsing comes from Latin pars (orationis), meaning part (of speech).[1]
- ↑ "Parse". dictionary.reference.com. Retrieved 27 November 2010.
- ↑ Masaru Tomita (6 December 2012). "Generalized LR Parsing". Springer Science & Business Media. ISBN 978-1-4615-4034-2.
2017
- (WikiBooks, 2017) ⇒ https://en.wikibooks.org/wiki/Compiler_Construction/Syntax_Analysis
- QUOTE: This is alternatively known as parsing. It is roughly the equivalent of checking that some ordinary text written in a natural language (e.g. English) is grammatically correct (without worrying about meaning).
The purpose of syntax analysis or parsing is to check that we have a valid sequence of tokens. Tokens are valid sequence of symbols, keywords, identifiers etc. Note that this sequence need not be meaningful; as far as syntax goes, a phrase such as "true + 3" is valid but it doesn't make any sense in most programming languages.
The parser takes the tokens produced during the lexical analysis stage, and attempts to build some kind of in-memory structure to represent that input. Frequently, that structure is an 'abstract syntax tree' (AST).
The parser needs to be able to handle the infinite number of possible valid programs that may be presented to it. The usual way to define the language is to specify a grammar. A grammar is a set of rules (or productions) that specifies the syntax of the language (i.e. what is a valid sentence in the language).
- QUOTE: This is alternatively known as parsing. It is roughly the equivalent of checking that some ordinary text written in a natural language (e.g. English) is grammatically correct (without worrying about meaning).
2009
- (foldoc, 2009) ⇒ http://foldoc.org/index.cgi?grammar
- QUOTE: Parsing assigns a terminal syntactic category to each input token and a non-terminal category to each appropriate group of tokens, up to the level of the whole sentence. Parsing is usually preceded by lexical analysis.