Linguistic Dataset
Jump to navigation
Jump to search
A Linguistic Dataset is a dataset in which data points are linguistic units.
- AKA: Lexical Dataset, Language Data.
- Context:
- It can range from being a Natural Language Dataset to being a Formal Language Dataset.
- It can range from being a Text-based Linguistic Dataset to being a Linguistic Audio Dataset.
- …
- Example(s):
- Counter-Example(s):
- See: Language Model, Natural Language, Formal Language, Markup Language, Lexicon, Vocabulary.
References
2023
- "Language Data Analyst." Language Data Analyst, Job Description.
- QUOTE: As a language data analyst at AI21Labs, you will take part in the process of developing AI based text-interaction technology with focus on meticulous empirical work guided by insights about human linguistic knowledge and behavior.
- ...
Identify systematic patterns in language data by applying usage-based and theoretical generalizations