Grapheme
A Grapheme is a linguistic symbol within a writing system that functions as the fundamental written unit of language representation through visual encoding of linguistic information.
- AKA: Writing System Character, Written Linguistic Symbol, Orthographic Unit, Script Element, Written Language Unit.
- Context:
- It can typically be a member of a Language Grapheme Set with grapheme inventory characteristics specific to that language system.
- It can typically be used by a Linguistic Agent to compose Written Utterances through grapheme combination patterns.
- It can typically convey Linguistic Information through grapheme interpretation processes in grapheme reading activity.
- It can typically form Grapheme Sequences that encode grapheme-based language structure and grapheme-encoded meaning.
- It can typically be distinguished from other graphemes by its grapheme visual form and grapheme functional role.
- It can typically have Unicode Representation through grapheme code point assignment in digital text systems.
- It can typically enable Optical Character Recognition through grapheme pattern recognition in document digitization.
- It can typically support Cross-Platform Communication through grapheme encoding standards across computing systems.
- ...
- It can often be represented by different Glyphs through grapheme visualization variation in various font systems and typographic styles.
- It can often correspond to a Phonological Unit in spoken language through grapheme-phoneme correspondence rules.
- It can often vary across Writing Cultures with different grapheme classification systems and grapheme usage conventions.
- It can often be part of a Grapheme String, such as an Orthographic Word, through grapheme sequencing rules.
- It can often undergo Historical Evolution through grapheme development processes in writing traditions.
- It can often serve as Machine Learning Feature in grapheme-based NLP models for text processing tasks.
- It can often require Rendering Engine support for grapheme display accuracy in digital interfaces.
- It can often participate in Accessibility Technology through grapheme description protocols for screen readers.
- ...
- It can range from being a Simple Grapheme to being a Complex Grapheme, depending on its grapheme structural composition.
- It can range from being a Phonetic Grapheme to being a Logographic Grapheme, depending on its grapheme representational function.
- It can range from being an Ancient Grapheme to being a Modern Grapheme, depending on its grapheme historical period.
- It can range from being a Standard Grapheme to being a Variant Grapheme, depending on its grapheme orthographic status.
- It can range from being a Monolingual Grapheme to being a Multilingual Grapheme, depending on its grapheme language distribution.
- It can range from being a Print-Optimized Grapheme to being a Screen-Optimized Grapheme, depending on its grapheme display medium.
- It can range from being a Human-Readable Grapheme to being a Machine-Readable Grapheme, depending on its grapheme processing context.
- ...
- It can have a specific position in a Character Encoding System through grapheme digitization standards for computational representation.
- It can follow specific Orthographic Rules in grapheme usage patterns within a writing convention.
- It can facilitate Written Communication through grapheme recognition processes by human readers and machine readers.
- It can be analyzed in Linguistic Research through grapheme identification methodologies.
- It can enable Text Mining through grapheme feature extraction in computational linguistics.
- It can support Language Learning through grapheme acquisition processes in literacy development.
- ...
- Examples:
- Grapheme Categories, such as:
- Alphabetic Letter Graphemes (members of a Language Alphabet), such as:
- Latin Alphabetic Graphemes, such as:
- Basic Latin Graphemes: Any Member of: {a, b, c, ..., y, z}, in English Writing System for phoneme representation.
- Extended Latin Graphemes: Any Member of: {ä, ö, ü, ß, ñ, ç}, in European Language Writing Systems for specific sound indication.
- Greek Alphabetic Graphemes, such as:
- Any Member of: {α, β, γ, δ, ..., ψ, ω}, in Greek Writing System for Greek phoneme encoding.
- Cyrillic Alphabetic Graphemes, such as:
- Any Member of: {а, б, в, г, ..., я}, in Slavic Writing Systems for Slavic sound representation.
- Arabic Alphabetic Graphemes, such as:
- Any Member of: {ا, ب, ت, ث, ..., ي}, in Arabic Writing System for consonant and vowel marking.
- Latin Alphabetic Graphemes, such as:
- Punctuation Graphemes, such as:
- Sentence Punctuation Graphemes, such as:
- Delimiter Punctuation Graphemes, such as:
- Whitespace Graphemes, such as:
- Numeral Graphemes, such as:
- Western Arabic Numeral Graphemes: {0, 1, 2, ..., 9} for decimal number representation.
- Eastern Arabic Numeral Graphemes: {٠, ١, ٢, ..., ٩} for Arabic number notation.
- Roman Numeral Graphemes: {I, V, X, L, C, M} for classical number notation.
- Chinese Numeral Graphemes: {一, 二, 三, ..., 十} for Chinese number system.
- Symbol Graphemes, such as:
- Mathematical Symbol Graphemes, such as:
- Operator Symbol Graphemes: {+, -, ×, ÷, =} for mathematical operations.
- Set Theory Symbol Graphemes: {∈, ∪, ∩, ⊂} for set relations.
- Currency Symbol Graphemes, such as:
- Major Currency Graphemes: {$, €, £, ¥, ₹} for monetary units.
- Cryptocurrency Symbol Graphemes: {₿, Ξ} for digital currencys.
- Technical Symbol Graphemes, such as:
- Programming Symbol Graphemes: {<, >, &, |, ~} for code syntax.
- Trademark Symbol Graphemes: {™, ®, ©} for legal markings.
- Mathematical Symbol Graphemes, such as:
- Logographic Graphemes, such as:
- CJK Ideographic Graphemes, such as:
- Chinese Character Graphemes: {人, 水, 火, 山} for concept representation.
- Japanese Kanji Graphemes: {愛, 夢, 希望} for Japanese concept encoding.
- Korean Hanja Graphemes: {韓, 國, 語} for Korean classical writing.
- Emoji Graphemes, such as:
- Face Emoji Graphemes: {😀, 😢, 😍} for emotion expression.
- Object Emoji Graphemes: {🚗, 🏠, 📱} for object representation.
- Symbol Emoji Graphemes: {❤️, ✓, ⚡} for symbolic communication.
- CJK Ideographic Graphemes, such as:
- Diacritic Graphemes, such as:
- Alphabetic Letter Graphemes (members of a Language Alphabet), such as:
- ...
- Grapheme Categories, such as:
- Counter-Examples:
- Computer Character, which functions as a computational encoding unit rather than a linguistic symbol in a natural language writing system.
- Phoneme, which exists as a sound unit in spoken language rather than a visual unit in written language.
- Pictogram, which directly represents an object or concept through visual resemblance rather than through conventional grapheme symbol system.
- Font Glyph, which provides a specific visual rendering of a grapheme rather than being the abstract grapheme itself.
- Character Encoding, which is a computational representation scheme for graphemes rather than the grapheme unit itself.
- Gesture, which conveys meaning through physical movement rather than through written grapheme symbols.
- See: Writing System, Orthography, Typography, Unicode Standard, Character Encoding, Optical Character Recognition, Natural Language Processing, Graphemics, Phoneme, Morpheme, Script (Writing System), Glyph.
References
2020
- (Wikipedia, 2020) ⇒ https://en.wikipedia.org/wiki/grapheme Retrieved:2020-3-8.
- In linguistics, a grapheme is the smallest unit of a writing system of any given language. [1] An individual grapheme may or may not carry meaning by itself, and may or may not correspond to a single phoneme of the spoken language. In other words, a grapheme is a letter or a set of letters that represent a sound (more correctly, phoneme) in a word. [2] Graphemes include alphabetic letters, typographic ligatures, Chinese characters, numerical digits, punctuation marks, and other individual symbols. A grapheme can also be construed as a graphical sign that independently represents a portion of linguistic material. [3]
The word grapheme, coined in analogy with phoneme, is derived , and the suffix -eme by analogy with phoneme and other names of emic units. The study of graphemes is called graphemics.
The concept of graphemes is abstract and similar to the notion in computing of a character. By comparison, a specific shape that represents any particular grapheme in a specific typeface is called a glyph. For example, the grapheme corresponding to the abstract concept of "the Arabic numeral one" has a distinct glyph with identical meaning (an allograph) in each of many typefaces (such as, for example, a serif form as in Times New Roman and a sans-serif form as in Helvetica).
- In linguistics, a grapheme is the smallest unit of a writing system of any given language. [1] An individual grapheme may or may not carry meaning by itself, and may or may not correspond to a single phoneme of the spoken language. In other words, a grapheme is a letter or a set of letters that represent a sound (more correctly, phoneme) in a word. [2] Graphemes include alphabetic letters, typographic ligatures, Chinese characters, numerical digits, punctuation marks, and other individual symbols. A grapheme can also be construed as a graphical sign that independently represents a portion of linguistic material. [3]
- ↑ Coulmas, F. (1996), The Blackwell's Encyclopedia of Writing Systems, Oxford: Blackwells, p.174
- ↑ What is a grapheme?. Phonic books.
- ↑ Altmann, G., & Fengxiang, F. (Eds.). (2008). Analyses of script : properties of characters and writing systems. https://www.degruyter.com/view/product/34314