2012 FourTypesofContextforAutomaticS
- (Flor, 2012) ⇒ Michael Flor. (2012). “Four Types of Context for Automatic Spelling Correction.” In: Traitement Automatique des Langues (TAL), 53(3).
Subject Headings: Spelling Error Correction System, Language Model, N-Gram.
Notes
- Article versions and URLs:
Cited By
- Google Scholar: ~ 24 Citations.
- Semantic Scholar: ~ 17 Citations
2020
- (Melli et al., 2020) ⇒ Gabor Melli, Abdelrhman Eldallal, Bassim Lazem, and Olga Moreira. (2020). “GM-RKB WikiText Error Correction Task and Baselines.”. In: Proceedings of LREC 2020 (LREC-2020).
Quotes
Author Keywords
Abstract
This paper presents an investigation on using four types of contextual information for improving the accuracy of automatic correction of single-token non-word misspellings. The task is framed as contextually-informed re-ranking of correction candidates. Immediate local context is captured by word n-grams statistics from a Web-scale language model. The second approach measures how well a candidate correction fits in the semantic fabric of the local lexical neighborhood, using a very large Distributional Semantic Model. In the third approach, recognizing a misspelling as an instance of a recurring word can be useful for reranking. The fourth approach looks at context beyond the text itself. If the approximate topic can be known in advance, spelling correction can be biased towards the topic. Effectiveness of proposed methods is demonstrated with an annotated corpus of 3,000 student essays from international high-stakes English language assessments. The paper also describes an implemented system that achieves high accuracy on this task.
References
;
Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year | |
---|---|---|---|---|---|---|---|---|---|---|
2012 FourTypesofContextforAutomaticS | Michael Flor | Four Types of Context for Automatic Spelling Correction. |