Out-Of-Vocabulary (OOV) Subword
(Redirected from Out-Of-Vocabulary (OOV) Subtoken)
Jump to navigation
Jump to search
A Out-Of-Vocabulary (OOV) Subword is a Subword Unit that does not appear in training subword vocabulary.
- AKA:Out-Of-Vocabulary (OOV) Subtoken.
- Context:
- It can be model by Out-Of-Vocabulary (OOV) Subword Modeling Task.
- Example(s):
- a subword that has not been seen during training.
- a subtoken that has not been seen during training.
- an Out-Of-Vocabulary (OOV) Symbol.
- a Out-Of-Vocabulary (OOV) Character.
- Example(s):
- See: Word Segmentation System, Subword Embedding, Vocabulary Word, Lexicon Word, Abbreviated Word, Unsupervised Transliteration Model, Lexical Normalization Task.
References
2017a
- (Goldberg, 2017) ⇒ Yoav Goldberg. (2017). “Neural Network Methods for Natural Language Processing.” In: Synthesis Lectures on Human Language Technologies, 10(1). doi:10.2200/S00762ED1V01Y201703HLT037
2017b
- (Ruder, 2017) ⇒ Sebastian Ruder (2017). "Word embeddings in 2017: Trends and future directions".
2017c
- (Dhingra et al., 2017) ⇒ Bhuwan Dhingra, Hanxiao Liu, Ruslan Salakhutdinov, and William W. Cohen. (2017). “A Comparative Study of Word Embeddings for Reading Comprehension.” In: arXiv, abs/1703.00993.
2017d
- (Herbelot & Baroni, 2017) ⇒ Aurelie Herbelot, and Marco Baroni (2017). "High-risk learning: acquiring new word vectors from tiny data". In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP 2017).
2017e
- (Pinter et al., 2017) ⇒ Yuval Pinter, Robert Guthrie, and Jacob Eisenstein. (2017). “Mimicking Word Embeddings Using Subword RNNs.” In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP 2017).
2017f
- (See et al., 2017) ⇒ Abigail See, Peter J. Liu, and Christopher D. Manning. (2017). “Get To The Point: Summarization with Pointer-Generator Networks.” In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). DOI:10.18653/v1/P17-1099.
2014
- (Durrani et al., 2014) ⇒ Nadir Durrani, Hassan Sajjad, Hieu Hoang, and Philipp Koehn. (2014). “Integrating An Unsupervised Transliteration Model Into Statistical Machine Translation.” In: Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014).
2000
- (Bazzi & Glass, 2000) ⇒ Issam Bazzi, and James R. Glass. (2000). “Modeling Out-Of-Vocabulary Words For Robust Speech Recognition.” In: Sixth International Conference on Spoken Language Processing (ICSLP 2000) / (INTERSPEECH 2000).