Out-Of-Vocabulary (OOV) Subword

From GM-RKB



An Out-Of-Vocabulary (OOV) Subword is a Subword Unit that does not appear in the training subword vocabulary.

AKA: Out-Of-Vocabulary (OOV) Subtoken.
Context:
- It can be modeled by an Out-Of-Vocabulary (OOV) Subword Modeling Task.
Example(s):
- a subword that has not been seen during training.
- a subtoken that has not been seen during training.
- an Out-Of-Vocabulary (OOV) Symbol.
- an Out-Of-Vocabulary (OOV) Character.
Counter-Example(s):
- an Out-Of-Vocabulary (OOV) Word.
- an In-Vocabulary Word.
See: Word Segmentation System, Subword Embedding, Vocabulary Word, Lexicon Word, Abbreviated Word, Unsupervised Transliteration Model, Lexical Normalization Task.
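
The definition above can be illustrated with a minimal Python sketch. It assumes the text has already been segmented into subword units and that the training subword vocabulary is available as a set. The function names, the `##` continuation prefix, the `<unk>` placeholder, and the vocabulary contents are all hypothetical and chosen only for illustration; they are not taken from any particular system.

```python
def find_oov_subwords(subwords, vocab):
    """Return the subword units that are absent from the training vocabulary."""
    return [s for s in subwords if s not in vocab]


def replace_oov_subwords(subwords, vocab, unk="<unk>"):
    """Map every OOV subword to a placeholder symbol (hypothetical '<unk>')."""
    return [s if s in vocab else unk for s in subwords]


# Hypothetical training subword vocabulary (illustrative only).
train_vocab = {"un", "##break", "##able", "token", "##s"}

# Hypothetical segmentation of unseen text into subword units.
segmented = ["un", "##break", "##able", "##ish", "token", "##z"]

print(find_oov_subwords(segmented, train_vocab))     # ['##ish', '##z']
print(replace_oov_subwords(segmented, train_vocab))  # OOV units become '<unk>'
```

Here `##ish` and `##z` are OOV subwords because they never occur in the training subword vocabulary, while the remaining units are in-vocabulary.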

References

2017a

(Goldberg, 2017) ⇒ Yoav Goldberg. (2017). “Neural Network Methods for Natural Language Processing.” In: Synthesis Lectures on Human Language Technologies, 10(1). doi:10.2200/S00762ED1V01Y201703HLT037

2017b

(Ruder, 2017) ⇒ Sebastian Ruder. (2017). “Word embeddings in 2017: Trends and future directions.”

2017c

(Dhingra et al., 2017) ⇒ Bhuwan Dhingra, Hanxiao Liu, Ruslan Salakhutdinov, and William W. Cohen. (2017). “A Comparative Study of Word Embeddings for Reading Comprehension.” In: arXiv, abs/1703.00993.

2017d

(Herbelot & Baroni, 2017) ⇒ Aurelie Herbelot, and Marco Baroni. (2017). “High-risk learning: acquiring new word vectors from tiny data.” In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP 2017).

2017e

(Pinter et al., 2017) ⇒ Yuval Pinter, Robert Guthrie, and Jacob Eisenstein. (2017). “Mimicking Word Embeddings Using Subword RNNs.” In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP 2017).

2017f

(See et al., 2017) ⇒ Abigail See, Peter J. Liu, and Christopher D. Manning. (2017). “Get To The Point: Summarization with Pointer-Generator Networks.” In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). DOI:10.18653/v1/P17-1099.

2014

(Durrani et al., 2014) ⇒ Nadir Durrani, Hassan Sajjad, Hieu Hoang, and Philipp Koehn. (2014). “Integrating An Unsupervised Transliteration Model Into Statistical Machine Translation.” In: Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014).

2000

(Bazzi & Glass, 2000) ⇒ Issam Bazzi, and James R. Glass. (2000). “Modeling Out-Of-Vocabulary Words For Robust Speech Recognition.” In: Sixth International Conference on Spoken Language Processing (ICSLP 2000) / (INTERSPEECH 2000).
