Japanese Speech-to-Text Transcription Task
Jump to navigation
Jump to search
A Japanese Speech-to-Text Transcription Task is a speech recognition task that is a Japanese NLP task.
References
2003
- (Kawahara et al., 2003) ⇒ Tatsuya Kawahara, Hiroaki Nanjo, Takahiro Shinozaki, and Sadaoki Furui. (2003). “Benchmark Test for Speech Recognition Using the Corpus of Spontaneous Japanese.” In: ISCA \& IEEE Workshop on Spontaneous Speech Processing and Recognition.
- QUOTE: We present benchmark results of automatic speech recognition using the Corpus of Spontaneous Japanese (CSJ), which has been developed in the five-year national project and will be the largest spontaneous speech databases. New test-sets are designed for both academic presentation speech and extemporaneous public speech, which are the two major categories in the corpus. The test-sets are selected to cover the variation of acoustic and linguistic factors in spontaneous speech: word perplexity, degree of disfluency, and the speaking rate. Baseline acoustic and language models are set up using an almost complete set (500 hours and 6.67M words) of the CSJ.