A new approach of using temporal information in Mandarin speech recognition

Jyh Her Yang, Yuan Fu Liao, Yih-Ru Wang, Sin-Horng Chen

Research output: Conference contribution (peer-reviewed)

Abstract

This paper discusses a new approach that uses temporal information to assist Mandarin speech recognition. Two types of temporal information are incorporated into the recognition search. The first is a statistical syllable duration model that accounts for the influence of 411 base-syllables, 5 tones, 4 position-in-word factors, and 3 position-in-sentence factors on syllable duration. The second is timing information that models three types of inter-syllable boundary: intra-word, inter-word without a punctuation mark (PM), and inter-word with a PM. Both types of temporal information are expected to improve segmentation accuracy in acoustic decoding and in linguistic decoding. Experimental results showed that base-syllable, character, and word recognition rates improved slightly on both the MATBN and Treebank databases.

Original language: English
Title of host publication: 3rd International Conference on Speech Prosody 2006
Editors: R. Hoffmann, H. Mixdorff
Publisher: International Speech Communications Association
ISBN (Electronic): 9780000000002
Publication status: Published - 2006
Event: 3rd International Conference on Speech Prosody, SP 2006 - Dresden, Germany
Duration: 2 May 2006 → 5 May 2006

Publication series

Name: Proceedings of the International Conference on Speech Prosody
ISSN (Print): 2333-2042

Conference

Conference: 3rd International Conference on Speech Prosody, SP 2006
Country/Territory: Germany
City: Dresden
Period: 2/05/06 → 5/05/06
