A new model-based mandarin-speech coding system

Chen Yu Chiang*, Jyh Her Yang, Ming Chieh Liu, Yih-Ru Wang, Yuan Fu Liao, Sin-Horng Chen

*此作品的通信作者

研究成果: Conference article同行評審

2 引文 斯高帕斯(Scopus)

摘要

In this paper, a new model-based Mandarin-speech coding system is proposed. It employs a prosody-enriched ASR with a hierarchical prosodic model (HPM) to generate from the input speech enriched transcriptions, including linguistic features, prosodic tags and spectral parameters in the encoder. By sending these features to the decoder, we can first reconstruct the prosodic-acoustic features of syllable pitch contour, syllable duration, syllable energy level, and inter-syllable pause duration by HPM using the linguistic features and prosodic tags; and then combined with spectral parameters to reconstruct the input speech signal by an HMM-based speech synthesizer. Experimental results show that the reconstructed speech has good quality at a low data rate of 543 bits/s.

原文English
頁(從 - 到)2561-2564
頁數4
期刊Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
出版狀態Published - 27 8月 2011
事件12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011 - Florence, 意大利
持續時間: 27 8月 201131 8月 2011

指紋

深入研究「A new model-based mandarin-speech coding system」主題。共同形成了獨特的指紋。

引用此