TY - JOUR
T1 - A new model-based Mandarin-speech coding system
AU - Chiang, Chen Yu
AU - Yang, Jyh Her
AU - Liu, Ming Chieh
AU - Wang, Yih-Ru
AU - Liao, Yuan Fu
AU - Chen, Sin-Horng
PY - 2011/8/27
Y1 - 2011/8/27
N2 - In this paper, a new model-based Mandarin-speech coding system is proposed. In the encoder, it employs a prosody-enriched ASR with a hierarchical prosodic model (HPM) to generate enriched transcriptions from the input speech, including linguistic features, prosodic tags, and spectral parameters. These features are sent to the decoder, which first uses the HPM, with the linguistic features and prosodic tags, to reconstruct the prosodic-acoustic features of syllable pitch contour, syllable duration, syllable energy level, and inter-syllable pause duration; these are then combined with the spectral parameters to reconstruct the input speech signal with an HMM-based speech synthesizer. Experimental results show that the reconstructed speech has good quality at a low data rate of 543 bits/s.
AB - In this paper, a new model-based Mandarin-speech coding system is proposed. In the encoder, it employs a prosody-enriched ASR with a hierarchical prosodic model (HPM) to generate enriched transcriptions from the input speech, including linguistic features, prosodic tags, and spectral parameters. These features are sent to the decoder, which first uses the HPM, with the linguistic features and prosodic tags, to reconstruct the prosodic-acoustic features of syllable pitch contour, syllable duration, syllable energy level, and inter-syllable pause duration; these are then combined with the spectral parameters to reconstruct the input speech signal with an HMM-based speech synthesizer. Experimental results show that the reconstructed speech has good quality at a low data rate of 543 bits/s.
KW - Enriched transcriptions
KW - Hierarchical prosodic model
KW - Model-based speech coding
KW - Prosody-enriched ASR
UR - http://www.scopus.com/inward/record.url?scp=84865726869&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:84865726869
SN - 2308-457X
SP - 2561
EP - 2564
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
Y2 - 27 August 2011 through 31 August 2011
ER -