TY - JOUR
T1 - Advanced unsupervised joint prosody labeling and modeling for Mandarin speech and its application to prosody generation for TTS
AU - Chiang, Chen Yu
AU - Chen, Sin-Horng
AU - Wang, Yih-Ru
PY - 2009/9/6
Y1 - 2009/9/6
N2 - Motivated by the success of the unsupervised joint prosody labeling and modeling (UJPLM) method for Mandarin speech on modeling of syllable pitch contour in our previous study, in this paper, the advanced UJPLM (A-UJPLM) method is proposed based on UJPLM to jointly label prosodic tags and model syllable pitch contour, duration and energy level. Experimental results on the Sinica Treebank corpus showed that most prosodic tags labeled were linguistically meaningful and the model parameters estimated were interpretable and generally agreed with other previous study. In virtue of the functions given by the model parameters, an application of A-UJPLM to the prosody generation for Mandarin TTS is proposed. Experimental results showed that the proposed method performed well. Most predicted prosodic features matched well to their original counterparts. This also reconfirmed the effectiveness of the A-UJPLM method.
AB - Motivated by the success of the unsupervised joint prosody labeling and modeling (UJPLM) method for Mandarin speech on modeling of syllable pitch contour in our previous study, in this paper, the advanced UJPLM (A-UJPLM) method is proposed based on UJPLM to jointly label prosodic tags and model syllable pitch contour, duration and energy level. Experimental results on the Sinica Treebank corpus showed that most prosodic tags labeled were linguistically meaningful and the model parameters estimated were interpretable and generally agreed with other previous study. In virtue of the functions given by the model parameters, an application of A-UJPLM to the prosody generation for Mandarin TTS is proposed. Experimental results showed that the proposed method performed well. Most predicted prosodic features matched well to their original counterparts. This also reconfirmed the effectiveness of the A-UJPLM method.
KW - Prosody generation
KW - Prosody labeling
KW - Prosody modeling
KW - Text-to-speech system
UR - http://www.scopus.com/inward/record.url?scp=70450172618&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:70450172618
SN - 2308-457X
SP - 504
EP - 507
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009
Y2 - 6 September 2009 through 10 September 2009
ER -