On cross-dialect and -speaker adaptation of speaking rate-dependent hierarchical prosodic model for a Hakka text-to-speech system

Chen Yu Chiang, Hsiu Min Yu, Sin-Horng Chen

研究成果: Conference article同行評審

1 引文 斯高帕斯(Scopus)

摘要

This paper presents an effective adaptation of an existing speaking rate-dependent hierarchical prosodic model (SRHPM) for Mandarin to construct the SR-HPM for Hakka, another Chinese dialect. Based on the cross-dialectal linguistic similarities in terms of syntactic and prosodic structures, the adaptation is formulated as a maximum a posteriori estimation (MAP) problem with the existing Mandarin SR-HPM serving as an informative prior. In addition, benefiting from the welltrained Mandarin SR-HPM that models the effects of speaking rate (SR) on prosodic-acoustic features, the SR-HPM developed for Hakka could generate satisfactory prosody in various SRs. The performance of the approach proposed in this study was evaluated by an experiment of prosody generation for a SR-controlled Hakka text-to-speech system, in which the Hakka SR-HPM is trained by a Hakka corpus that is small in size and read in narrow SR. Results show that the generated Hakka prosody was judged to be quite natural by native Hakka speakers for SR varying from 3.3 syllables/sec to 6.7 syllables/sec.

原文English
頁(從 - 到)786-790
頁數5
期刊Proceedings of the International Conference on Speech Prosody
2016-January
DOIs
出版狀態Published - 2016
事件8th Speech Prosody 2016 - Boston, 美國
持續時間: 31 5月 20163 6月 2016

指紋

深入研究「On cross-dialect and -speaker adaptation of speaking rate-dependent hierarchical prosodic model for a Hakka text-to-speech system」主題。共同形成了獨特的指紋。

引用此