A High-Performance Min-Nan/Taiwanese TTS System

Wei Chih Kuo*, Xiang Rui Zhong, Yih-Ru Wang, Sin-Horng Chen

*此作品的通信作者

研究成果: Conference article同行評審

5 引文 斯高帕斯(Scopus)

摘要

In this paper, the implementation of a high-performance Min-Nan/Taiwanese TTS system is presented. The system can convert both Min-Nan/Taiwanese texts, represented in a hybrid Han-Lo written form, and Chinese texts into natural Taiwanese speeches. It is an improved version of the system developed previously, Improvements include: the add of a "Chinese-to-Min-Nan/Taiwanese" lexicon to solve the OOV problem and to increase the ability of processing Chinese text; the use of explicit tone sandhi rules to ease the learning of prosody generation; a further processing of the training database to detect all breaks not associated with PMs; and the use of four RNNs to separately generate four types of prosodic parameters. The system is implemented by software and runs in real-time on PC. An informal subjective listening test confirmed that the system performed well. All synthetic speeches sounded natural for well-tokenized Min-Nan/Taiwanese texts and for automatic tokenized Chinese texts.

原文English
文章編號1198830
頁(從 - 到)512-515
頁數4
期刊ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
1
DOIs
出版狀態Published - 6 4月 2003
事件2003 IEEE International Conference on Accoustics, Speech, and Signal Processing - Hong Kong, Hong Kong
持續時間: 6 4月 200310 4月 2003

指紋

深入研究「A High-Performance Min-Nan/Taiwanese TTS System」主題。共同形成了獨特的指紋。

引用此