TY - JOUR
T1 - A High-Performance Min-Nan/Taiwanese TTS System
AU - Kuo, Wei Chih
AU - Zhong, Xiang Rui
AU - Wang, Yih-Ru
AU - Chen, Sin-Horng
PY - 2003/4/6
Y1 - 2003/4/6
N2 - In this paper, the implementation of a high-performance Min-Nan/Taiwanese TTS system is presented. The system can convert both Min-Nan/Taiwanese texts, represented in a hybrid Han-Lo written form, and Chinese texts into natural Taiwanese speech. It is an improved version of the system developed previously. Improvements include: the addition of a "Chinese-to-Min-Nan/Taiwanese" lexicon to solve the OOV problem and to increase the ability to process Chinese text; the use of explicit tone sandhi rules to ease the learning of prosody generation; further processing of the training database to detect all breaks not associated with PMs; and the use of four RNNs to separately generate four types of prosodic parameters. The system is implemented in software and runs in real time on a PC. An informal subjective listening test confirmed that the system performed well. All synthetic speech sounded natural for well-tokenized Min-Nan/Taiwanese texts and for automatically tokenized Chinese texts.
AB - In this paper, the implementation of a high-performance Min-Nan/Taiwanese TTS system is presented. The system can convert both Min-Nan/Taiwanese texts, represented in a hybrid Han-Lo written form, and Chinese texts into natural Taiwanese speech. It is an improved version of the system developed previously. Improvements include: the addition of a "Chinese-to-Min-Nan/Taiwanese" lexicon to solve the OOV problem and to increase the ability to process Chinese text; the use of explicit tone sandhi rules to ease the learning of prosody generation; further processing of the training database to detect all breaks not associated with PMs; and the use of four RNNs to separately generate four types of prosodic parameters. The system is implemented in software and runs in real time on a PC. An informal subjective listening test confirmed that the system performed well. All synthetic speech sounded natural for well-tokenized Min-Nan/Taiwanese texts and for automatically tokenized Chinese texts.
UR - http://www.scopus.com/inward/record.url?scp=0141479969&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2003.1198830
DO - 10.1109/ICASSP.2003.1198830
M3 - Conference article
AN - SCOPUS:0141479969
SN - 1520-6149
VL - 1
SP - 512
EP - 515
JO - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
JF - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
M1 - 1198830
T2 - 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing
Y2 - 6 April 2003 through 10 April 2003
ER -