The Speech Labeling and Modeling Toolkit (SLMTK) Version 1.0

Chen Yu Chiang, Wu Hao Li, Yen Ting Lin, Jia Jyu Su, Wei Cheng Chen, Cheng Che Kao, Shu Lei Lin, Pin Han Lin, Shao Wei Hong, Guan Ting Liou, Wen Yang Chang, Jen Chieh Chiang, Yen Ting Lin, Yih-Ru Wang, Sin Horng Chen

研究成果: Conference contribution同行評審

1 引文 斯高帕斯(Scopus)

摘要

This paper introduces the Speech Labeling and Modeling Toolkit version 1.0 (SLMTK 1.0), which facilitates automatic labeling of text and speech for constructing text-To-speech (TTS) systems and speech analysis. The SLMTK 1.0 supports mixed Mandarin-English speech and the associated texts. The following seven steps then process the inputs: 1) text analysis, 2) acoustic feature extraction, 3) linguistic-speech alignment, 4) integration of syllable-based linguistic and prosodic-Acoustic features, 5) prosody labeling, 6) construction of prosody generation model, and 7) construction of acoustic models for speech synthesis. The outputs of the seven steps are, respectively, 1) linguistic labels, 2) acoustic features, 3) linguistic-speech alignment, 4) syllable-based linguistic and prosodic-Acoustic features, 5) prosody tags, 6) prosody generation model, and 7) acoustic models for speech synthesis. The SLMTK 1.0 has been applied to constructing personalized TTS systems for augmentative and alternative communication. In addition, the toolkit has also been applied to phonetic and prosodic labeling of L2 Mandarin speech to facilitate prosody analysis studies. The SLMTK 1.0 is available at https://slmtk.ce.ntpu.edu.tw for non-commercial use and welcomes all parties to enrich the functions of the SLMTK.

原文English
主出版物標題2022 25th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2022 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
ISBN(電子)9798350398564
DOIs
出版狀態Published - 2022
事件25th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2022 - Hanoi, Viet Nam
持續時間: 24 11月 202226 11月 2022

出版系列

名字2022 25th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2022 - Proceedings

Conference

Conference25th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2022
國家/地區Viet Nam
城市Hanoi
期間24/11/2226/11/22

指紋

深入研究「The Speech Labeling and Modeling Toolkit (SLMTK) Version 1.0」主題。共同形成了獨特的指紋。

引用此