Taiwanese Across Taiwan Corpus and Its Applications

Yuan Fu Liao, Jane S. Tsay, Peter Kang, Hui Lu Khoo, Le Kun Tan, Li Chen Chang, Un Gian Iunn, Huang Lan Su, Tsun Guan Thiann, Hak Khiam Tiun, Su Lian Liao

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Taiwanese across Taiwan (TAT) corpus is the first large-scale and publicly released Taiwanese speech corpus which represents the modern Taiwanese around Taiwan. This paper briefly reviews the TAT corpus and a corresponding parallel Chinese, Hàn-Lô-Tâi-bûn, Tai-Luo and Peh-ōe-jī lexicon and demonstrate some of their potential applications including ASR, TTS and voice conversion. The corresponding pretrained ASR and TTS models, sample model usage codes and training scripts will also be released. More information could be found on the Formosa Speech in the Wild website: https: //sites. google. com/nycu.edu.tw/fsw.

Original languageEnglish
Title of host publication2022 25th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2022 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350398564
DOIs
StatePublished - 2022
Event25th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2022 - Hanoi, Viet Nam
Duration: 24 Nov 202226 Nov 2022

Publication series

Name2022 25th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2022 - Proceedings

Conference

Conference25th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2022
Country/TerritoryViet Nam
CityHanoi
Period24/11/2226/11/22

Keywords

  • automatic speech recognition
  • speech synthesis
  • Taiwanese speech corpus
  • voice conversion

Fingerprint

Dive into the research topics of 'Taiwanese Across Taiwan Corpus and Its Applications'. Together they form a unique fingerprint.

Cite this