Taiwanese Hakka Across Taiwan Corpus and Formosa Speech Recognition Challenge 2023 - Hakka ASR

Yuan Fu Liao*, Shaw Hwa Hwang, You Shuo Chen, Han Chun Lai, Yao Hsing Chung, Li Te Shen, Yen Chun Huang, Chi Jung Huang, Hsu Wen Han, Li Wei Chen, Pei Chung Su, Chao Shih Huang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

To revive the endangered Taiwanese Hakka language, the first large-scale Taiwanese Hakka speech corpus across Taiwan (HAT) was developed, representing modern Taiwanese Hakka around Taiwan. This paper briefly reviews the first part (Sixian and Hailu accents) of the HAT corpus and demonstrates some of its potential applications, including automatic speech recognition (ASR), text-to-speech (TTS), and Taiwanese Hakka speech-enabled ChatGPT. Moreover, the Formosa Speech Recognition Challenge 2023 (FSR-2023) - Hakka ASR was established to promote the corpus and evaluate the performance of state-of-the-art Taiwanese Hakka ASR systems.

Original languageEnglish
Title of host publicationProceedings of 2023 26th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardization of Speech Databases and Assessment Techniques, O-COCOSDA 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350344028
DOIs
StatePublished - 2023
Event26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques, O-COCOSDA 2023 - Delhi, India
Duration: 4 Dec 20236 Dec 2023

Publication series

NameProceedings of 2023 26th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardization of Speech Databases and Assessment Techniques, O-COCOSDA 2023

Conference

Conference26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques, O-COCOSDA 2023
Country/TerritoryIndia
CityDelhi
Period4/12/236/12/23

Keywords

  • Sixian and Hailu accents
  • Taiwanese Hakka speech corpus
  • Taiwanese Hakka speech-enabled ChatGPT
  • automatic speech recognition
  • speech synthesis

Fingerprint

Dive into the research topics of 'Taiwanese Hakka Across Taiwan Corpus and Formosa Speech Recognition Challenge 2023 - Hakka ASR'. Together they form a unique fingerprint.

Cite this