Development of a large-scale Mandarin Radio Speech Corpus

Yung Hsiang Shawn Chang, Yuan Fu Liao, Sheng Ming Wang, Jenq Haur Wang, Sing Yue Wang, Jhih Wei Chen, You Dian Chen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

The Taiwan Mandarin Radio Speech Corpus consists of roughly 300 (and growing) hours of audio recordings, selected from Taiwan's National Education Radio (NER) archive. The corpus includes speech from hundreds of speakers and various speech styles (spontaneous conversational and read news). This corpus provides a rich resource for research in speech and automatic speech recognition (ASR). In this paper, we briefly introduce the corpus development approach and report two preliminary experimental results using this corpus.

Original languageEnglish
Title of host publication2017 IEEE International Conference on Consumer Electronics - Taiwan, ICCE-TW 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages359-360
Number of pages2
ISBN (Electronic)9781509040179
DOIs
StatePublished - 25 Jul 2017
Event4th IEEE International Conference on Consumer Electronics - Taiwan, ICCE-TW 2017 - Taipei, United States
Duration: 12 Jun 201714 Jun 2017

Publication series

Name2017 IEEE International Conference on Consumer Electronics - Taiwan, ICCE-TW 2017

Conference

Conference4th IEEE International Conference on Consumer Electronics - Taiwan, ICCE-TW 2017
Country/TerritoryUnited States
CityTaipei
Period12/06/1714/06/17

Fingerprint

Dive into the research topics of 'Development of a large-scale Mandarin Radio Speech Corpus'. Together they form a unique fingerprint.

Cite this