FORMOSa speech recognition challenge 2018: Data, plan and baselines

Yuan Fu Liao, Wu Hua Hsu, Yu Chen Lin, Yung Hsiang Shawn Chang, Matus Pleva, Jozef Juhar, Guang Feng Deng

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

This paper introduces the Formosa speech recognition (FSR) challenge 2018, presents the provided data profile, evaluation plan and reports the experimental results of the baseline systems. This challenge focuses on spontaneous Taiwanese Mandarin speech recognition (TMSR) and it is based on a real-life, multi-gene broadcast radio speech corpus, NER-Trs-Vol1, selected from the Formosa speech in the wild (FSW) project. To assist participants to establish a good starting system, a set of baseline systems were published based on various deep neural network (DNN) models. NER-Trs-Vol1 is free for participants (noncommercial license), and its corresponding Kaldi recipes for the baselines have been published online. Experimental results show that the combination of NER-Trs-Vol1 and Kaldi recipes is a good resource pack for spontaneous TMSR research and could be used to initialize an advanced semi-supervised training procedure to further improve the recognition performance.

Original languageEnglish
Title of host publication2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages270-274
Number of pages5
ISBN (Electronic)9781538656273
DOIs
StatePublished - 2 Jul 2018
Event11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Taipei, Taiwan
Duration: 26 Nov 201829 Nov 2018

Publication series

Name2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings

Conference

Conference11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018
Country/TerritoryTaiwan
CityTaipei
Period26/11/1829/11/18

Keywords

  • Kaldi speech toolkits
  • Neural networks
  • Semi-supervised training
  • Spontaneous speech recognition

Fingerprint

Dive into the research topics of 'FORMOSa speech recognition challenge 2018: Data, plan and baselines'. Together they form a unique fingerprint.

Cite this