FORMOSa speech recognition challenge 2018: Data, plan and baselines

Yuan Fu Liao, Wu Hua Hsu, Yu Chen Lin, Yung Hsiang Shawn Chang, Matus Pleva, Jozef Juhar, Guang Feng Deng

研究成果: Conference contribution同行評審

4 引文 斯高帕斯(Scopus)

摘要

This paper introduces the Formosa speech recognition (FSR) challenge 2018, presents the provided data profile, evaluation plan and reports the experimental results of the baseline systems. This challenge focuses on spontaneous Taiwanese Mandarin speech recognition (TMSR) and it is based on a real-life, multi-gene broadcast radio speech corpus, NER-Trs-Vol1, selected from the Formosa speech in the wild (FSW) project. To assist participants to establish a good starting system, a set of baseline systems were published based on various deep neural network (DNN) models. NER-Trs-Vol1 is free for participants (noncommercial license), and its corresponding Kaldi recipes for the baselines have been published online. Experimental results show that the combination of NER-Trs-Vol1 and Kaldi recipes is a good resource pack for spontaneous TMSR research and could be used to initialize an advanced semi-supervised training procedure to further improve the recognition performance.

原文English
主出版物標題2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁面270-274
頁數5
ISBN(電子)9781538656273
DOIs
出版狀態Published - 2 7月 2018
事件11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Taipei, 台灣
持續時間: 26 11月 201829 11月 2018

出版系列

名字2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings

Conference

Conference11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018
國家/地區台灣
城市Taipei
期間26/11/1829/11/18

指紋

深入研究「FORMOSa speech recognition challenge 2018: Data, plan and baselines」主題。共同形成了獨特的指紋。

引用此