Gated module neural network for multilingual speech recognition

Yuan Fu Liao, Matus Pleva, Daniel Hladek, Jan Stas, Peter Viszlay, Martin Lojka, Jozef Juhar

研究成果: Conference contribution同行評審

3 引文 斯高帕斯(Scopus)

摘要

For most multilingual large vocabulary continuous speech recognition (LVCSR) systems, when multiple languages are allowed at the same time, their performance will degrade significantly due to the strong inter-language competition in the decoding phase. To increase the inter-language discrimination capacity, this paper presents a gated module neural network (GMN) approach that adapts a language identification (LID) component to directly assist the final multilingual LVCSR goal. Thanks to an international collaboration 3 large-scale speech corpora (Mandarin, English and Slovak, denoted as Zh, En and Sk) were shared for studying this problem. Hence the proposed approach was evaluated on both bilingual (Zh/En and Sk/En) and trilingual (Zh/En/Sk) LVCSR tasks. The experimental results show that the proposed GMN is promising and the performance of multilingual LVCSRs are now more comparable with the monolingual ones.

原文English
主出版物標題2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁面131-135
頁數5
ISBN(電子)9781538656273
DOIs
出版狀態Published - 2 7月 2018
事件11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Taipei, 台灣
持續時間: 26 11月 201829 11月 2018

出版系列

名字2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings

Conference

Conference11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018
國家/地區台灣
城市Taipei
期間26/11/1829/11/18

指紋

深入研究「Gated module neural network for multilingual speech recognition」主題。共同形成了獨特的指紋。

引用此