Towards Slovak-English-Mandarin speech recognition using deep learning

Matus Pleva*, Yuan Fu Liao, Wuhua Hsu, Daniel Hladek, Jan Stas, Peter Viszlay, Martin Lojka, Jozef Juhar

*Corresponding author for this work

Research output: Conference contribution, peer-reviewed

3 Citations (Scopus)

Abstract

This paper describes progress in the development of a multilingual speech-enabled interface that explores state-of-the-art deep learning techniques within the bilateral project "Deep Learning for Advanced Speech Enabled Applications". Advances are expected especially in automatic subtitling of broadcast television and radio programs, database creation, indexing, and information retrieval. This implies investigating deep learning techniques in the following sub-tasks: a) multilingual large vocabulary continuous speech recognition, b) audio event detection, c) speaker clustering and diarization, d) spoken discourse, speech, paragraph and sentence segmentation, e) emotion recognition, f) microphone array/multi-channel speech enhancement, g) data mining, h) multilingual speech synthesis, and i) spoken dialogue user interfaces. The paper presents the current work, a description of the data available in the project, and the results achieved in the first task: a Kaldi-based Slovak speech recognition module using deep learning algorithms.

Original language: English
Title of host publication: Proceedings of ELMAR 2018 - 60th International Symposium
Editors: Mislav Grgic, Dijana Vitas, Branka Zovko-Cihlar, Mario Mustra
Publisher: Croatian Society Electronics in Marine - ELMAR
Pages: 151-154
Number of pages: 4
ISBN (Electronic): 9789531842440
Publication status: Published - 13 Nov 2018
Event: 60th International Symposium on ELMAR, ELMAR 2018 - Zadar, Croatia
Duration: 16 Sep 2018 to 19 Sep 2018

Publication series

Name: Proceedings Elmar - International Symposium Electronics in Marine
2018-September
ISSN (Print): 1334-2630

Conference

Conference: 60th International Symposium on ELMAR, ELMAR 2018
Country/Territory: Croatia
City: Zadar
Period: 16/09/18 to 19/09/18
