A generative auditory model embedded neural network for speech processing

Yu Wen Lo, Yih Liang Shen, Yuan Fu Liao, Tai-Shih Chi

研究成果: Conference contribution同行評審

1 引文 斯高帕斯(Scopus)

摘要

Before the era of the neural network (NN), features extracted from auditory models have been applied to various speech applications and been demonstrated more robust against noise than conventional speech-processing features. What's the role of auditory models in the current NN era? Are they obsolete? To answer this question, we construct a NN with a generative auditory model embedded to process speech signals. The generative auditory model consists of two stages, the stage of spectrum estimation in the logarithmic-frequency axis by the cochlea and the stage of spectral-temporal analysis in the modulation domain by the auditory cortex. The NN is evaluated in a simple speaker identification task. Experiment results show that the auditory model embedded NN is still more robust against noise, especially in low SNR conditions, than the randomly-initialized NN in speaker identification.

原文English
主出版物標題2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁面5179-5183
頁數5
ISBN(列印)9781538646588
DOIs
出版狀態Published - 10 9月 2018
事件2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018 - Calgary, 加拿大
持續時間: 15 4月 201820 4月 2018

出版系列

名字ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
2018-April
ISSN(列印)1520-6149

Conference

Conference2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018
國家/地區加拿大
城市Calgary
期間15/04/1820/04/18

指紋

深入研究「A generative auditory model embedded neural network for speech processing」主題。共同形成了獨特的指紋。

引用此