TY - GEN
T1 - Plastic multi-resolution auditory model based neural network for speech enhancement
AU - Lai, Chen Yen
AU - Lo, Yu Wen
AU - Shen, Yih Liang
AU - Chi, Tai-Shih
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/7/2
Y1 - 2017/7/2
N2 - In this paper, we propose a plastic auditory model based neural network for speech enhancement. The proposed system integrates a spectro-temporal analytical auditory model with a multi-layer fully-connected network to form a quasi-CNN structure. The initial kernels of the convolutional layer are derived from the neuro-physiological auditory model. To simulate the plasticity of cortical neurons for attentional hearing, the kernels are allowed to adjust themselves pertaining to the task at hand. For the application of speech enhancement, the Fourier spectrogram instead of the auditory spectrogram is used as input to the proposed neural network such that the cleaned speech signal can be well reconstructed. The proposed system performs comparably with standard DNN and CNN systems when plenty resources are available. Meanwhile, under the limited-resource condition, the proposed system outperforms standard systems in all test settings.
AB - In this paper, we propose a plastic auditory model based neural network for speech enhancement. The proposed system integrates a spectro-temporal analytical auditory model with a multi-layer fully-connected network to form a quasi-CNN structure. The initial kernels of the convolutional layer are derived from the neuro-physiological auditory model. To simulate the plasticity of cortical neurons for attentional hearing, the kernels are allowed to adjust themselves pertaining to the task at hand. For the application of speech enhancement, the Fourier spectrogram instead of the auditory spectrogram is used as input to the proposed neural network such that the cleaned speech signal can be well reconstructed. The proposed system performs comparably with standard DNN and CNN systems when plenty resources are available. Meanwhile, under the limited-resource condition, the proposed system outperforms standard systems in all test settings.
UR - http://www.scopus.com/inward/record.url?scp=85050793845&partnerID=8YFLogxK
U2 - 10.1109/APSIPA.2017.8282096
DO - 10.1109/APSIPA.2017.8282096
M3 - Conference contribution
AN - SCOPUS:85050793845
T3 - Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
SP - 605
EP - 609
BT - Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
Y2 - 12 December 2017 through 15 December 2017
ER -