TY - GEN
T1 - A deep learning based noise reduction approach to improve speech intelligibility for cochlear implant recipients in the presence of competing speech noise
AU - Wang, Syu Siang
AU - Tsao, Yu
AU - Wang, Hsiao Lan Sharon
AU - Lai, Ying Hui
AU - Li, Lieber Po Hung
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/7/2
Y1 - 2017/7/2
N2 - This paper presents the clinical results of the application of a deep-learning-based noise reduction (NR) approach to improve speech intelligibility for cochlear implant (CI) recipients in the presence of competing speech noise. The deep denoising autoencoder (DDAE) model was used as a representative deep-learning-based NR model to reduce the noise components from the noisy input. The enhanced speech was subsequently played to six Mandarin-speaking CI recipients to perform recognition tests. All the subjects used their own clinical speech processors during testing. Two traditional NR approaches were also implemented to test the performance for a comparison. The Taiwan Mandarin version of the hearing in noise test (TMHINT) sentences were adopted and further corrupted by competing two talker speech noise at signal-to-noise ratio (SNR) levels of 0 and 5 dB. The experimental results showed that the DDAE NR approach can yield higher intelligibility scores than the two classical NR techniques in the presence of competing speech. The results of qualitative analysis further showed that the DDAE NR approach notably reduced the envelope distortions. The good results also suggest that the proposed DDAE NR approach can combine well with the existing CI processors to overcome the issue of degradation of speech perception, which is caused by competing speech noise.
AB - This paper presents the clinical results of the application of a deep-learning-based noise reduction (NR) approach to improve speech intelligibility for cochlear implant (CI) recipients in the presence of competing speech noise. The deep denoising autoencoder (DDAE) model was used as a representative deep-learning-based NR model to reduce the noise components from the noisy input. The enhanced speech was subsequently played to six Mandarin-speaking CI recipients to perform recognition tests. All the subjects used their own clinical speech processors during testing. Two traditional NR approaches were also implemented to test the performance for a comparison. The Taiwan Mandarin version of the hearing in noise test (TMHINT) sentences were adopted and further corrupted by competing two talker speech noise at signal-to-noise ratio (SNR) levels of 0 and 5 dB. The experimental results showed that the DDAE NR approach can yield higher intelligibility scores than the two classical NR techniques in the presence of competing speech. The results of qualitative analysis further showed that the DDAE NR approach notably reduced the envelope distortions. The good results also suggest that the proposed DDAE NR approach can combine well with the existing CI processors to overcome the issue of degradation of speech perception, which is caused by competing speech noise.
UR - http://www.scopus.com/inward/record.url?scp=85047937784&partnerID=8YFLogxK
U2 - 10.1109/APSIPA.2017.8282144
DO - 10.1109/APSIPA.2017.8282144
M3 - Conference contribution
AN - SCOPUS:85047937784
T3 - Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
SP - 808
EP - 812
BT - Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
Y2 - 12 December 2017 through 15 December 2017
ER -