An Audio-Visual Speech Enhancement System Based on 3D Image Features: An Application in Hearing Aids

Yu Ching Chung, Ji Yan Han, Bo Sin Wang, Wei Zhong Zheng, Kung Yao Shen, Ying Hui Lai*

*此作品的通信作者

研究成果: Conference contribution同行評審

摘要

Previous research has shown that auditory and visual inputs are not asynchronous in the human brain, and that visual cues can enhance attention in the hearing process. Therefore, this study proposes audio-visual speech enhancement (SE) with 3D image features (AV-3D-SE) that imitates the auditory process of humans to elevate listening quality. More specifically, AV-3D-SE uses the FlowNet3D model to predict temporal facial motion from the recorded 3D image combining with features for SE applications. The evaluation results showed that the average scores of perceptual evaluation of speech quality and short-time objective intelligibility in 3 dB signal-to-noise ratio increased to 3.229 and 0.914, respectively, while the average hearing aid speech quality index significantly outperformed baseline SE systems (audio-only and audio-visual-2D) in seven typical types of hearing loss with high hearing aid speech perception index. In conclusion, the proposed AV-3D-SE enhances the effectiveness of the SE system and can increase the listening satisfaction of hearing aid users.

原文English
主出版物標題2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023
發行者Institute of Electrical and Electronics Engineers Inc.
頁面1131-1137
頁數7
ISBN(電子)9798350300673
DOIs
出版狀態Published - 2023
事件2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023 - Taipei, 台灣
持續時間: 31 10月 20233 11月 2023

出版系列

名字2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023

Conference

Conference2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023
國家/地區台灣
城市Taipei
期間31/10/233/11/23

指紋

深入研究「An Audio-Visual Speech Enhancement System Based on 3D Image Features: An Application in Hearing Aids」主題。共同形成了獨特的指紋。

引用此