TDOA information based vad for robust speech recognition in directional and diffuse noise field

Kuan Lang Huang*, Tai-Shih Chi

*此作品的通信作者

研究成果: Conference contribution同行評審

2 引文 斯高帕斯(Scopus)

摘要

A two-microphone algorithm is proposed to improve automatic speech recognition (ASR) rates when target speech is corrupted by directional interferences and diffuse noise simultaneously. The algorithm adopts the time difference of arrival (TDOA) to suppress directional interferences and a TDOA-information based voice activity detector (VAD) to suppress diffuse noise. Simulation results show the proposed algorithm is effective in improving ASR rates in a sound field mixed with a directional interference and diffuse noise. Compared with the phase difference (PD) algorithm, the proposed method gives comparable recognition rates when facing a directional interference and much higher and more robust recognition rates when diffuse noise emerges.

原文English
主出版物標題2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012
頁面126-130
頁數5
DOIs
出版狀態Published - 1 12月 2012
事件2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012 - Hong Kong, 中國
持續時間: 5 12月 20128 12月 2012

出版系列

名字2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012

Conference

Conference2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012
國家/地區中國
城市Hong Kong
期間5/12/128/12/12

指紋

深入研究「TDOA information based vad for robust speech recognition in directional and diffuse noise field」主題。共同形成了獨特的指紋。

引用此