摘要
Generally, the feature parameters used for speech detection are highly sensitive to the environment. The performance of speech detection is severely degraded under realistic noisy environments since the characteristics of a speech signal cannot be fully expressed by those feature parameters. As a result, this study seeks the acoustic fingerprints of speech spectrogram as a robust feature to distinguish a speech from a non-speech, especially in adverse environments, and the fact that the frequency energies of difference types of noise are concentrated on different frequency bands [12], an ABSE (Adaptive Band-partitioning Spectral Entropy)-based speech detection algorithm is proposed to detect speech signals in adverse environments. Additionally, the ABSE-based algorithm is demonstrated to work in real-time with minimal processing delay. Experimental results indicate that the ABSE parameter is very effective for several SNRs (Signal to Noise Ratios) and various noise conditions. Furthermore, the proposed ABSE-based algorithm outperforms other approaches and is reliable in a real car.
原文 | English |
---|---|
頁面 | 957-960 |
頁數 | 4 |
出版狀態 | Published - 10月 2004 |
事件 | 8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, 韓國 持續時間: 4 10月 2004 → 8 10月 2004 |
Conference
Conference | 8th International Conference on Spoken Language Processing, ICSLP 2004 |
---|---|
國家/地區 | 韓國 |
城市 | Jeju, Jeju Island |
期間 | 4/10/04 → 8/10/04 |