Spectro-temporal smoothed auditory spectra for robust speaker identification

Ting H. Lin*, Chung Chien Hsu, Tai-Shih Chi

*Corresponding author for this work

Research output: Conference contribution, peer-reviewed

Abstract

The performance of conventional speaker identification systems is severely compromised by interference, such as additive or convolutional noise. High-level information about the speaker provides more robust cues for identifying speakers. This paper proposes an auditory-model based spectro-temporal modulation filtering (STMF) process to capture high-level information for robust speaker identification. Text-independent closed-set speaker identification simulations are conducted on the TIMIT and GRID corpora to evaluate the robustness of Auditory Cepstral Coefficients (ACCs) after the STMF process. Simulation results show that ACCs improve substantially over conventional MFCCs in all SNR conditions. In low SNR conditions, STMF is also shown to suppress noise better than the newly developed Auditory-based Nonnegative Tensor Cepstral Coefficients (ANTCCs).

Original language: English
Title of host publication: 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings
Pages: 313-317
Number of pages: 5
Publication status: Published - 2010
Event: 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Tainan, Taiwan
Duration: 29 Nov 2010 to 3 Dec 2010

Publication series

Name: 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings

Conference

Conference: 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010
Country/Territory: Taiwan
City: Tainan
Period: 29/11/10 to 3/12/10
