A frequency bin-wise nonlinear masking algorithm in convolutive mixtures for speech segregation

Tai-Shih Chi, Ching Wen Huang, Wen Sheng Chou

研究成果: Article同行評審

1 引文 斯高帕斯(Scopus)

摘要

A frequency bin-wise nonlinear masking algorithm is proposed in the spectrogram domain for speech segregation in convolutive mixtures. The contributive weight from each speech source to a time-frequency unit of the mixture spectrogram is estimated by a nonlinear function based on location cues. For each sound source, a non-binary mask is formed from the estimated weights and is multiplied to the mixture spectrogram to extract the sound. Head-related transfer functions (HRTFs) are used to simulate convolutive sound mixtures perceived by listeners. Simulation results show our proposed method outperforms convolutive independent component analysis and degenerate unmixing and estimation technique methods in almost all test conditions.

原文English
頁(從 - 到)EL361-EL367
期刊Journal of the Acoustical Society of America
131
發行號5
DOIs
出版狀態Published - 5月 2012

指紋

深入研究「A frequency bin-wise nonlinear masking algorithm in convolutive mixtures for speech segregation」主題。共同形成了獨特的指紋。

引用此