摘要
The two-dimensional spectro-temporal modulation filtering concept of the auditory model T. Chi, P. Ru, and S. A. Shamma, J. Acoust. Soc. Am. 118(2), 887-906 (2005) is implemented on the Fourier spectrogram. The Fourier magnitude spectrogram is analyzed in terms of its joint spectro-temporal modulations, which embed the temporal dynamics and spectral structures. Instead of iterative projection methods, the overlap-and-add method is adopted to invert modified Fourier spectrograms back to sounds. The proposed framework not only provides a similar spectro-temporal analytical process for sounds as the auditory model but also produces synthesized sounds with better quality in a timely manner, which makes proposed framework feasible to human speech recognition (HSR) applications as well.
原文 | English |
---|---|
頁(從 - 到) | EL190-EL196 |
期刊 | Journal of the Acoustical Society of America |
卷 | 129 |
發行號 | 5 |
DOIs | |
出版狀態 | Published - 5月 2011 |