Spectro-temporal modulation based singing detection combined with pitchbased grouping for singing voice separation

Tse En Lin, Chung Chien Hsu, Yi Cheng Chen, Jian Hueng Chen, Tai-Shih Chi

研究成果: Conference article同行評審

1 引文 斯高帕斯(Scopus)

摘要

A spectro-temporal modulation based singing voice detection cascaded with a Viterbi based pitch tracking algorithm is proposed in this paper for singing-voice separation from monaural recordings. To detect the singing voice, the spectrotemporal modulation energy related to voice harmonics is extracted using a spectro-temporal modulation analysis framework developed for the Fourier spectrogram. Separation of singing-voice from background music is conducted using a binary mask to group estimated harmonics of singing voice. The proposed system is evaluated using MIR-1K dataset and is shown outperforming three other binary-mask based systems in the vocal/music separation task.

指紋

深入研究「Spectro-temporal modulation based singing detection combined with pitchbased grouping for singing voice separation」主題。共同形成了獨特的指紋。

引用此