Harmonic-aware tri-path convolution recurrent network for singing voice separation

Yih Liang Shen, Ya Ching Lai, Tai Shih Chi*

*Corresponding author for this work

Research output: Article (peer-reviewed)

Abstract

Temporal coherence and spectral regularity are critical cues for human auditory streaming and are exploited by many sound separation models. Examples include Conv-TasNet, which focuses on temporal coherence by analyzing sound with short kernels, and the dual-path convolution recurrent network (DPCRN), which uses two recurrent neural networks (RNNs) to analyze general patterns along the temporal and spectral dimensions of a spectrogram. This work extends DPCRN with an additional inter-band RNN, yielding a harmonic-aware tri-path convolution recurrent network. Evaluation results on public datasets show that this addition further improves the separation performance of DPCRN.
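The tri-path idea can be pictured as running a recurrence along three different axes of the same feature tensor. The following is a minimal sketch in plain Python, not the authors' implementation: the (bands, frames, bins per band) layout, the scalar `toy_rnn` recurrence, the residual connections, the path order, and all sizes are assumptions made for illustration, and the abstract does not say how the bands are defined.

```python
import math
from itertools import product


def toy_rnn(seq, w=0.5, u=0.5):
    """Scalar Elman-style recurrence (a stand-in for a real RNN layer):
    h_t = tanh(w * x_t + u * h_{t-1})."""
    h, out = 0.0, []
    for x in seq:
        h = math.tanh(w * x + u * h)
        out.append(h)
    return out


def scan_path(x, shape, axis):
    """Run toy_rnn along one axis of a tensor stored as {(band, frame, bin): value}."""
    other_axes = [a for a in range(3) if a != axis]
    out = {}
    # One independent sequence per combination of the two remaining axes.
    for fixed in product(*(range(shape[a]) for a in other_axes)):
        keys = []
        for i in range(shape[axis]):
            idx = [0, 0, 0]
            idx[axis] = i
            for a, v in zip(other_axes, fixed):
                idx[a] = v
            keys.append(tuple(idx))
        for k, h in zip(keys, toy_rnn([x[k] for k in keys])):
            out[k] = h
    return out


def tri_path_block(x, shape):
    """Apply the three paths one after another, each with a residual connection.
    Axis 2: spectral path (bins), axis 1: temporal path (frames),
    axis 0: inter-band path (the third path). The order is an assumption."""
    for axis in (2, 1, 0):
        y = scan_path(x, shape, axis)
        x = {k: x[k] + y[k] for k in x}
    return x


shape = (2, 3, 4)  # hypothetical sizes: (bands, frames, bins per band)
x = {idx: 0.1 * sum(idx) for idx in product(*(range(n) for n in shape))}
y = tri_path_block(x, shape)
```

Dropping axis 0 from the loop leaves the two-path (temporal plus spectral) arrangement that the abstract attributes to DPCRN, so the sketch isolates what the third path adds: information flow across bands.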

Original language: English
Article number: 074801
Journal: JASA Express Letters
Volume: 3
Issue number: 7
Publication status: Published - 1 Jul 2023
