Layered nonnegative matrix factorization for speech separation

Chung Chien Hsu, Jen-Tzung Chien, Tai-Shih Chi

研究成果: Conference article同行評審

8 引文 斯高帕斯(Scopus)

摘要

This paper proposes a layered nonnegative matrix factorization (L-NMF) algorithm for speech separation. The standard NMF method extracts parts-based bases out of nonnegative training data and is often used to separate mixed spectrograms. The proposed L-NMF algorithm comprises of several layers of standard NMF blocks. During training, each layer of the L-NMF is initialized separately and then fine-tuned by minimizing the propagated reconstruction error. More complicated bases of the training data are emerged in deeper layers of the L-NMF by progressively combining parts-based bases extracted in the first layer. In other words, these complicated bases contain collective information of the parts-based bases. The bases deciphered by all layers are then used to separate spectrograms in the conventional NMF way. Simulation results show the proposed LNMF outperforms the standard NMF in terms of the source-todistortion ratio (SDR).

原文English
頁(從 - 到)628-632
頁數5
期刊Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
2015-January
DOIs
出版狀態Published - 9月 2015
事件16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 - Dresden, 德國
持續時間: 6 9月 201510 9月 2015

指紋

深入研究「Layered nonnegative matrix factorization for speech separation」主題。共同形成了獨特的指紋。

引用此