A Hybrid Neural Network Based on the Duplex Model of Pitch Perception for Singing Melody Extraction

Hsin Chou, Ming Tso Chen, Tai-Shih Chi

Research output: Conference contribution (peer-reviewed)

19 Citations (Scopus)

Abstract

In this paper, we build a hybrid neural network (NN) for singing melody extraction from polyphonic music by imitating human pitch perception. Human hearing is described by two pitch perception models, the spectral model and the temporal model, depending on whether the harmonics are resolved. We first use NNs to implement each model individually and evaluate their performance on singing melody extraction. We then combine the NNs into a composite NN that simulates the duplex model, in which the temporal model complements the spectral model's pitch perception for unresolved harmonics. Simulation results show that the proposed composite NN outperforms other conventional methods in singing melody extraction.
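To make the two pitch cues named in the abstract concrete, the following is a minimal signal-processing sketch, not the authors' network. It assumes a spectral cue computed as a decaying-weight harmonic sum and a temporal cue computed as a normalized autocorrelation at the candidate period, combined by a plain average. The sample rate, frame length, weights, candidate grid, and all function names are hypothetical choices for illustration only.

```python
import cmath
import math

SR = 8000   # sample rate in Hz (illustrative assumption)
N = 800     # frame length; 10 Hz bin spacing keeps this toy example leakage-free


def make_tone(f0, n_partials=5):
    """Synthetic harmonic tone whose k-th partial has amplitude 1/k."""
    return [sum(math.sin(2 * math.pi * k * f0 * n / SR) / k
                for k in range(1, n_partials + 1))
            for n in range(N)]


def dft_mag(x, freq):
    """Magnitude of the DFT of x evaluated at a single frequency in Hz."""
    w = -2j * math.pi * freq / SR
    return abs(sum(v * cmath.exp(w * n) for n, v in enumerate(x)))


def spectral_salience(x, f, n_harm=10, decay=0.8):
    """Spectral cue: decaying-weight sum of magnitudes at harmonics of f."""
    return sum(decay ** (h - 1) * dft_mag(x, h * f)
               for h in range(1, n_harm + 1) if h * f < SR / 2)


def temporal_salience(x, f):
    """Temporal cue: normalized autocorrelation at a lag of one period of f."""
    lag = round(SR / f)
    a, b = x[:-lag], x[lag:]
    num = sum(p * q for p, q in zip(a, b))
    den = math.sqrt(sum(p * p for p in a) * sum(q * q for q in b))
    return max(0.0, num / den) if den > 0 else 0.0


def duplex_estimate(x, candidates):
    """Average the two cues (assumed combination rule) and pick the best candidate."""
    spec = [spectral_salience(x, f) for f in candidates]
    temp = [temporal_salience(x, f) for f in candidates]
    top = max(spec) or 1.0  # scale the spectral cue to [0, 1]
    scores = [0.5 * (s / top) + 0.5 * t for s, t in zip(spec, temp)]
    return candidates[scores.index(max(scores))]


candidates = [100.0, 150.0, 200.0, 250.0, 300.0, 400.0]
estimate = duplex_estimate(make_tone(200.0), candidates)
print(estimate)
```

In this toy setting the autocorrelation cue alone scores the 100 Hz and 200 Hz candidates almost identically, because two periods of a periodic signal match as well as one, and the weighted harmonic sum breaks that tie. This only illustrates how two cues can complement each other; the paper's composite NN learns its combination from data, and its handling of resolved versus unresolved harmonics is not reproduced here.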

Original language: English
Title of host publication: 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 381-385
Number of pages: 5
ISBN (Print): 9781538646588
Publication status: Published - 10 Sep 2018
Event: 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018 - Calgary, Canada
Duration: 15 Apr 2018 to 20 Apr 2018

Publication series

Name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume: 2018-April
ISSN (Print): 1520-6149

Conference

Conference: 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018
Country/Territory: Canada
City: Calgary
Period: 15/04/18 to 20/04/18
