Exploring redundancy of HRTFs for fast training DNN-based HRTF personalization

Tzu Yu Chen, Po Wen Hsiao, Tai-Shih Chi

研究成果: Conference contribution同行評審

2 引文 斯高帕斯(Scopus)

摘要

A deep neural network (DNN) is constructed to predict the magnitude responses of the head-related transfer functions (HRTFs) of users for a specific direction and a specific ear. Using the CIPIC HRTF database (including 25 azimuth angles and 50 elevation angles for both ears), we trained 2500 DNNs to predict magnitude responses of all HRTFs of a user. To reduce training time, we propose to use the final weights of the trained DNN of a nearby direction as the initial weights of the current DNN under training since magnitude responses of the HRTFs are smoothly changing across nearby directions. Analysis of variance (ANOVA) was performed to show that the proposed training scheme produces equivalent magnitude responses of HRTFs as the standard training scheme with random initial weights in terms of the log-spectral distortion (LSD) measure. Meanwhile, the proposed training scheme can dramatically reduce training time by more than 95%.

原文English
主出版物標題2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁面1929-1933
頁數5
ISBN(電子)9789881476852
DOIs
出版狀態Published - 2 7月 2018
事件10th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Honolulu, 美國
持續時間: 12 11月 201815 11月 2018

出版系列

名字2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings

Conference

Conference10th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018
國家/地區美國
城市Honolulu
期間12/11/1815/11/18

指紋

深入研究「Exploring redundancy of HRTFs for fast training DNN-based HRTF personalization」主題。共同形成了獨特的指紋。

引用此