Improving denoising auto-encoder based speech enhancement with the speech parameter generation algorithm

Syu Siang Wang, Hsin Te Hwang, Ying Hui Lai, Yu Tsao, Xugang Lu, Hsin Min Wang, Borching Su

Research output: Conference contribution (peer-reviewed)

10 Citations (Scopus)

Abstract

This paper investigates the use of the speech parameter generation (SPG) algorithm, which has been successfully adopted in deep neural network (DNN)-based voice conversion (VC) and speech synthesis (SS), to incorporate temporal information and thereby improve deep denoising auto-encoder (DDAE)-based speech enhancement. In our previous studies, we confirmed that the DDAE can effectively suppress noise components in noise-corrupted speech. However, because the DDAE converts speech in a frame-by-frame manner, the enhanced speech shows some level of discontinuity even though context features are used as input to the DDAE. To handle this issue, this study proposes using the SPG algorithm as a post-processor to transform the DDAE-processed feature sequence into one with a smoothed trajectory. Two types of temporal information with SPG are investigated in this study: static-dynamic features and context features. Experimental results show that SPG with context features outperforms both SPG with static-dynamic features and the baseline system, which considers context features without SPG, in terms of standardized objective tests across different noise types and SNRs.
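The post-processing idea in the abstract, turning frame-wise static and dynamic estimates into one smooth trajectory, can be illustrated with a minimal sketch of parameter generation in its commonly used least-squares form. This is not the authors' implementation. The function names, the single feature dimension, the centred-difference delta window (-0.5, 0, 0.5), the zero-padded edges, and the unit variances are all assumptions made for illustration; the abstract does not state the paper's actual windows or variance weighting.

```python
def build_window_matrix(num_frames):
    """Return W (2T x T): interleaved static rows and delta rows.

    Assumed delta definition: d[t] = 0.5 * (c[t+1] - c[t-1]),
    with out-of-range frames treated as zero (an illustrative choice).
    """
    T = num_frames
    W = [[0.0] * T for _ in range(2 * T)]
    for t in range(T):
        W[2 * t][t] = 1.0                 # static row picks c[t]
        if t - 1 >= 0:
            W[2 * t + 1][t - 1] = -0.5    # delta row, left tap
        if t + 1 < T:
            W[2 * t + 1][t + 1] = 0.5     # delta row, right tap
    return W


def solve_linear(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for k in range(col, n + 1):
                M[r][k] -= factor * M[col][k]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(M[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (M[i][n] - tail) / M[i][i]
    return x


def generate_trajectory(static, delta):
    """Least-squares trajectory c minimising ||W c - y||^2.

    y interleaves the per-frame static and delta estimates (for example,
    hypothetical DDAE outputs). With unit variances assumed, the normal
    equations are (W^T W) c = W^T y.
    """
    T = len(static)
    W = build_window_matrix(T)
    y = [v for pair in zip(static, delta) for v in pair]
    WtW = [[sum(W[r][i] * W[r][j] for r in range(2 * T)) for j in range(T)]
           for i in range(T)]
    Wty = [sum(W[r][i] * y[r] for r in range(2 * T)) for i in range(T)]
    return solve_linear(WtW, Wty)
```

When the static and delta inputs are mutually consistent (the deltas are exactly the windowed differences of the statics), the solve returns the static sequence unchanged. When they disagree, as frame-wise estimates usually do, the result is a compromise trajectory that fits both. A fuller treatment would weight each row by its estimated precision rather than assuming unit variance.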

Original language: English
Title of host publication: 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2015
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 365-369
Number of pages: 5
ISBN (Electronic): 9789881476807
Publication status: Published - 19 Feb 2016
Event: 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2015 - Hong Kong, Hong Kong
Duration: 16 Dec 2015 to 19 Dec 2015

Publication series

Name: 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2015

Conference

Conference: 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2015
Country/Territory: Hong Kong
City: Hong Kong
Period: 16/12/15 to 19/12/15
