Wavelet speech enhancement based on robust principal component analysis

Chia Lung Wu, Hsiang Ping Hsu, Syu Siang Wang, Jeih Weih Hung, Ying Hui Lai, Hsin Min Wang, Yu Tsao

Research output: Contribution to journalConference articlepeer-review

1 Scopus citations

Abstract

Most state-of-the-art speech enhancement (SE) techniques prefer to enhance utterances in the frequency domain rather than in the time domain. However, the overlap-add (OLA) operation in the short-time Fourier transform (STFT) for speech signal processing possibly distorts the signal and limits the performance of the SE techniques. In this study, a novel SE method that integrates the discrete wavelet packet transform (DWPT) and a novel subspace-based method, robust principal component analysis (RPCA), is proposed to enhance noise-corrupted signals directly in the time domain. We evaluate the proposed SE method on the Mandarin hearing in noise test (MHINT) sentences. The experimental results show that the new method reduces the signal distortions dramatically, thereby improving speech quality and intelligibility significantly. In addition, the newly proposed method outperforms the STFT-RPCA-based speech enhancement system.

Original languageEnglish
Pages (from-to)439-443
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume2017-August
DOIs
StatePublished - 2017
Event18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017 - Stockholm, Sweden
Duration: 20 Aug 201724 Aug 2017

Keywords

  • Discrete wavelet packet transform
  • Robust principal component analysis
  • Short-time Fourier transform
  • Speech enhancement

Fingerprint

Dive into the research topics of 'Wavelet speech enhancement based on robust principal component analysis'. Together they form a unique fingerprint.

Cite this