Dysarthric Speech Enhancement Based on Convolution Neural Network

Syu Siang Wang, Yu Tsao, Wei Zhong Zheng, Hsiu Wei Yeh, Pei Chun Li, Shih Hau Fang, Ying Hui Lai

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Generally, those patients with dysarthria utter a distorted sound and the restrained intelligibility of a speech for both human and machine. To enhance the intelligibility of dysarthric speech, we applied a deep learning-based speech enhancement (SE) system in this task. Conventional SE approaches are used for shrinking noise components from the noise-corrupted input, and thus improve the sound quality and intelligibility simultaneously. In this study, we are focusing on reconstructing the severely distorted signal from the dysarthric speech for improving intelligibility. The proposed SE system prepares a convolutional neural network (CNN) model in the training phase, which is then used to process the dysarthric speech in the testing phase. During training, paired dysarthric-normal speech utterances are required. We adopt a dynamic time warping technique to align the dysarthric-normal utter-ances. The gained training data are used to train a CNN - based SE model. The proposed SE system is evaluated on the Google automatic speech recognition (ASR) system and a subjective listening test. The results showed that the proposed method could notably enhance the recognition performance for more than 10% in each of ASR and human recognitions from the unprocessed dysarthric speech.

Original languageEnglish
Title of host publication44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages60-64
Number of pages5
ISBN (Electronic)9781728127828
DOIs
StatePublished - 2022
Event44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2022 - Glasgow, United Kingdom
Duration: 11 Jul 202215 Jul 2022

Publication series

NameProceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
Volume2022-July
ISSN (Print)1557-170X

Conference

Conference44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2022
Country/TerritoryUnited Kingdom
CityGlasgow
Period11/07/2215/07/22

Fingerprint

Dive into the research topics of 'Dysarthric Speech Enhancement Based on Convolution Neural Network'. Together they form a unique fingerprint.

Cite this