Segmental contribution to predicting speech intelligibility in noisy conditions

Lei Wang, Fei Chen, Ying Hui Lai

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

It is necessary to identify speech segments carrying important information for speech intelligibility, particularly in noise. Earlier work based on a relative rootmean-square (RMS) level based segmentation suggested that middle-level (ranging from the overall RMS level to 10 dB below) segments contained more vowel-consonant boundaries wherein the spectral change was often most prominent, and perhaps most robust, in the presence of noise, and hence yielded improved performance of objective intelligibility modeling. Since the three levels (i.e., high-, middle- and low-levels) were defined empirically when proposed, the present work assessed how the boundaries of RMS-level based segmentation affected the performance of speech intelligibility prediction. When evaluated with speech recognition scores obtained with normal-hearing listeners and with a total of 72 noisedistorted and noise-suppressed conditions, it was shown that choosing 0 and - 10 dB to split middle-level led to maximized correlation in predicting the intelligibility of speech in noise.

Original languageEnglish
Title of host publicationProceedings - 2016 IEEE 2nd International Conference on Multimedia Big Data, BigMM 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages476-480
Number of pages5
ISBN (Electronic)9781509021789
DOIs
StatePublished - 16 Aug 2016
Event2nd IEEE International Conference on Multimedia Big Data, BigMM 2016 - Taipei, Taiwan
Duration: 20 Apr 201622 Apr 2016

Publication series

NameProceedings - 2016 IEEE 2nd International Conference on Multimedia Big Data, BigMM 2016

Conference

Conference2nd IEEE International Conference on Multimedia Big Data, BigMM 2016
Country/TerritoryTaiwan
CityTaipei
Period20/04/1622/04/16

Keywords

  • Intelligibility prediction
  • Relative RMS-level based segmentation
  • Speech intelligibility

Fingerprint

Dive into the research topics of 'Segmental contribution to predicting speech intelligibility in noisy conditions'. Together they form a unique fingerprint.

Cite this