TY - JOUR
T1 - Binary mask estimation based on frequency modulations
AU - Hsu, Chung Chien
AU - Chien, Jen-Tzung
AU - Chi, Tai-Shih
N1 - Publisher Copyright:
Copyright © 2014 ISCA.
PY - 2014
Y1 - 2014
N2 - In this paper, a binary mask estimation algorithm is proposed based on modulations of speech. A multi-resolution spectrotemporal analytical auditory model is utilized to extract modulation features to estimate the binary mask, which is often used in speech segregation applications. The proposed method estimates noise from the beginning of each test sentence, a common approach seen in many conventional speech enhancement algorithms, to further enhance the modulation features. Experimental results demonstrate that the proposed method outperforms the AMS-GMM system in terms of the HIT-FA rate when estimating the binary mask.s
AB - In this paper, a binary mask estimation algorithm is proposed based on modulations of speech. A multi-resolution spectrotemporal analytical auditory model is utilized to extract modulation features to estimate the binary mask, which is often used in speech segregation applications. The proposed method estimates noise from the beginning of each test sentence, a common approach seen in many conventional speech enhancement algorithms, to further enhance the modulation features. Experimental results demonstrate that the proposed method outperforms the AMS-GMM system in terms of the HIT-FA rate when estimating the binary mask.s
KW - Frequency modulation
KW - Mask estimation
KW - Spectro-temporal modulation
UR - http://www.scopus.com/inward/record.url?scp=84910059076&partnerID=8YFLogxK
U2 - 10.21437/interspeech.2014-261
DO - 10.21437/interspeech.2014-261
M3 - Conference article
AN - SCOPUS:84910059076
SN - 2308-457X
SP - 993
EP - 997
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014
Y2 - 14 September 2014 through 18 September 2014
ER -