TY - JOUR
T1 - A mismatch-aware stochastic matching algorithm for robust speech recognition
AU - Liao, Yuan Fu
AU - Lin, Jeng Shien
AU - Chen, Sin-Horng
PY - 2003/4/6
Y1 - 2003/4/6
N2 - In this paper, we present a mismatch-aware stochastic matching (MASM) algorithm to alleviate the performance degradation under mismatched training and testing conditions. MASM first computes a reliability measure of applying a set of pre-trained speech models to a mismatch test utterance along the time axis or among different feature vector components. It then estimates and compensates the mismatch using the reliability measure to guide the speech segmentation. Experiments on a serious mismatched condition with training on PSTN-speech database and testing on mobile GSM-speech database showed that MASM outperformed the stochastic match (SM) method, especially, for short utterances.
AB - In this paper, we present a mismatch-aware stochastic matching (MASM) algorithm to alleviate the performance degradation under mismatched training and testing conditions. MASM first computes a reliability measure of applying a set of pre-trained speech models to a mismatch test utterance along the time axis or among different feature vector components. It then estimates and compensates the mismatch using the reliability measure to guide the speech segmentation. Experiments on a serious mismatched condition with training on PSTN-speech database and testing on mobile GSM-speech database showed that MASM outperformed the stochastic match (SM) method, especially, for short utterances.
UR - http://www.scopus.com/inward/record.url?scp=0141702092&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2003.1202304
DO - 10.1109/ICASSP.2003.1202304
M3 - Conference article
AN - SCOPUS:0141702092
SN - 1520-6149
VL - 2
SP - 101
EP - 104
JO - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
JF - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
M1 - 1202304
T2 - 2003 IEEE International Conference on Accoustics, Speech, and Signal Processing
Y2 - 6 April 2003 through 10 April 2003
ER -