Factor analysis of acoustic features for streamed hidden Markov modeling

Chuan Wei Ting*, Jen-Tzung Chien

*此作品的通信作者

研究成果同行評審

5 引文 斯高帕斯(Scopus)

摘要

This paper presents a new streamed hidden Markov model (HMM) framework for speech recognition. The factor analysis (FA) is performed to discover the common factors of acoustic features. The streaming regularities are governed by the correlation between features, which is inherent in common factors. Those features corresponding to the same factor are generated by identical HMM state. Accordingly, we use multiple Markov chains to represent the variation trends in cepstral features. We develop a FA streamed HMM (FASHMM) and go beyond the conventional HMM assuming that all features at a speech frame conduct the same state emission. This streamed HMM is more delicate than the factorial HMM where the streaming was empirically determined. We also exploit a new decoding algorithm for FASHMM speech recognition. In this manner, we fulfill the flexible Markov chains for an input sequence of multivariate Gaussian mixture observations. In the experiments, the proposed method can reduce word error rate by 36% at most.

原文English
頁面30-35
頁數6
DOIs
出版狀態Published - 1 12月 2007
事件2007 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2007 - Kyoto, 日本
持續時間: 9 12月 200713 12月 2007

Conference

Conference2007 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2007
國家/地區日本
城市Kyoto
期間9/12/0713/12/07

指紋

深入研究「Factor analysis of acoustic features for streamed hidden Markov modeling」主題。共同形成了獨特的指紋。

引用此