Online speaker adaptation based on quasi-Bayes linear regression

Jen-Tzung Chien*, C. H. Huang

*此作品的通信作者

研究成果: Conference article同行評審

摘要

This paper presents an online/sequential linear regression adaptation framework for hidden Markov model (HMM) based speech recognition. Our attempt is to sequentially improve speaker-independent (SI) speech recognizer to meet nonstationary environments via linear regression adaptation of SI HMM's. A quasi-Bayes linear regression (QBLR) algorithm is developed to execute online adaptation where the regression matrix is estimated using QB theory. In the estimation, we moderately specify the prior density of regression matrix as a matrix variate normal distribution and exactly derive the pooled posterior density belonging to the same distribution family. Accordingly, the optimal regression matrix can be easily calculated. Also, the reproducible prior/posterior density pair provides meaningful mechanism for sequential learning of prior statistics. At each sequential epoch, only the updated prior statistics and the current observed data are required for adaptation. In general, the proposed QBLR is universal and can be reduced to well-known maximum likelihood linear regression (MLLR) and maximum a posteriori linear regression (MAPLR). Experiments show that the QBLR is effective for speaker adaptation in car environments.

原文English
頁(從 - 到)329-332
頁數4
期刊ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
1
DOIs
出版狀態Published - 26 九月 2001
事件2001 IEEE Interntional Conference on Acoustics, Speech, and Signal Processing - Salt Lake, UT, United States
持續時間: 7 五月 200111 五月 2001

指紋

深入研究「Online speaker adaptation based on quasi-Bayes linear regression」主題。共同形成了獨特的指紋。

引用此