Improved acoustics modeling for speech recognition using transformation techniques

Carrson Fung, Oscar C. Au, Wanggen Wan, Chi H. Yim, Cyan L. Keung

研究成果: Conference contribution同行評審

摘要

In statistical speech recognition, misclassification often occurs when there is a mismatch between the incoming signal and the acoustics model inside the recognizer. In order to combat this problem, techniques such as Cepstral Mean Subtraction, Vocal Tract Normalization, adaptation and pronunciation model can be used. In this paper, we proposed a new approach based on transformation technique where the output distribution function in the HMM model, a Gaussian probability density function, could be transformed to match the estimated distribution of the incoming signal by using a memoryless invertible nonlinearity function. Since the new density still has a Gaussian form, the function could be completely characterized by using the Expectation Maximization (EM) algorithm.

原文English
主出版物標題6th International Conference on Spoken Language Processing, ICSLP 2000
發行者International Speech Communication Association
頁數3
ISBN(電子)7801501144, 9787801501141
出版狀態Published - 10月 2000
事件6th International Conference on Spoken Language Processing, ICSLP 2000 - Beijing, China
持續時間: 16 10月 200020 10月 2000

出版系列

名字6th International Conference on Spoken Language Processing, ICSLP 2000

Conference

Conference6th International Conference on Spoken Language Processing, ICSLP 2000
國家/地區China
城市Beijing
期間16/10/0020/10/00

指紋

深入研究「Improved acoustics modeling for speech recognition using transformation techniques」主題。共同形成了獨特的指紋。

引用此