摘要
A two-stage latent prosody model-language model (LPM-LM)-based approach is proposed to identify two Mandarin accent types spoken by native speakers in Mainland China and Taiwan. The frontend LPM tokenizes and jointly models the affections of speaker, tone and prosody state of an utterance. The backend LM takes the decoded prosody state sequences and builds n-grams to model the prosodic differences of the two accent types. Experimental results on a mixed TRSC and MAT database showed that fusion of the proposed LPM-LM with a SDC/GMM+PPR-LM+UPR-LM baseline system could further reduced the average accent identification error rate from 20.7% to 16.2%. Therefore, the proposed LPM-LM method is a promising approach.
原文 | English |
---|---|
頁面 | 125-135 |
頁數 | 11 |
出版狀態 | Published - 2009 |
事件 | 21st Conference on Computational Linguistics and Speech Processing, ROCLING 2009 - Taichung, 台灣 持續時間: 1 9月 2009 → 2 9月 2009 |
Conference
Conference | 21st Conference on Computational Linguistics and Speech Processing, ROCLING 2009 |
---|---|
國家/地區 | 台灣 |
城市 | Taichung |
期間 | 1/09/09 → 2/09/09 |