Abstract
In this paper, a basic Mandarin broadcast news speech recognition system is constructed using the MATBN database. It considers acoustic modeling for Mandarin base-syllables, particles, and paralinguistic phenomena. It also considers environment-dependent acoustic modeling for three recording environments: studio anchors, outdoor reporters, and outdoor interviewees. Moreover, it incorporates a bigram language model adapted using data in MATBN. Syllable recognition rates of 89.64%, 84.42%, and 61.62% were achieved for the anchor, reporter, and interviewee environments, respectively.
Original language | American English |
---|---|
Pages | 257-260 |
Number of pages | 4 |
DOIs | |
Publication status | Published - 1 Dec 2004 |
Event | 2004 International Symposium on Chinese Spoken Language Processing - Hong Kong, China. Duration: 15 Dec 2004 → 18 Dec 2004 |
Conference
Conference | 2004 International Symposium on Chinese Spoken Language Processing |
---|---|
Country/Region | Hong Kong |
City | Hong Kong, China |
Period | 15/12/04 → 18/12/04 |