Abstract
In this paper, a basic Mandarin broadcast news speech recognition system is constructed using the MATBN database. It considers the acoustic modeling for Mandarin base-syllables, particles, and paralinguistic phenomena. It also considers environment-dependent acoustic modeling for three recording environments: studio anchors, outdoor reporters, and outdoor interviewee. Moreover, it incorporates a bigram language model with adaptation using data in MATBN. Syllable recognition rates of 89.64, 84.42and 61.62% were achieved for the three environments of anchors, reporters and interviewees, respectively.
Original language | American English |
---|---|
Pages | 257-260 |
Number of pages | 4 |
DOIs | |
State | Published - 1 Dec 2004 |
Event | 2004 International Symposium on Chinese Spoken Language Processing - Hong Kong, China, Hong Kong Duration: 15 Dec 2004 → 18 Dec 2004 |
Conference
Conference | 2004 International Symposium on Chinese Spoken Language Processing |
---|---|
Country/Territory | Hong Kong |
City | Hong Kong, China |
Period | 15/12/04 → 18/12/04 |