摘要
In this paper, a novel model-based pitch conversion method for Mandarin is presented and compared with other two conventional conversion methods, i.e. the mean/variance transformation approach and the GMM-based mapping approach. Syllable pitch contour is first quantized by 3 rd order orthogonal expansion coefficients; then, the source and target speakers' prosodic models are constructed, respectively. Two mapping methods based on the prosodic model are presented. Objective tests confirmed that one of the proposed methods are superior the conventional methods. Some findings in informal listening tests and objective tests are worthwhile to further investigate.
原文 | English |
---|---|
頁(從 - 到) | 2643-2646 |
頁數 | 4 |
期刊 | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
出版狀態 | Published - 26 11月 2009 |
事件 | 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009 - Brighton, 英國 持續時間: 6 9月 2009 → 10 9月 2009 |