Study on the consistency analysis between the prosody and the spectrum for mandarin speech

Cheng Yu Yeh, Kuan Lin Chen, Shaw-Hwa Hwang, Long Jhe Yan

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

In this work, a consistency analysis between the prosody and the spectrum for Mandarin speech is presented. Found by an inspection on the pronunciation process of human beings, the consistency can be interpreted as a close correlated relation of a warping curve between the spectrum and the prosody intra a syllable. Through three steps in the procedure of the consistency analysis, the hidden Markov model (HMM) algorithm is used firstly to decode HMM-state sequences within a syllable at the same time as to divide them into three segments. Secondly, based on a designated syllable, the vector quantisation (VQ) with the Linde-Buzo-Gray algorithm is used to train the VQ codebooks of each segment. Thirdly, the prosodic vector of each segment is encoded as an index by VQ codebooks, and then the probability of each possible path is evaluated as a prerequisite to analyse the consistency. It is demonstrated experimentally that a consistency is definitely acquired in case the syllable is located exactly in the same word. These results offer a research direction that the warping process between the spectrum and the prosody intra a syllable must be considered in a text-to-speech system to improve the speech quality.

Original languageEnglish
Pages (from-to)158-165
Number of pages8
JournalIET Signal Processing
Volume7
Issue number2
DOIs
StatePublished - 1 Jan 2013

Fingerprint

Dive into the research topics of 'Study on the consistency analysis between the prosody and the spectrum for mandarin speech'. Together they form a unique fingerprint.

Cite this