TY - JOUR
T1 - A Data-driven Approach to Constructing a Prosodic Grammar for Mandarin Read Speech
AU - Hong, Yu Siang
AU - Chen, Sin Horng
N1 - Publisher Copyright:
© 2022 International Speech Communications Association. All rights reserved.
PY - 2022
Y1 - 2022
N2 - A new approach to constructing a prosodic grammar of Mandarin read speech which describes the mapping from syntactic patterns to prosodic patterns is proposed. It first prepares a large read-speech corpus with syntactic-tree parsing for texts and break-index labeling representing four-layer non-recursive prosodic hierarchical structures for utterances. Then, all realizations of syntactic pattern-break pattern pairs are extracted to learn prosodic grammatical rules. For a syntactic pattern, rules are inferred via calculating the break-type distributions of Pre-B, Post-B, and intra-pattern word junctures from these realizations. In the study, we only investigate the prosodic grammatical rules for four syntactic patterns of determinative-measure (DM) compound, DM+N, DM+DE+N and DM+Modifier+DE+N to verify the feasibility of the proposed approach. With considering the ten syntactic functions of Subject, Object, Topic, Head, Modifier, Attributive, Noun Predicate, Quantitative Complement, and embedded in DE phrase and in Prepositional Phrase, the entropies of pre- and post-boundaries of these four syntactic patterns are reduced significantly. Moreover, detailed rules are inferred via exploring linguistic/semantic interpretations for the occurrence of main prosodic pattern and outliers using the information of phonetic constituents and contexts of syntactic-pattern realizations. Some important factors such as length and semantic relation are found to seriously affect the syntax-prosody mapping.
AB - A new approach to constructing a prosodic grammar of Mandarin read speech which describes the mapping from syntactic patterns to prosodic patterns is proposed. It first prepares a large read-speech corpus with syntactic-tree parsing for texts and break-index labeling representing four-layer non-recursive prosodic hierarchical structures for utterances. Then, all realizations of syntactic pattern-break pattern pairs are extracted to learn prosodic grammatical rules. For a syntactic pattern, rules are inferred via calculating the break-type distributions of Pre-B, Post-B, and intra-pattern word junctures from these realizations. In the study, we only investigate the prosodic grammatical rules for four syntactic patterns of determinative-measure (DM) compound, DM+N, DM+DE+N and DM+Modifier+DE+N to verify the feasibility of the proposed approach. With considering the ten syntactic functions of Subject, Object, Topic, Head, Modifier, Attributive, Noun Predicate, Quantitative Complement, and embedded in DE phrase and in Prepositional Phrase, the entropies of pre- and post-boundaries of these four syntactic patterns are reduced significantly. Moreover, detailed rules are inferred via exploring linguistic/semantic interpretations for the occurrence of main prosodic pattern and outliers using the information of phonetic constituents and contexts of syntactic-pattern realizations. Some important factors such as length and semantic relation are found to seriously affect the syntax-prosody mapping.
KW - Mandarin read speech
KW - determinative-measure compound
KW - prosodic grammar
KW - syntax-prosody mapping
UR - http://www.scopus.com/inward/record.url?scp=85146370050&partnerID=8YFLogxK
U2 - 10.21437/SpeechProsody.2022-179
DO - 10.21437/SpeechProsody.2022-179
M3 - Conference article
AN - SCOPUS:85146370050
SN - 2333-2042
VL - 2022-May
SP - 881
EP - 885
JO - Proceedings of the International Conference on Speech Prosody
JF - Proceedings of the International Conference on Speech Prosody
T2 - 11th International Conference on Speech Prosody, Speech Prosody 2022
Y2 - 23 May 2022 through 26 May 2022
ER -