TY - JOUR
T1 - A two-stage sample-based phone boundary detector using segmental similarity features
AU - Wang, Yih-Ru
PY - 2011/8/27
Y1 - 2011/8/27
N2 - In this paper, a two-stage sample-based phone boundary detection algorithm is proposed. In the first stage, some local sample-based acoustic parameters are used to pre-select some phone boundary candidates. Then, in the second stage, some high-order statistics of the log-likelihood differences of two adjacent speech segments around each boundary candidate are calculated to serve as similarity measure for candidate verification. Experimental results on the TIMIT speech corpus showed that EERs of 8.6% and 7.6% were achieved for onestage and two-stage sample-based phone boundary detections, respectively. Moreover, for the two-stage system, 42.1% and 81.9% of boundaries detected were within 5- and 15-sample error tolerance from manual labeling results.
AB - In this paper, a two-stage sample-based phone boundary detection algorithm is proposed. In the first stage, some local sample-based acoustic parameters are used to pre-select some phone boundary candidates. Then, in the second stage, some high-order statistics of the log-likelihood differences of two adjacent speech segments around each boundary candidate are calculated to serve as similarity measure for candidate verification. Experimental results on the TIMIT speech corpus showed that EERs of 8.6% and 7.6% were achieved for onestage and two-stage sample-based phone boundary detections, respectively. Moreover, for the two-stage system, 42.1% and 81.9% of boundaries detected were within 5- and 15-sample error tolerance from manual labeling results.
KW - Phone boundary detection
KW - Similarity measure
UR - http://www.scopus.com/inward/record.url?scp=84865747216&partnerID=8YFLogxK
U2 - 10.21437/interspeech.2011-163
DO - 10.21437/interspeech.2011-163
M3 - Conference article
AN - SCOPUS:84865747216
SN - 2308-457X
SP - 413
EP - 416
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
Y2 - 27 August 2011 through 31 August 2011
ER -