TY - GEN
T1 - A pitch based VAD adopting quasi-ANSI 1/3 octave filter bank with 11.3 ms latency for monosyllable hearing aids
AU - Huang, Yi Cheng
AU - Chiang, Yi Fan
AU - Jou, Shyh-Jye
PY - 2013/1/1
Y1 - 2013/1/1
N2 - This paper presents a pitch-based voice activity detection (PBVAD) algorithm that adopts a quasi-ANSI 1/3-octave filter bank, whose low group delay makes it realistic to implement in hearing aid (HA) systems. To compensate for the low resolution of the quasi-ANSI filter bank, the algorithm integrates features of monosyllabic speech such as pitch and its corresponding harmonics, onset, and word duration. Simulation results show that detecting more harmonics improves the accuracy of the proposed PBVAD algorithm from 78.9% to 87.7%. For comparison, the proposed VAD algorithm is also implemented with an ANSI filter bank. With the features integrated, the algorithm achieves similar VAD accuracy with the quasi-ANSI and ANSI filter banks, differing by less than 2.5%. Thus, the proposed algorithm can overcome the drawback of the quasi-ANSI filter bank and is also suitable for the ANSI filter bank. Moreover, the latency incurred by the quasi-ANSI filter bank and the proposed VAD algorithm is 11.3 ms, which satisfies the requirement of HA systems for practical implementation.
AB - This paper presents a pitch-based voice activity detection (PBVAD) algorithm that adopts a quasi-ANSI 1/3-octave filter bank, whose low group delay makes it realistic to implement in hearing aid (HA) systems. To compensate for the low resolution of the quasi-ANSI filter bank, the algorithm integrates features of monosyllabic speech such as pitch and its corresponding harmonics, onset, and word duration. Simulation results show that detecting more harmonics improves the accuracy of the proposed PBVAD algorithm from 78.9% to 87.7%. For comparison, the proposed VAD algorithm is also implemented with an ANSI filter bank. With the features integrated, the algorithm achieves similar VAD accuracy with the quasi-ANSI and ANSI filter banks, differing by less than 2.5%. Thus, the proposed algorithm can overcome the drawback of the quasi-ANSI filter bank and is also suitable for the ANSI filter bank. Moreover, the latency incurred by the quasi-ANSI filter bank and the proposed VAD algorithm is 11.3 ms, which satisfies the requirement of HA systems for practical implementation.
KW - Hearing aids
KW - Mandarin
KW - Non-stationary
KW - Pitch
KW - Voice activity detection
UR - http://www.scopus.com/inward/record.url?scp=84896447598&partnerID=8YFLogxK
U2 - 10.1109/SiPS.2013.6674479
DO - 10.1109/SiPS.2013.6674479
M3 - Conference contribution
AN - SCOPUS:84896447598
SN - 9781467362382
T3 - IEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation
SP - 48
EP - 53
BT - 2013 IEEE Workshop on Signal Processing Systems, SiPS 2013
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2013 IEEE Workshop on Signal Processing Systems, SiPS 2013
Y2 - 16 October 2013 through 18 October 2013
ER -