A pitch based VAD adopting quasi-ANSI 1/3 octave filter bank with 11.3 ms latency for monosyllable hearing aids

Yi Cheng Huang, Yi Fan Chiang, Shyh-Jye Jou

    研究成果: Conference contribution同行評審

    2 引文 斯高帕斯(Scopus)

    摘要

    This paper presents a pitch based voice activity detection (PBVAD) algorithm adopting a quasi-ANSI 1/3 octave filter bank which has low group delay for realistic implementation in hearing aids systems. For compensating the drawback of low resolution resulted from quasi-ASNI filter bank, this pitch based VAD algorithm integrals the features of monosyllable speech such as pitch and corresponding harmonics, onset and time of word length. Simulation results reveal that with more harmonics detection, the accuracy of the proposed PBVAD algorithm improves from 78.9% to 87.7%. Additionally, the proposed VAD algorithm is implemented in ANSI filter bank for comparisons. With the integration of features, the result shows the proposed algorithm can achieve similar VAD accuracy, less than 2.5%, in quasi-ANSI filter bank and ANSI filter bank. Thus, the proposed algorithm can tackle the drawback of quasi- ANSI filter bank and is also suitable for ANSI filter bank. Moreover, the latency incurred by quasi-ANSI filter bank and the proposed VAD algorithm is 11.3ms and this satisfies the requirement of HA systems for practical implementation.

    原文English
    主出版物標題2013 IEEE Workshop on Signal Processing Systems, SiPS 2013
    發行者Institute of Electrical and Electronics Engineers Inc.
    頁面48-53
    頁數6
    ISBN(列印)9781467362382
    DOIs
    出版狀態Published - 1 1月 2013
    事件2013 IEEE Workshop on Signal Processing Systems, SiPS 2013 - Taipei, 台灣
    持續時間: 16 10月 201318 10月 2013

    出版系列

    名字IEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation
    ISSN(列印)1520-6130

    Conference

    Conference2013 IEEE Workshop on Signal Processing Systems, SiPS 2013
    國家/地區台灣
    城市Taipei
    期間16/10/1318/10/13

    指紋

    深入研究「A pitch based VAD adopting quasi-ANSI 1/3 octave filter bank with 11.3 ms latency for monosyllable hearing aids」主題。共同形成了獨特的指紋。

    引用此