A pitch based VAD adopting quasi-ANSI 1/3 octave filter bank with 11.3 ms latency for monosyllable hearing aids

Yi Cheng Huang, Yi Fan Chiang, Shyh-Jye Jou

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    2 Scopus citations

    Abstract

    This paper presents a pitch based voice activity detection (PBVAD) algorithm adopting a quasi-ANSI 1/3 octave filter bank which has low group delay for realistic implementation in hearing aids systems. For compensating the drawback of low resolution resulted from quasi-ASNI filter bank, this pitch based VAD algorithm integrals the features of monosyllable speech such as pitch and corresponding harmonics, onset and time of word length. Simulation results reveal that with more harmonics detection, the accuracy of the proposed PBVAD algorithm improves from 78.9% to 87.7%. Additionally, the proposed VAD algorithm is implemented in ANSI filter bank for comparisons. With the integration of features, the result shows the proposed algorithm can achieve similar VAD accuracy, less than 2.5%, in quasi-ANSI filter bank and ANSI filter bank. Thus, the proposed algorithm can tackle the drawback of quasi- ANSI filter bank and is also suitable for ANSI filter bank. Moreover, the latency incurred by quasi-ANSI filter bank and the proposed VAD algorithm is 11.3ms and this satisfies the requirement of HA systems for practical implementation.

    Original languageEnglish
    Title of host publication2013 IEEE Workshop on Signal Processing Systems, SiPS 2013
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages48-53
    Number of pages6
    ISBN (Print)9781467362382
    DOIs
    StatePublished - 1 Jan 2013
    Event2013 IEEE Workshop on Signal Processing Systems, SiPS 2013 - Taipei, Taiwan
    Duration: 16 Oct 201318 Oct 2013

    Publication series

    NameIEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation
    ISSN (Print)1520-6130

    Conference

    Conference2013 IEEE Workshop on Signal Processing Systems, SiPS 2013
    Country/TerritoryTaiwan
    CityTaipei
    Period16/10/1318/10/13

    Keywords

    • Hearing aids
    • Mandarin
    • Non-stationary
    • Pitch
    • Voice activity detection

    Fingerprint

    Dive into the research topics of 'A pitch based VAD adopting quasi-ANSI 1/3 octave filter bank with 11.3 ms latency for monosyllable hearing aids'. Together they form a unique fingerprint.

    Cite this