Classifier grouping to enhance data locality for a multi-threaded object detection algorithm

Bo-Cheng Lai*, Chih Hsuan Chiang, Guan Ru Li

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    4 Scopus citations

    Abstract

    Object detection has become an enabling function for modern smart embedded devices to perform intelligent applications and interact with the environment appropriately and promptly. However, the limited computation resource of embedded devices has become a barrier to execute the computation intensive object detection algorithm. Leveraging the multi-threading scheme on embedded multi-core systems provides an opportunity to boost the performance. However, the memory bottleneck limits the performance scalability. Improving data locality of applications and maximizing the data reuse for on-chip caches have therefore become critical design concerns. This paper comprehensively analyzes the memory behavior and data locality of a multi-threaded object detection algorithm. A novel Classifier-Grouping scheme is proposed to significantly enhance the data reuse for on-chip caches of embedded multicore systems. By executing a multi-threaded object detection algorithm on a cycle-accurate multi-core simulator, the proposed approach can achieve up to 62% better performance when compared with the original parallel program.

    Original languageEnglish
    Title of host publicationProceedings - 2011 17th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2011
    Pages268-275
    Number of pages8
    DOIs
    StatePublished - 2011
    Event2011 17th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2011 - Tainan, Taiwan
    Duration: 7 Dec 20119 Dec 2011

    Publication series

    NameProceedings of the International Conference on Parallel and Distributed Systems - ICPADS
    ISSN (Print)1521-9097

    Conference

    Conference2011 17th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2011
    Country/TerritoryTaiwan
    CityTainan
    Period7/12/119/12/11

    Keywords

    • Data locality
    • Embedded device
    • Multi-core
    • Object detection
    • Parallel processing

    Fingerprint

    Dive into the research topics of 'Classifier grouping to enhance data locality for a multi-threaded object detection algorithm'. Together they form a unique fingerprint.

    Cite this