A locality-aware dynamic thread scheduler for GPGPUs

Yu Hao Huang*, Ying Yu Tseng, Hsien Kai Kuo, Ta Kan Yen, Bo Cheng Charles Lai

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    1 Scopus citations

    Abstract

    Modern GPGPUs implement on-chip shared cache to better exploit the data reuse of various general purpose applications. Given the massive amount of concurrent threads in a GPGPU, striking the balance between Data Locality and Load Balance has become a critical design concern. To achieve the best performance, the trade-off between these two factors needs to be performed concurrently. This paper proposes a dynamic thread scheduler which co-optimizes both the data locality and load balance on a GPGPU. The proposed approach is evaluated using three applications with various input datasets. The results show that the proposed approach reduces the overall execution cycles by up to 16% when compared with other approaches concerning only one objective.

    Original languageEnglish
    Title of host publicationParallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings
    EditorsShi-Jinn Horng
    PublisherIEEE Computer Society
    Pages254-258
    Number of pages5
    ISBN (Electronic)9781479924189
    DOIs
    StatePublished - 18 Sep 2014
    Event14th International Conference on Parallel and Distributed Computing, Applications and Technologies, PDCAT 2013 - Taipei, Taiwan
    Duration: 16 Dec 201318 Dec 2013

    Publication series

    NameParallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings

    Conference

    Conference14th International Conference on Parallel and Distributed Computing, Applications and Technologies, PDCAT 2013
    Country/TerritoryTaiwan
    CityTaipei
    Period16/12/1318/12/13

    Keywords

    • Data Locality
    • GPU
    • parallel computing

    Fingerprint

    Dive into the research topics of 'A locality-aware dynamic thread scheduler for GPGPUs'. Together they form a unique fingerprint.

    Cite this