TY - JOUR
T1 - A Quantitative Method to Data Reuse Patterns of SIMT Applications
AU - Lai, Bo-Cheng
AU - Garrido Platero, Luis
AU - Kuo, Hsien Kai
PY - 2016/7/1
Y1 - 2016/7/1
N2 - Understanding data reuse patterns of a computing system is crucial to effective design optimization. The emerging Single Instruction Multiple Threads (SIMT) processor adopts a programming model that is fundamentally disparate from conventional scalar processors. There is a lack of analytical approaches to quantify the data reuse of SIMT applications. This paper presents a quantitative method to study the data reuse inherent to SIMT applications. A metric, Data Reuse Degree, is defined to measure the amount of reused data between memory references, and associate each data reuse degree to a temporal distance representing the virtual time of the execution process. The experiments are performed on an abstracted SIMT processor that considers the programming model and runtime specifics. The experiments illustrate diverse data reuse patterns of SIMT applications and explore the impacts of architectural limitations.
AB - Understanding data reuse patterns of a computing system is crucial to effective design optimization. The emerging Single Instruction Multiple Threads (SIMT) processor adopts a programming model that is fundamentally disparate from conventional scalar processors. There is a lack of analytical approaches to quantify the data reuse of SIMT applications. This paper presents a quantitative method to study the data reuse inherent to SIMT applications. A metric, Data Reuse Degree, is defined to measure the amount of reused data between memory references, and associate each data reuse degree to a temporal distance representing the virtual time of the execution process. The experiments are performed on an abstracted SIMT processor that considers the programming model and runtime specifics. The experiments illustrate diverse data reuse patterns of SIMT applications and explore the impacts of architectural limitations.
KW - Parallel architectures, cache memory, parallel processing
UR - http://www.scopus.com/inward/record.url?scp=85027317513&partnerID=8YFLogxK
U2 - 10.1109/LCA.2015.2491279
DO - 10.1109/LCA.2015.2491279
M3 - Article
AN - SCOPUS:85027317513
SN - 1556-6056
VL - 15
SP - 73
EP - 76
JO - IEEE Computer Architecture Letters
JF - IEEE Computer Architecture Letters
IS - 2
M1 - 7299295
ER -