TY - JOUR
T1 - Compressing three-dimensional sparse arrays using inter- and intra-task parallelization strategies on Intel Xeon and Xeon Phi
AU - Lin, Chun Yuan
AU - Yen, Huang Ting
AU - Hung, Che Lun
N1 - Publisher Copyright:
© 2016, Springer Science+Business Media New York.
PY - 2017/8/1
Y1 - 2017/8/1
N2 - Array operations are useful in a lot of scientific codes. In recent years, several applications, such as the geological analysis and the medical images processing, are processed using array operations for three-dimensional (abbreviate to “3D”) sparse arrays. Due to the huge computation time, it is necessary to compress 3D sparse arrays and use parallel computing technologies to speed up sparse array operations. How to compress the sparse arrays efficiently is an important task for practical applications. Hence, in this paper, two strategies, inter- and intra-task parallelization (abbreviate to “ETP” and “RTP”), are presented to compress 3D sparse arrays, respectively. Each strategy was designed and implemented on Intel Xeon and Xeon Phi, respectively. From experimental results, the ETP strategy achieves 17.5× and 18.2× speedup ratios based on Intel Xeon E5-2670 v2 and Intel Xeon Phi SE10X, respectively; 4.5× and 4.5× speedup ratios for the RTP strategy based on these two environments, respectively.
AB - Array operations are useful in a lot of scientific codes. In recent years, several applications, such as the geological analysis and the medical images processing, are processed using array operations for three-dimensional (abbreviate to “3D”) sparse arrays. Due to the huge computation time, it is necessary to compress 3D sparse arrays and use parallel computing technologies to speed up sparse array operations. How to compress the sparse arrays efficiently is an important task for practical applications. Hence, in this paper, two strategies, inter- and intra-task parallelization (abbreviate to “ETP” and “RTP”), are presented to compress 3D sparse arrays, respectively. Each strategy was designed and implemented on Intel Xeon and Xeon Phi, respectively. From experimental results, the ETP strategy achieves 17.5× and 18.2× speedup ratios based on Intel Xeon E5-2670 v2 and Intel Xeon Phi SE10X, respectively; 4.5× and 4.5× speedup ratios for the RTP strategy based on these two environments, respectively.
KW - Accelerator
KW - Data compression method
KW - Multicomputer
KW - Multiprocessor
KW - Parallel processing
KW - Sparse array operation
UR - http://www.scopus.com/inward/record.url?scp=84979234324&partnerID=8YFLogxK
U2 - 10.1007/s11227-016-1820-x
DO - 10.1007/s11227-016-1820-x
M3 - Article
AN - SCOPUS:84979234324
SN - 0920-8542
VL - 73
SP - 3391
EP - 3410
JO - Journal of Supercomputing
JF - Journal of Supercomputing
IS - 8
ER -