MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine

Endri Taka*, Aman Arora, Kai Chiang Wu, Diana Marculescu

*此作品的通信作者

研究成果: Conference contribution同行評審

5 引文 斯高帕斯(Scopus)

摘要

The increasing computational and memory requirements of Deep Learning (DL) workloads has led to outstanding innovations in hardware architectures. An archetype of such architectures is the novel Versal AI Engine (AIE) by AMD/Xilinx. The AIE comprises multiple programmable processors optimized for vector-based algorithms. An AIE array consisting of 400 processor cores, operating at 1.25 GHz is able to deliver a peak throughput of S TFLOPs for 32-bit floating-point (fp32), and 128 TOPs for 8-bit integer (int8) precision. In this work, we propose MaxEVA: a novel framework to efficiently map Matrix Multiplication (MatMul) workloads on Versal AIE devices. Our framework maximizes the performance and energy efficiency of MatMul applications by efficiently exploiting features of the AIE architecture and resolving performance bottlenecks from multiple angles. When demonstrating on the VC1902 device of the VCK190 board, MaxEVA accomplishes up to 5.44 TFLOPs and 77.01 TOPs throughput for fp32 and intS precisions, respectively. In terms of energy efficiency, MaxEVA attains up to 124.16 GFLOPs/W for fp32, and 1.16 TOPs/W for intS. Our proposed method substantially outperforms the state-of-the-art approach by exhibiting up to 2.19× throughput gain and 20.4% higher energy efficiency. The MaxEVA framework provides notable insights to fill the knowledge gap in effectively designing MatMul-based DL workloads on the new Versal AIE devices.

原文English
主出版物標題Proceedings - 2023 International Conference on Field-Programmable Technology, ICFPT 2023
發行者Institute of Electrical and Electronics Engineers Inc.
頁面96-105
頁數10
ISBN(電子)9798350359114
DOIs
出版狀態Published - 2023
事件22nd International Conference on Field-Programmable Technology, ICFPT 2023 - Yokohama, 日本
持續時間: 12 12月 202314 12月 2023

出版系列

名字Proceedings - International Conference on Field-Programmable Technology, ICFPT
ISSN(列印)2837-0430
ISSN(電子)2837-0449

Conference

Conference22nd International Conference on Field-Programmable Technology, ICFPT 2023
國家/地區日本
城市Yokohama
期間12/12/2314/12/23

指紋

深入研究「MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine」主題。共同形成了獨特的指紋。

引用此