A Multi-Bit Near-RRAM based Computing Macro with Highly Computing Parallelism for CNN Application

Kuan Chih Lin, Hao Zuo, Hsiang Yu Wang, Yuan Ping Huang, Ci Hao Wu, Yan Cheng Guo, Shyh Jye Jou, Tuo Hung Hou, Tian Sheuan Chang

Research output: Conference contribution, peer-reviewed

Abstract

Resistive random-access memory (RRAM) based compute-in-memory (CIM) is an emerging approach to the practical implementation of artificial intelligence (AI) on resource-constrained edge devices, because it reduces the power-hungry data transfer between memory and processing unit. However, state-of-the-art RRAM CIM designs fail to strike a balance between precision, energy efficiency, throughput, and latency. This work merges the techniques of CIM and compute-near-memory (CNM) to deliver high precision, high energy efficiency, high throughput, and low latency. In this paper, a 256 Kb RRAM-based CNM macro fabricated in a TSMC 40 nm process is presented, featuring: 1) opposite weight mapping with a variation-robust SA to mitigate the impact of RRAM device variations on multiply-accumulate (MAC) results; 2) a switched-capacitor-based analog multiplication circuit to achieve highly parallel computing of 128 4-bit by 4-bit MAC results with low power consumption and high operation speed; and 3) joint optimization of hardware and software to compensate for the accuracy loss arising from circuit non-idealities. The macro achieves a low latency of 17 ns and a high energy efficiency of 71 TOPS/W for MAC operations with 4-bit input, 4-bit weight, and 4-bit output precision. It is used to accelerate the convolution process in the Light-CSPDenseNet AI model, resulting in a high accuracy of 86.33% on the Visual Wake Words dataset.

Original language: English
Title of host publication: 2024 Design, Automation and Test in Europe Conference and Exhibition, DATE 2024 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9798350348590
Publication status: Published - 2024
Event: 2024 Design, Automation and Test in Europe Conference and Exhibition, DATE 2024 - Valencia, Spain
Duration: 25 Mar 2024 to 27 Mar 2024

Publication series

Name: Proceedings - Design, Automation and Test in Europe, DATE
ISSN (Print): 1530-1591

Conference

Conference: 2024 Design, Automation and Test in Europe Conference and Exhibition, DATE 2024
Country/Territory: Spain
City: Valencia
Period: 25/03/24 to 27/03/24
