An Energy-Efficient Ring-Based CIM Accelerator using High-Linearity eNVM for Deep Neural Networks

Po-Tsang Huang, Ting Wei Liu, Wei Lu, Yu Hsien Lin, Wei Hwang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Computation-in-memory (CIM) accelerators reduce the energy consumption of weight accesses from off-chip memory by storing synaptic weights into on-chip embedded NVM (eNVM) devices, such as RRAM, charge-Trap transistor and FeFET. However, for mapping a deep neural network (DNN) more than 20 layers into an eNVM-based accelerator, the throughput, energy-efficiency and accuracy are limited due to the non-linearity of weights and energy-consuming weight updating. In this work, the proposed CIM accelerator exploits low-voltage and high-linearity eNVM to reach both the power efficiency of weight updating and the high accuracy. By adopting the layer-level weight stationary, mini-Array clusters and a ring-based architecture, the resource utilization of eNVM devices is increased. In addition, channel-wise weight mapping schemes for standard convolution and pointwise convolution can support the structure pruning technique of DNNs. The proposed accelerator achieves 1.814 TOPS/W with only 4.7% accuracy loss on YOLOv3 by Ni-crystal RRAM.

Original languageEnglish
Title of host publicationProceedings - International SoC Design Conference 2021, ISOCC 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages260-261
Number of pages2
ISBN (Electronic)9781665401746
DOIs
StatePublished - 2021
Event18th International System-on-Chip Design Conference, ISOCC 2021 - Jeju Island, Korea, Republic of
Duration: 6 Oct 20219 Oct 2021

Publication series

NameProceedings - International SoC Design Conference 2021, ISOCC 2021

Conference

Conference18th International System-on-Chip Design Conference, ISOCC 2021
Country/TerritoryKorea, Republic of
CityJeju Island
Period6/10/219/10/21

Fingerprint

Dive into the research topics of 'An Energy-Efficient Ring-Based CIM Accelerator using High-Linearity eNVM for Deep Neural Networks'. Together they form a unique fingerprint.

Cite this