Precision-Aware Workload Distribution and Dataflow for a Hybrid Digital-CIM Deep CNN Accelerator

Jui I. Kao, Wei Lu, Po Tsang Huang, Hung Ming Chen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

SRAM-based Computing-in-memory (CIM) circuits have been demonstrated as a promising solution to effectively accelerate the inference of convolutional neural networks (CNNs) by shifting computation into the memory arrays. However, the advantages of CIM accelerators will disappear as increasing the bit precision and adopting advanced process technology due to the overhead caused by ADC/DAC and poor technology scaling capability of analog circuits. In this paper, a hybrid digital-CIM accelerator was proposed to solve above problems and the weights and activations of different layers are quantized to different precision (high, medium, and low precision). Moreover, precision-aware workload distribution and dataflow are proposed for the hybrid digital-CIM accelerator. Overall, the proposed accelerator can achieve 12.481 TOPS/W.

Original languageEnglish
Title of host publicationProceedings - International SoC Design Conference 2022, ISOCC 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages171-172
Number of pages2
ISBN (Electronic)9781665459716
DOIs
StatePublished - 2022
Event19th International System-on-Chip Design Conference, ISOCC 2022 - Gangneung-si, Korea, Republic of
Duration: 19 Oct 202222 Oct 2022

Publication series

NameProceedings - International SoC Design Conference 2022, ISOCC 2022

Conference

Conference19th International System-on-Chip Design Conference, ISOCC 2022
Country/TerritoryKorea, Republic of
CityGangneung-si
Period19/10/2222/10/22

Keywords

  • hybrid digital-CIM
  • precision-aware

Fingerprint

Dive into the research topics of 'Precision-Aware Workload Distribution and Dataflow for a Hybrid Digital-CIM Deep CNN Accelerator'. Together they form a unique fingerprint.

Cite this