Digital Computation-in-Memory Design with Adaptive Floating Point for Deep Neural Networks

Yun Ru Yang, Wei Lu, Po Tsang Huang, Hung Ming Chen

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

Abstract

All-digital deep neural network (DNN) accelerators and processors suffer from the von Neumann bottleneck because of the massive data movement that DNNs require. Computation-in-memory (CIM) reduces this data movement by performing the computations inside the memory. However, analog CIM is susceptible to PVT variations and is limited by analog-to-digital and digital-to-analog conversions (ADC/DAC). Most current digital CIM techniques adopt integer operations and the bit-serial method, so throughput is limited by the total number of bits. Moreover, they use an adder tree for accumulation, which causes severe area overhead. In this paper, a folded architecture based on time-division multiplexing is proposed to reduce the area and improve the energy efficiency without reducing the throughput. We quantize and ternarize the low-bit adaptive floating point (ADP) format, which can achieve the same or better accuracy than integer quantization, to reduce the energy cost of computation and data movement. The proposed technique improves overall throughput and energy efficiency by up to 3.83x and 2.19x, respectively, compared with other state-of-the-art integer-based digital CIMs.
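The bit-serial limitation mentioned in the abstract can be illustrated with a small software model. This is only a sketch of the general bit-serial technique, not the design proposed in the paper: the function name `bit_serial_dot`, the unsigned activation encoding, and the one-bit-plane-per-cycle assumption are all hypothetical choices made for illustration.

```python
def bit_serial_dot(activations, weights, n_bits):
    """Dot product of unsigned n_bits-wide activations with integer weights.

    Hypothetical model: one activation bit-plane is consumed per cycle,
    so the cycle count equals the activation bit width.
    """
    acc = 0
    cycles = 0
    for b in range(n_bits):  # least significant bit first
        # One "cycle": every column sees a single activation bit (0 or 1).
        plane = [(a >> b) & 1 for a in activations]
        # With 1-bit inputs the multiply reduces to a select; the sum
        # stands in for the adder tree used for accumulation.
        partial = sum(w for bit, w in zip(plane, weights) if bit)
        # Weight the partial sum by the significance of the current bit.
        acc += partial << b
        cycles += 1
    return acc, cycles


acts = [3, 5, 2, 7]
wts = [1, -2, 4, 3]
result, cycles = bit_serial_dot(acts, wts, n_bits=4)
print(result, cycles)
```

In this model, widening the activations from 4 to 8 bits doubles the cycle count for the same dot product, which is the sense in which throughput depends on the number of bits.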

Original language: English
Title of host publication: Proceedings - 2022 IEEE 15th International Symposium on Embedded Multicore/Many-Core Systems-on-Chip, MCSoC 2022
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 216-223
Number of pages: 8
ISBN (Electronic): 9781665464994
State: Published - 2022
Event: 15th IEEE International Symposium on Embedded Multicore/Many-Core Systems-on-Chip, MCSoC 2022 - Penang, Malaysia
Duration: 19 Dec 2022 to 22 Dec 2022

Publication series

Name: Proceedings - 2022 IEEE 15th International Symposium on Embedded Multicore/Many-Core Systems-on-Chip, MCSoC 2022

Conference

Conference: 15th IEEE International Symposium on Embedded Multicore/Many-Core Systems-on-Chip, MCSoC 2022
Country/Territory: Malaysia
City: Penang
Period: 19/12/22 to 22/12/22

Keywords

  • adaptive floating point
  • digital computation-in-memory
  • folded architecture
  • time interleaving
  • time-division multiplexing
