A Coarse-Grained Dual-Convolver Based CNN Accelerator with High Computing Resource Utilization

Yi Lu, Yi Lin Wu, Juinn Dar Huang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Deep learning technologies have been developed rapidly in recent years and have played an important role in our lives. Among them, convolutional neural network (CNN) performs well in many applications. The quality of result is generally getting better as the number of convolutional layers increases, which also increases the computational complexity. Hence, a highly resource-efficient accelerator is demanded. In this paper, we propose a new CNN accelerator that features a delay-chain-free input data aligner as well as a dual-convolver processing element (DCPE). Our architecture does not require delay chains with a large number of registers for input data alignment, which not only reduces the area and power but improves the overall resource utilization. In addition, a set of DCPEs shares the same input aligner to produce multiple output feature maps concurrently, which offers the desirable computing power and reduces the external memory traffic. An accelerator instance with 8 DCPEs (144 MACs) has been implemented using TSMC 40nm process. The internal logic only consumes 285K gates and the total internal memory size is merely 44KB. As running VGG-16, the average performance is 190GOPS (@750MHz), the resource (MAC) utilization reaches 8S.3%, and the energy efficiency is 481GOPS/W.

Original languageEnglish
Title of host publicationProceedings - 2020 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages198-202
Number of pages5
ISBN (Electronic)9781728149226
DOIs
StatePublished - Aug 2020
Event2020 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2020 - Genova, Italy
Duration: 31 Aug 20202 Sep 2020

Publication series

NameProceedings - 2020 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2020

Conference

Conference2020 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2020
Country/TerritoryItaly
CityGenova
Period31/08/202/09/20

Keywords

  • convolutional neural network CNN
  • hardware accelerator
  • high resource utilization
  • low data bandwidth

Fingerprint

Dive into the research topics of 'A Coarse-Grained Dual-Convolver Based CNN Accelerator with High Computing Resource Utilization'. Together they form a unique fingerprint.

Cite this