Design Exploration of An Energy-Efficient Acceleration System for CNNs on Low-Cost Resource-Constraint SoC-FPGAs

Shao Cheng Wen, Po Tsang Huang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Deep convolutional neural networks (CNNs) require enormous computation capacity, great amounts of memory accesses and data movement among parallel processing elements (PEs). From an energy perspective, CNNs are difficult to be fully deployed to low-cost resource-constraint edge devices because of both memory-intensive and computation-intensive workloads. In this paper, energy-efficient software/hardware co-design is explored for CNN acceleration on a Xilinx resource-constraint SoC-FPGA device. The acceleration system is optimized based on the constraints of DRAM bandwidths, BRAM resources, computing resources, optimal frequency and the complexity of wire routing. Moreover, the efficient workload distribution and dataflow control are also implemented by both software and hardware to achieve the maximum resource utilization. Based on a low-cost Xilinx Zynq XC7Z020 SoC-FPGA device, the proposed acceleration system achieves the throughput of VGG16 and YOLOv3-tiny by 4.3 frame/s and 21 frame/s, respectively. Moreover, 34 GOPS/W and 38.9 GOPS/W can be realized for VGG16 and YOLOv3-tiny. Compared to other state-of-art designs on resource-constraint SoC-FPGA devices, the proposed acceleration system achieves the best energy efficiency with high resource utilization.

Original languageEnglish
Title of host publicationProceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages234-237
Number of pages4
ISBN (Electronic)9781665409964
DOIs
StatePublished - 2022
Event4th IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022 - Incheon, Korea, Republic of
Duration: 13 Jun 202215 Jun 2022

Publication series

NameProceeding - IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022

Conference

Conference4th IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2022
Country/TerritoryKorea, Republic of
CityIncheon
Period13/06/2215/06/22

Fingerprint

Dive into the research topics of 'Design Exploration of An Energy-Efficient Acceleration System for CNNs on Low-Cost Resource-Constraint SoC-FPGAs'. Together they form a unique fingerprint.

Cite this