Toward Fast Platform-Aware Neural Architecture Search for FPGA-Accelerated Edge AI Applications

Yi Chuan Liang, Ying Chiao Liao, Chen Ching Lin, Shih Hao Hung

Research output: Conference contribution, peer-reviewed

3 citations (Scopus)

Abstract

Neural Architecture Search (NAS) is a technique for finding neural network architectures suited to a given application. Previous search methods have usually been based on reinforcement learning, with a recurrent neural network generating candidate models. However, most NAS methods aim to find a set of candidates with the best cost-performance ratios, e.g. high accuracy and low computing time, based on rough estimates derived generically from the workload. Because today's deep learning chips accelerate neural network operations with a variety of hardware techniques such as vector operations and low-precision data formats, metrics estimated from generic computing operations, such as the number of floating-point operations (FLOPs), can differ greatly from the actual latency, throughput, and power consumption, which are highly sensitive to the hardware design and even to the software optimization in edge AI applications. Thus, instead of spending a long time repeatedly picking and training so-called good candidates based on unreliable estimates, we propose a NAS framework that accelerates the search by including actual performance measurements in the search process. Using actual measurements lets the framework select candidates based on correct information, which reduces the chance of selecting wrong candidates and wasting search time on them. To illustrate the effectiveness of our framework, we prototyped it to work with Intel OpenVINO and Field Programmable Gate Arrays (FPGAs) to meet the accuracy and latency required by the user. The framework takes the dataset and the accuracy and latency requirements from the user and automatically searches for candidates that meet those requirements. Case studies and experimental results are presented in this paper to evaluate the effectiveness of our framework for edge AI applications in real-time image classification.
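
The idea of putting measured latency inside the search loop can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a plain sampling loop in place of the reinforcement-learning controller, assumes that latency is checked before training so that slow candidates are never trained, and uses hypothetical names throughout (`Candidate`, `SearchResult`, `time_callable_ms`, `search`). In a real deployment the `measure_latency_ms` callback would wrap inference on the target device; here it is simply a parameter.

```python
import time
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class Candidate:
    """Hypothetical architecture description (illustrative dimensions only)."""
    depth: int  # number of layers
    width: int  # channels per layer


@dataclass(frozen=True)
class SearchResult:
    candidate: Candidate
    accuracy: float
    latency_ms: float


def time_callable_ms(run_once: Callable[[], None], repeats: int = 5) -> float:
    """Return the median wall-clock time of run_once in milliseconds.

    A stand-in for an on-device measurement; run_once is assumed to
    perform one inference of the candidate model.
    """
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        run_once()
        samples.append((time.perf_counter() - start) * 1000.0)
    samples.sort()
    return samples[len(samples) // 2]


def search(
    sample: Callable[[], Candidate],
    measure_latency_ms: Callable[[Candidate], float],
    train_and_score: Callable[[Candidate], float],
    min_accuracy: float,
    max_latency_ms: float,
    budget: int,
) -> Optional[SearchResult]:
    """Return the first candidate meeting both requirements, or None.

    Assumption of this sketch: latency is measured first, so candidates
    that are too slow are rejected without paying the training cost.
    """
    for _ in range(budget):
        candidate = sample()
        latency = measure_latency_ms(candidate)
        if latency > max_latency_ms:
            continue  # too slow on the target: skip training entirely
        accuracy = train_and_score(candidate)
        if accuracy >= min_accuracy:
            return SearchResult(candidate, accuracy, latency)
    return None
```

The design point the sketch tries to capture is that the latency filter uses a measured value instead of a FLOP-based estimate, so the expensive training step is only spent on candidates that already satisfy the latency requirement.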

Original language: English
Title of host publication: Proceedings of the 2020 Research in Adaptive and Convergent Systems, RACS 2020
Publisher: Association for Computing Machinery
Pages: 219-225
Number of pages: 7
ISBN (electronic): 9781450380256
Publication status: Published - 13 Oct 2020
Event: 2020 Research in Adaptive and Convergent Systems, RACS 2020 - Gwangju, South Korea
Duration: 13 Oct 2020 to 16 Oct 2020

Publication series

Name: ACM International Conference Proceeding Series

Conference

Conference: 2020 Research in Adaptive and Convergent Systems, RACS 2020
Country/Territory: South Korea
City: Gwangju
Period: 13/10/20 to 16/10/20
