Abstract
With the recent success of the deep neural networks (DNNs) in the field of artificial intelligence, the urge of deploying DNNs has drawn tremendous attention because it can benefit a wide range of applications on edge or embedded devices. Lightweight deep learning indicates the procedures of compressing DNN models into more compact ones, which are suitable to be executed on edge devices due to their limited resources and computational capabilities while maintaining comparable performance to the original. Currently, the approaches of model compression include but are not limited to network pruning, quantization, knowledge distillation, and neural architecture search. In this work, we present a fresh overview to summarize recent development and challenges for model compression.
Original language | English |
---|---|
Pages | 51-64 |
Number of pages | 14 |
Volume | 13 |
No | 4 |
Specialist publication | IEEE Consumer Electronics Magazine |
DOIs | |
State | Published - 1 Jul 2024 |