摘要
Deep learning-based super-resolution (SR) is challenging to implement in resource-constrained edge devices for resolutions beyond full HD due to its high computational complexity and memory bandwidth requirements. This paper introduces an 8K@30FPS SR accelerator with edge-selective dynamic input processing. Dynamic processing chooses the appropriate subnets for different patches based on simple input edge criteria, achieving a 50% MAC reduction with only a 0.1dB PSNR decrease. The quality of reconstruction images is guaranteed and maximized its potential with resource adaptive model switching even under resource constraints. In conjunction with hardware-specific refinements, the model size is reduced by 84% to 51K, but with a decrease of less than 0.6dB PSNR. Additionally, to support dynamic processing with high utilization, this design incorporates a configurable group of layer mapping that synergizes with the structure-friendly fusion block, resulting in 77% hardware utilization and up to 79% reduction in feature SRAM access. The implementation, using the TSMC 28nm process, can achieve 8K@30FPS throughput at 800MHz with a gate count of 2749K, 0.2075W power consumption, and 4797Mpixels/J energy efficiency, exceeding previous work.
| 原文 | English |
|---|---|
| 頁(從 - 到) | 1693-1705 |
| 頁數 | 13 |
| 期刊 | IEEE Transactions on Circuits and Systems I: Regular Papers |
| 卷 | 71 |
| 發行號 | 4 |
| DOIs | |
| 出版狀態 | Published - 1 4月 2024 |
UN SDG
此研究成果有助於以下永續發展目標
-
SDG 7 經濟實惠的清潔能源
指紋
深入研究「ESSR: An 8K@30FPS Super-Resolution Accelerator With Edge Selective Network」主題。共同形成了獨特的指紋。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver