On the Rate-Distortion-Complexity Trade-Offs of Neural Video Coding

Yi Hsin Chen, Kuan Wei Ho, Martin Benjak, Jorn Ostermann, Wen Hsiao Peng

研究成果: Conference contribution同行評審

摘要

This paper aims to delve into the rate-distortion-complexity trade-offs of modern neural video coding. Recent years have witnessed much research effort being focused on exploring the full potential of neural video coding. Conditional auto encoders have emerged as the mainstream approach to efficient neural video coding. The central theme of conditional auto encoders is to leverage both spatial and temporal information for better conditional coding. However, a recent study indicates that conditional coding may suffer from information bottlenecks, potentially performing worse than traditional residual coding. To address this issue, recent conditional coding methods incorporate a large number of high-resolution features as the condition signal, leading to a considerable increase in the number of multiply-accumulate operations, memory footprint, and model size. Taking DCVC as the common code base, we investigate how the newly proposed conditional residual coding, an emerging new school of thought, and its variants may strike a better balance among rate, distortion, and complexity.

原文English
主出版物標題2024 IEEE 26th International Workshop on Multimedia Signal Processing, MMSP 2024
發行者Institute of Electrical and Electronics Engineers Inc.
ISBN(電子)9798350387254
DOIs
出版狀態Published - 2024
事件26th IEEE International Workshop on Multimedia Signal Processing, MMSP 2024 - West Lafayette, 美國
持續時間: 2 10月 20244 10月 2024

出版系列

名字2024 IEEE 26th International Workshop on Multimedia Signal Processing, MMSP 2024

Conference

Conference26th IEEE International Workshop on Multimedia Signal Processing, MMSP 2024
國家/地區美國
城市West Lafayette
期間2/10/244/10/24

指紋

深入研究「On the Rate-Distortion-Complexity Trade-Offs of Neural Video Coding」主題。共同形成了獨特的指紋。

引用此