End-to-end learned image compression with augmented normalizing flows

Yung Han Ho, Chih Chun Chan, Wen-Hsiao Peng, Hsueh-Ming Hang

研究成果: Conference contribution同行評審

摘要

This paper presents a new attempt at using augmented normalizing flows (ANF) for lossy image compression. ANF is a specific type of normalizing flow models that augment the input with an independent noise, allowing a smoother transformation from the augmented input space to the latent space. Inspired by the fact that ANF can offer greater expressivity by stacking multiple variational autoencoders (VAE), we generalize the popular VAE-based compression framework by the autoencoding transforms of ANF. When evaluated on Kodak dataset, our ANF-based model provides 3.4% higher BD-rate saving as compared with a VAE-based baseline that implements hyper-prior with mean prediction. Interestingly, it benefits even more from the incorporation of a post-processing network, showing 11.8% rate saving as compared to 6.0% with the baseline plus post-processing.

原文English
主出版物標題Proceedings - 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2021
發行者IEEE Computer Society
頁面1931-1935
頁數5
ISBN(電子)9781665448994
DOIs
出版狀態Published - 6月 2021
事件2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2021 - Virtual, Online, United States
持續時間: 19 6月 202125 6月 2021

出版系列

名字IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
ISSN(列印)2160-7508
ISSN(電子)2160-7516

Conference

Conference2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2021
國家/地區United States
城市Virtual, Online
期間19/06/2125/06/21

指紋

深入研究「End-to-end learned image compression with augmented normalizing flows」主題。共同形成了獨特的指紋。

引用此