Transformer-Based Learned Image Compression for Joint Decoding and Denoising

Yi Hsin Chen, Kuan Wei Ho, Shiau Rung Tsai, Guan Hsun Lin, Alessandro Gnutti, Wen Hsiao Peng, Riccardo Leonardi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

This work introduces a Transformer-based image compression system. It has the flexibility to switch between the standard image reconstruction and the denoising reconstruction from a single compressed bitstream. Instead of training separate decoders for these tasks, we incorporate two add-on modules to adapt a pre-trained image decoder from performing the standard image reconstruction to joint decoding and denoising. Our scheme adopts a two-pronged approach. It features a latent refinement module to refine the latent representation of a noisy input image for reconstructing a noise-free image. Additionally, it incorporates an instance-specific prompt generator that adapts the decoding process to improve on the latent refinement. Experimental results show that our method achieves a similar level of denoising quality to training a separate decoder for joint decoding and denoising at the expense of only a modest increase in the decoder's model size and computational complexity.

Original languageEnglish
Title of host publication2024 Picture Coding Symposium, PCS 2024 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350358483
DOIs
StatePublished - 2024
Event2024 Picture Coding Symposium, PCS 2024 - Taichung, Taiwan
Duration: 12 Jun 202414 Jun 2024

Publication series

Name2024 Picture Coding Symposium, PCS 2024 - Proceedings

Conference

Conference2024 Picture Coding Symposium, PCS 2024
Country/TerritoryTaiwan
CityTaichung
Period12/06/2414/06/24

Keywords

  • Learned image compression
  • Transformer
  • compressed-domain image denoising

Fingerprint

Dive into the research topics of 'Transformer-Based Learned Image Compression for Joint Decoding and Denoising'. Together they form a unique fingerprint.

Cite this