DiffO: Single-step Diffusion for Image Compression at Ultra-Low Bitrates

19 June 2025

Main:9 Pages

16 Figures

Bibliography:4 Pages

2 Tables

Appendix:5 Pages

Abstract

Although image compression is fundamental to visual data processing and has inspired numerous standard and learned codecs, these methods still suffer severe quality degradation at extremely low bits per pixel. While recent diffusion based models provided enhanced generative performance at low bitrates, they still yields limited perceptual quality and prohibitive decoding latency due to multiple denoising steps. In this paper, we propose the first single step diffusion model for image compression (DiffO) that delivers high perceptual quality and fast decoding at ultra low bitrates. DiffO achieves these goals by coupling two key innovations: (i) VQ Residual training, which factorizes a structural base code and a learned residual in latent space, capturing both global geometry and high frequency details; and (ii) rate adaptive noise modulation, which tunes denoising strength on the fly to match the desired bitrate. Extensive experiments show that DiffO surpasses state of the art compression performance while improving decoding speed by about 50x compared to prior diffusion-based methods, greatly improving the practicality of generative codecs. The code will be available atthis https URL.

View on arXiv

@article{park2025_2506.16572,
  title={ DiffO: Single-step Diffusion for Image Compression at Ultra-Low Bitrates },
  author={ Chanung Park and Joo Chan Lee and Jong Hwan Ko },
  journal={arXiv preprint arXiv:2506.16572},
  year={ 2025 }
}

Comments on this paper