We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess.IV

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Electrical Engineering and Systems Science > Image and Video Processing

Title: Lossy Image Compression with Quantized Hierarchical VAEs

Abstract: Recent research has shown a strong theoretical connection between variational autoencoders (VAEs) and the rate-distortion theory. Motivated by this, we consider the problem of lossy image compression from the perspective of generative modeling. Starting with ResNet VAEs, which are originally designed for data (image) distribution modeling, we redesign their latent variable model using a quantization-aware posterior and prior, enabling easy quantization and entropy coding at test time. Along with improved neural network architecture, we present a powerful and efficient model that outperforms previous methods on natural image lossy compression. Our model compresses images in a coarse-to-fine fashion and supports parallel encoding and decoding, leading to fast execution on GPUs. Code is available at this https URL
Comments: WACV 2023 Best Algorithms Paper Award, revised version
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
DOI: 10.1109/WACV56688.2023.00028
Cite as: arXiv:2208.13056 [eess.IV]
  (or arXiv:2208.13056v2 [eess.IV] for this version)

Submission history

From: Zhihao Duan [view email]
[v1] Sat, 27 Aug 2022 17:15:38 GMT (2764kb,D)
[v2] Sat, 25 Mar 2023 15:52:29 GMT (2789kb,D)

Link back to: arXiv, form interface, contact.