Multistage Spatial Context Models for Learned Image Compression

Lin, Fangzheng; Sun, Heming; Liu, Jinming; Katto, Jiro

(Submitted on 18 Feb 2023)

Abstract: Recent state-of-the-art learned image compression methods use spatial context models, which achieve large rate-distortion (RD) improvements over hyperprior methods. However, the autoregressive context model requires serial decoding, which limits runtime performance. The Checkerboard context model allows parallel decoding at the cost of reduced RD performance. We present a series of multistage spatial context models that allow both fast decoding and better RD performance. We split the latent space into square patches and decode serially within each patch, while different patches are decoded in parallel. The proposed method achieves a decoding speed comparable to Checkerboard while matching and even outperforming the RD performance of Autoregressive. Inside each patch, the decoding order must be chosen carefully, because a bad order degrades performance; we therefore also propose a decoding order optimization algorithm.
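
The patch-based schedule described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `stage_map` and `stage_masks` and the 2x2 order used below are hypothetical, chosen only to show how tiling one per-patch order over the latent yields one mask per decoding stage. Positions sharing a stage index across patches would be decoded together, so the number of serial steps equals the number of positions in a patch rather than the number of positions in the latent.

```python
import numpy as np


def stage_map(height, width, order):
    """Return a (height, width) array with the decoding stage of each latent position.

    `order` is a (p, p) array holding a permutation of 0..p*p-1. It gives the
    serial decoding order inside one square patch and is tiled over the latent.
    """
    order = np.asarray(order)
    p = order.shape[0]
    if order.shape != (p, p):
        raise ValueError("order must be a square array")
    if sorted(order.ravel().tolist()) != list(range(p * p)):
        raise ValueError("order must be a permutation of 0..p*p-1")
    # Ceil division so the tiling covers latents whose size is not a multiple of p.
    reps = (-(-height // p), -(-width // p))
    return np.tile(order, reps)[:height, :width]


def stage_masks(height, width, order):
    """Return one boolean mask per stage; mask s marks the positions decoded at stage s."""
    stages = stage_map(height, width, order)
    num_stages = int(np.asarray(order).size)
    return [stages == s for s in range(num_stages)]


# Hypothetical 2x2 order, for illustration only (not the optimized order from the paper).
order_2x2 = np.array([[0, 3],
                      [2, 1]])
masks = stage_masks(4, 6, order_2x2)
```

With a 2x2 patch the latent is decoded in four stages regardless of its size. In this sketch the choice of `order` is the only free parameter, and it is the quantity the abstract says must be optimized.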

Comments:	Accepted to IEEE ICASSP 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2302.09263 [cs.CV]
	(or arXiv:2302.09263v1 [cs.CV] for this version)

Submission history

From: Fangzheng Lin
[v1] Sat, 18 Feb 2023 08:55:54 GMT (62kb,D)
