Title: An Unsupervised Information-Theoretic Perceptual Quality Metric

Abstract: Tractable models of human perception have proved challenging to build. Hand-designed models such as MS-SSIM remain popular predictors of human image quality judgements due to their simplicity and speed. Recent deep learning approaches can perform better, but they rely on supervised data that can be costly to gather: large sets of class labels such as ImageNet, image quality ratings, or both. We combine recent advances in information-theoretic objective functions with a computational architecture informed by the physiology of the human visual system and unsupervised training on pairs of video frames, yielding our Perceptual Information Metric (PIM). We show that PIM is competitive with supervised metrics on the recent and challenging BAPPS image quality assessment dataset and outperforms them in predicting the ranking of image compression methods in CLIC 2020. We also perform qualitative experiments using the ImageNet-C dataset, and establish that PIM is robust with respect to architectural details.
Comments: 19 pages, 10 figures. Presented at NeurIPS 2020. Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2006.06752 [cs.CV]
  (or arXiv:2006.06752v3 [cs.CV] for this version)

Submission history

From: Johannes Ballé [view email]
[v1] Thu, 11 Jun 2020 19:11:28 GMT (3832kb,D)
[v2] Sat, 24 Oct 2020 01:33:55 GMT (4014kb,D)
[v3] Sun, 10 Jan 2021 19:28:57 GMT (4014kb,D)