Improving Super-Resolution Performance using Meta-Attention Layers

Aquilina, Matthew; Galea, Christian; Abela, John; Camilleri, Kenneth P.; Farrugia, Reuben A.

doi:10.1109/LSP.2021.3116518

Full-text links:

Download:

Current browse context:

eess.IV

< prev | next >

new | recent | 2110

Electrical Engineering and Systems Science > Image and Video Processing

Title: Improving Super-Resolution Performance using Meta-Attention Layers

Authors: Matthew Aquilina, Christian Galea, John Abela, Kenneth P. Camilleri, Reuben A. Farrugia

(Submitted on 27 Oct 2021)

Abstract: Convolutional Neural Networks (CNNs) have achieved impressive results across many super-resolution (SR) and image restoration tasks. While many such networks can upscale low-resolution (LR) images using just the raw pixel-level information, the ill-posed nature of SR can make it difficult to accurately super-resolve an image which has undergone multiple different degradations. Additional information (metadata) describing the degradation process (such as the blur kernel applied, compression level, etc.) can guide networks to super-resolve LR images with higher fidelity to the original source. Previous attempts at informing SR networks with degradation parameters have indeed been able to improve performance in a number of scenarios. However, due to the fully-convolutional nature of many SR networks, most of these metadata fusion methods either require a complete architectural change, or necessitate the addition of significant extra complexity. Thus, these approaches are difficult to introduce into arbitrary SR networks without considerable design alterations. In this paper, we introduce meta-attention, a simple mechanism which allows any SR CNN to exploit the information available in relevant degradation parameters. The mechanism functions by translating the metadata into a channel attention vector, which in turn selectively modulates the network's feature maps. Incorporating meta-attention into SR networks is straightforward, as it requires no specific type of architecture to function correctly. Extensive testing has shown that meta-attention can consistently improve the pixel-level accuracy of state-of-the-art (SOTA) networks when provided with relevant degradation metadata. For PSNR, the gain on blurred/downsampled (X4) images is of 0.2969 dB (on average) and 0.3320 dB for SOTA general and face SR models, respectively.

Comments:	Accepted for publication in the IEEE Signal Processing Letters. This is the accepted version of the paper, for the final formatted version and supplementary information, please visit the IEEE's publication at the linked DOI
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
DOI:	10.1109/LSP.2021.3116518
Cite as:	arXiv:2110.14638 [eess.IV]
	(or arXiv:2110.14638v1 [eess.IV] for this version)

Submission history

From: Matthew Aquilina [view email]
[v1] Wed, 27 Oct 2021 09:20:21 GMT (2104kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2110.14638

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Image and Video Processing

Title: Improving Super-Resolution Performance using Meta-Attention Layers

Submission history