Trilevel Neural Architecture Search for Efficient Single Image Super-Resolution

Wu, Yan; Huang, Zhiwu; Kumar, Suryansh; Sukthanker, Rhea Sanjay; Timofte, Radu; Van Gool, Luc

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2101

Computer Science > Computer Vision and Pattern Recognition

Title: Trilevel Neural Architecture Search for Efficient Single Image Super-Resolution

Authors: Yan Wu, Zhiwu Huang, Suryansh Kumar, Rhea Sanjay Sukthanker, Radu Timofte, Luc Van Gool

(Submitted on 17 Jan 2021 (v1), last revised 23 Apr 2021 (this version, v2))

Abstract: Modern solutions to the single image super-resolution (SISR) problem using deep neural networks aim not only at better performance accuracy but also at a lighter and computationally efficient model. To that end, recently, neural architecture search (NAS) approaches have shown some tremendous potential. Following the same underlying, in this paper, we suggest a novel trilevel NAS method that provides a better balance between different efficiency metrics and performance to solve SISR. Unlike available NAS, our search is more complete, and therefore it leads to an efficient, optimized, and compressed architecture. We innovatively introduce a trilevel search space modeling, i.e., hierarchical modeling on network-, cell-, and kernel-level structures. To make the search on trilevel spaces differentiable and efficient, we exploit a new sparsestmax technique that is excellent at generating sparse distributions of individual neural architecture candidates so that they can be better disentangled for the final selection from the enlarged search space. We further introduce the sorting technique to the sparsestmax relaxation for better network-level compression. The proposed NAS optimization additionally facilitates simultaneous search and training in a single phase, reducing search time and train time. Comprehensive evaluations on the benchmark datasets show our method's clear superiority over the state-of-the-art NAS in terms of a good trade-off between model size, performance, and efficiency.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2101.06658 [cs.CV]
	(or arXiv:2101.06658v2 [cs.CV] for this version)

Submission history

From: Zhiwu Huang [view email]
[v1] Sun, 17 Jan 2021 12:19:49 GMT (7835kb,D)
[v2] Fri, 23 Apr 2021 15:50:09 GMT (2488kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2101.06658

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Trilevel Neural Architecture Search for Efficient Single Image Super-Resolution

Submission history