Learning multiple visual domains with residual adapters

Rebuffi, Sylvestre-Alvise; Bilen, Hakan; Vedaldi, Andrea

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 1705

Computer Science > Computer Vision and Pattern Recognition

Title: Learning multiple visual domains with residual adapters

Authors: Sylvestre-Alvise Rebuffi, Hakan Bilen, Andrea Vedaldi

(Submitted on 22 May 2017 (v1), last revised 27 Nov 2017 (this version, v5))

Abstract: There is a growing interest in learning data representations that work well for many different types of problems and data. In this paper, we look in particular at the task of learning a single visual representation that can be successfully utilized in the analysis of very different types of images, from dog breeds to stop signs and digits. Inspired by recent work on learning networks that predict the parameters of another, we develop a tunable deep network architecture that, by means of adapter residual modules, can be steered on the fly to diverse visual domains. Our method achieves a high degree of parameter sharing while maintaining or even improving the accuracy of domain-specific representations. We also introduce the Visual Decathlon Challenge, a benchmark that evaluates the ability of representations to capture simultaneously ten very different visual domains and measures their ability to recognize well uniformly.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:1705.08045 [cs.CV]
	(or arXiv:1705.08045v5 [cs.CV] for this version)

Submission history

From: Sylvestre-Alvise Rebuffi [view email]
[v1] Mon, 22 May 2017 23:59:23 GMT (350kb,D)
[v2] Wed, 7 Jun 2017 23:05:40 GMT (350kb,D)
[v3] Tue, 27 Jun 2017 16:56:16 GMT (350kb,D)
[v4] Tue, 22 Aug 2017 07:27:04 GMT (350kb,D)
[v5] Mon, 27 Nov 2017 17:35:38 GMT (357kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1705.08045

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Learning multiple visual domains with residual adapters

Submission history