We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Exploring the ability of CNNs to generalise to previously unseen scales over wide scale ranges

Abstract: The ability to handle large scale variations is crucial for many real world visual tasks. A straightforward approach for handling scale in a deep network is to process an image at several scales simultaneously in a set of scale channels. Scale invariance can then, in principle, be achieved by using weight sharing between the scale channels together with max or average pooling over the outputs from the scale channels. The ability of such scale channel networks to generalise to scales not present in the training set over significant scale ranges has, however, not previously been explored. We, therefore, present a theoretical analysis of invariance and covariance properties of scale channel networks and perform an experimental evaluation of the ability of different types of scale channel networks to generalise to previously unseen scales. We identify limitations of previous approaches and propose a new type of foveated scale channel architecture, where the scale channels process increasingly larger parts of the image with decreasing resolution. Our proposed FovMax and FovAvg networks perform almost identically over a scale range of 8, also when training on single scale training data, and do also give improvements in the small sample regime.
Comments: 14 pages, 6 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Journal reference: Shortened version in International Conference on Pattern Recognition (ICPR 2020), pages 1181-1188, Jan 2021
DOI: 10.1109/ICPR48806.2021.9413276
Cite as: arXiv:2004.01536 [cs.CV]
  (or arXiv:2004.01536v7 [cs.CV] for this version)

Submission history

From: Ylva Jansson [view email]
[v1] Fri, 3 Apr 2020 13:00:35 GMT (4052kb,D)
[v2] Thu, 21 May 2020 19:04:47 GMT (4592kb,D)
[v3] Thu, 18 Jun 2020 10:14:16 GMT (4665kb,D)
[v4] Mon, 29 Jun 2020 13:06:41 GMT (4665kb,D)
[v5] Tue, 15 Sep 2020 12:32:33 GMT (8810kb,D)
[v6] Mon, 12 Apr 2021 09:07:54 GMT (8810kb,D)
[v7] Tue, 18 May 2021 09:27:23 GMT (8810kb,D)

Link back to: arXiv, form interface, contact.