We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Same-different problems strain convolutional neural networks

Abstract: The robust and efficient recognition of visual relations in images is a hallmark of biological vision. We argue that, despite recent progress in visual recognition, modern machine vision algorithms are severely limited in their ability to learn visual relations. Through controlled experiments, we demonstrate that visual-relation problems strain convolutional neural networks (CNNs). The networks eventually break altogether when rote memorization becomes impossible, as when intra-class variability exceeds network capacity. Motivated by the comparable success of biological vision, we argue that feedback mechanisms including attention and perceptual grouping may be the key computational components underlying abstract visual reasoning.\
Comments: 6 Pages, 4 Figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
Cite as: arXiv:1802.03390 [cs.CV]
  (or arXiv:1802.03390v3 [cs.CV] for this version)

Submission history

From: Junkyung Kim [view email]
[v1] Fri, 9 Feb 2018 18:55:34 GMT (609kb,D)
[v2] Mon, 12 Feb 2018 22:29:20 GMT (610kb,D)
[v3] Fri, 25 May 2018 17:00:23 GMT (602kb,D)

Link back to: arXiv, form interface, contact.