Title: A model for full local image interpretation

Abstract: We describe a computational model of humans' ability to provide a detailed interpretation of the components of a scene. Humans can identify meaningful components almost everywhere in an image, and identifying these components is an essential part of the visual process and of understanding the surrounding scene and its potential meaning to the viewer. Detailed interpretation is beyond the scope of current models of visual recognition. Our model suggests that this is a fundamental limitation, related to the fact that existing models rely on feed-forward processing with only limited top-down processing. In our model, a first recognition stage leads to the initial activation of class candidates, which is incomplete and of limited accuracy. This stage then triggers the application of class-specific interpretation and validation processes, which recover a richer and more accurate interpretation of the visible scene. We discuss implications of the model for visual interpretation by humans and by computer vision models.
Comments: Published in the Proceedings of the 37th Annual Meeting of the Cognitive Science Society (CogSci), 2015
Subjects: Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
Journal reference: https://cogsci.mindmodeling.org/2015/papers/0048/
Cite as: arXiv:2110.08744 [cs.AI]
  (or arXiv:2110.08744v1 [cs.AI] for this version)

Submission history

From: Guy Ben-Yosef
[v1] Sun, 17 Oct 2021 07:20:53 GMT (707kb)
