Computer Science > Computer Vision and Pattern Recognition
Title: Affinity CNN: Learning Pixel-Centric Pairwise Relations for Figure/Ground Embedding
(Submitted on 9 Dec 2015 (v1), last revised 11 Apr 2016 (this version, v2))
Abstract: Spectral embedding provides a framework for solving perceptual organization problems, including image segmentation and figure/ground organization. From an affinity matrix describing pairwise relationships between pixels, it clusters pixels into regions, and, using a complex-valued extension, orders pixels according to layer. We train a convolutional neural network (CNN) to directly predict the pairwise relationships that define this affinity matrix. Spectral embedding then resolves these predictions into a globally consistent segmentation and figure/ground organization of the scene. Experiments demonstrate significant benefit to this direct coupling compared to prior work that uses explicit intermediate stages, such as edge detection, on the pathway from image to affinities. Our results suggest spectral embedding as a powerful alternative to the conditional random field (CRF)-based globalization schemes typically coupled to deep neural networks.
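The step from an affinity matrix to a clustering can be illustrated with a small sketch. This is not the paper's pipeline: the affinities here come from a hand-written Gaussian on intensity differences (an assumption standing in for the CNN's predictions), the six-pixel "image", the `sigma` value, and the function name `spectral_embedding` are all hypothetical, and only the real-valued case is shown, so the complex-valued extension that orders pixels by layer is omitted.

```python
import numpy as np

def spectral_embedding(W, k=2):
    """Solve (D - W) v = lambda * D v and return the k smallest eigenpairs.

    W is a symmetric, non-negative affinity matrix with positive row sums.
    """
    W = np.asarray(W, dtype=float)
    d = W.sum(axis=1)                      # degree of each pixel
    d_inv_sqrt = 1.0 / np.sqrt(d)
    # Symmetric normalized Laplacian: I - D^{-1/2} W D^{-1/2}
    L_sym = np.eye(len(d)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    eigvals, eigvecs = np.linalg.eigh(L_sym)   # eigenvalues in ascending order
    # Map back to eigenvectors of the generalized problem.
    return eigvals[:k], d_inv_sqrt[:, None] * eigvecs[:, :k]

# Toy "image": six pixels whose intensities fall into two groups.
intensity = np.array([0.0, 0.1, 0.05, 1.0, 0.9, 0.95])

# Assumed stand-in affinity: Gaussian on pairwise intensity differences.
sigma = 0.5
diff = intensity[:, None] - intensity[None, :]
W = np.exp(-(diff ** 2) / (2 * sigma ** 2))

eigvals, emb = spectral_embedding(W, k=2)

# The first eigenvector is constant; the sign of the second splits the pixels.
labels = (emb[:, 1] > 0).astype(int)
print(labels)
```

Which group receives label 0 and which receives label 1 depends on the arbitrary sign of the eigenvector, so only the grouping itself is meaningful.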
Submission history
From: Michael Maire
[v1] Wed, 9 Dec 2015 06:45:23 GMT (5282kb,D)
[v2] Mon, 11 Apr 2016 22:03:38 GMT (5364kb,D)