Learning Semantic Correspondence Exploiting an Object-level Prior

Lee, Junghyup; Kim, Dohyung; Lee, Wonkyung; Ponce, Jean; Ham, Bumsub

(Submitted on 29 Nov 2019 (v1), last revised 21 Jul 2020 (this version, v2))

Abstract: We address the problem of semantic correspondence, that is, establishing a dense flow field between images depicting different instances of the same object or scene category. We propose to use images annotated with binary foreground masks and subjected to synthetic geometric deformations to train a convolutional neural network (CNN) for this task. Using these masks as part of the supervisory signal provides an object-level prior for the semantic correspondence task and offers a good compromise between semantic flow methods, where the amount of training data is limited by the cost of manually selecting point correspondences, and semantic alignment ones, where the regression of a single global geometric transformation between images may be sensitive to image-specific details such as background clutter. We propose a new CNN architecture, dubbed SFNet, which implements this idea. It leverages a new and differentiable version of the argmax function for end-to-end training, with a loss that combines mask and flow consistency with smoothness terms. Experimental results demonstrate the effectiveness of our approach, which significantly outperforms the state of the art on standard benchmarks.
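The abstract refers to a differentiable version of the argmax function. As a rough illustration only, the sketch below shows a generic soft argmax: a softmax over matching scores followed by the expected index under that distribution. This is a common stand-in for argmax in end-to-end training and is not claimed to be the variant used in SFNet; the function name `soft_argmax` and the temperature parameter `beta` are hypothetical and do not come from the paper.

```python
import math


def soft_argmax(scores, beta=10.0):
    """Differentiable surrogate of argmax over a 1-D list of scores.

    `beta` is an assumed temperature parameter: larger values push the
    result toward the hard argmax, and beta = 0 yields the mean index.
    """
    m = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(beta * (s - m)) for s in scores]
    total = sum(weights)
    # Expected index under the softmax distribution.
    return sum(i * w for i, w in enumerate(weights)) / total


if __name__ == "__main__":
    matching_scores = [0.1, 0.3, 2.5, 0.2]  # illustrative values only
    print(soft_argmax(matching_scores, beta=10.0))
```

Unlike a hard argmax, the returned position is a smooth function of the scores, so gradients can pass through it. The result is a real-valued (sub-index) position rather than an integer.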

Comments:	Accepted to TPAMI. arXiv admin note: substantial text overlap with arXiv:1904.01810
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1911.12914 [cs.CV]
	(or arXiv:1911.12914v2 [cs.CV] for this version)

Submission history

From: Dohyung Kim MR
[v1] Fri, 29 Nov 2019 01:13:11 GMT (3897kb,D)
[v2] Tue, 21 Jul 2020 06:29:40 GMT (6591kb,D)
