Title: Self-Supervised Learning of Image Scale and Orientation

Abstract: We study the problem of learning to assign a characteristic pose, i.e., scale and orientation, for an image region of interest. Despite its apparent simplicity, the problem is non-trivial; it is hard to obtain a large-scale set of image regions with explicit pose annotations from which a model can learn directly. To tackle the issue, we propose a self-supervised learning framework with a histogram alignment technique. It generates pairs of image patches by random rescaling/rotating and then trains an estimator to predict their scale/orientation values so that their relative difference is consistent with the rescaling/rotating used. The estimator learns to predict a non-parametric histogram distribution of scale/orientation without any supervision. Experiments show that it significantly outperforms previous methods in scale/orientation estimation and also improves image matching and 6-DoF camera pose estimation by incorporating our patch poses into a matching process.
Comments: Presented at BMVC 2021. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2206.07259 [cs.CV]
  (or arXiv:2206.07259v1 [cs.CV] for this version)
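
The consistency idea in the abstract (predicted histograms for two patches should differ by exactly the rotation used to generate the pair) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the bin count, the circular-shift alignment, the cross-entropy loss, and all function names here are assumptions chosen for illustration, and only orientation is covered.

```python
import numpy as np

N_BINS = 36  # assumed: 36 orientation bins of 10 degrees each


def softmax(logits):
    """Turn raw scores into a histogram that sums to 1."""
    e = np.exp(logits - np.max(logits))
    return e / e.sum()


def shift_histogram(hist, angle_deg):
    """Circularly shift an orientation histogram by a known rotation."""
    bin_width = 360.0 / len(hist)
    shift = int(round(angle_deg / bin_width)) % len(hist)
    return np.roll(hist, shift)


def alignment_loss(hist_a, hist_b, angle_deg, eps=1e-8):
    """Cross-entropy between hist_b and hist_a aligned by the known rotation.

    Assumed loss form; it is smallest when hist_b equals the shifted hist_a.
    """
    target = shift_histogram(hist_a, angle_deg)
    return float(-np.sum(target * np.log(hist_b + eps)))


def predicted_angle(hist):
    """Read the dominant orientation (in degrees) off the histogram peak."""
    return float(np.argmax(hist) * (360.0 / len(hist)))


# Stand-in for estimator outputs: patch A peaks at 30 degrees.
logits_a = np.zeros(N_BINS)
logits_a[3] = 5.0
hist_a = softmax(logits_a)

# Patch B is patch A rotated by 40 degrees (the synthetic transformation).
rotation = 40.0
hist_b_consistent = np.roll(hist_a, 4)  # peak moved to 70 degrees
hist_b_inconsistent = hist_a.copy()     # peak did not move

loss_good = alignment_loss(hist_a, hist_b_consistent, rotation)
loss_bad = alignment_loss(hist_a, hist_b_inconsistent, rotation)
print(loss_good < loss_bad)
```

Because the supervision signal is only the relative rotation applied when generating the pair, no absolute orientation label is needed; the loss simply penalizes predictions whose histogram difference disagrees with that known rotation.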

Submission history

From: Jongmin Lee
[v1] Wed, 15 Jun 2022 02:43:39 GMT (28136kb,D)
