Title: Monocular Depth Estimation via Listwise Ranking using the Plackett-Luce Model

Abstract: In many real-world applications, the relative depth of objects in an image is crucial for scene understanding. Recent approaches mainly tackle the problem of depth prediction in monocular images by treating the problem as a regression task. Yet, being interested in an order relation in the first place, ranking methods suggest themselves as a natural alternative to regression, and indeed, ranking approaches leveraging pairwise comparisons as training information ("object A is closer to the camera than B") have shown promising performance on this problem. In this paper, we elaborate on the use of so-called listwise ranking as a generalization of the pairwise approach. Our method is based on the Plackett-Luce (PL) model, a probability distribution on rankings, which we combine with a state-of-the-art neural network architecture and a simple sampling strategy to reduce training complexity. Moreover, taking advantage of the representation of PL as a random utility model, the proposed predictor offers a natural way to recover (shift-invariant) metric depth information from ranking-only data provided at training time. An empirical evaluation on several benchmark datasets in a "zero-shot" setting demonstrates the effectiveness of our approach compared to existing ranking and regression methods.
Comments: 15 pages, 5 figures, 7 tables, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Journal reference: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 14595-14604
Cite as: arXiv:2010.13118 [cs.CV]
  (or arXiv:2010.13118v4 [cs.CV] for this version)
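
The abstract names two properties of the Plackett-Luce (PL) model: it is a probability distribution on rankings, and it can be represented as a random utility model. The following is a minimal sketch of both in NumPy, not the authors' implementation. The function names, the use of a plain score vector in place of network outputs, and the convention that the first index in a ranking is the top-ranked item are all assumptions made for illustration. The paper's network architecture and sampling strategy are not reproduced here.

```python
import numpy as np


def pl_neg_log_likelihood(scores, ranking):
    """Negative log-likelihood of `ranking` under a Plackett-Luce model.

    scores:  real-valued log-utilities, one per item (hypothetical stand-in
             for the outputs of a predictor).
    ranking: item indices ordered from top-ranked to bottom-ranked.
    """
    s = np.asarray(scores, dtype=float)[np.asarray(ranking)]
    # suffix_lse[k] = log(sum_{j >= k} exp(s[j])), accumulated in log space
    # from the end of the ranking to avoid overflow and underflow.
    suffix_lse = np.logaddexp.accumulate(s[::-1])[::-1]
    # PL likelihood: prod_k exp(s[k]) / sum_{j >= k} exp(s[j]).
    return float(np.sum(suffix_lse - s))


def pl_sample_ranking(scores, rng):
    """Draw one ranking from the PL model via its random utility form.

    Adding independent standard Gumbel noise to the log-utilities and
    sorting in descending order yields a PL-distributed ranking.
    """
    s = np.asarray(scores, dtype=float)
    noisy_utilities = s + rng.gumbel(size=s.shape)
    return np.argsort(-noisy_utilities)


if __name__ == "__main__":
    scores = np.array([2.0, 0.5, -1.0])  # illustrative values only
    rng = np.random.default_rng(0)
    ranking = pl_sample_ranking(scores, rng)
    print(ranking, pl_neg_log_likelihood(scores, ranking))
```

Adding the same constant to every score leaves this likelihood unchanged, so ranking data alone cannot pin down an absolute offset. This is consistent with the abstract's statement that the recovered metric depth information is shift-invariant.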

Submission history

From: Julian Lienen
[v1] Sun, 25 Oct 2020 13:40:10 GMT (747kb,D)
[v2] Wed, 28 Oct 2020 09:37:54 GMT (747kb,D)
[v3] Sat, 31 Oct 2020 15:14:37 GMT (747kb,D)
[v4] Wed, 7 Jul 2021 07:43:54 GMT (7465kb,D)
