Title: Weakly But Deeply Supervised Occlusion-Reasoned Parametric Road Layouts

Abstract: We propose an end-to-end network that takes a single perspective RGB image of a complex road scene as input and produces occlusion-reasoned layouts in perspective space as well as in a parametric bird's-eye-view (BEV) space. In contrast to prior works that require dense supervision, such as semantic labels in perspective view, our method requires human annotations only for parametric attributes, which are cheaper and less ambiguous to obtain. To solve this challenging task, our design comprises modules that incorporate inductive biases to learn occlusion reasoning, geometric transformation, and semantic abstraction, where each module may be supervised by appropriately transforming the parametric annotations. We demonstrate how our design choices and the proposed deep supervision help achieve meaningful representations and accurate predictions. We validate our approach on two public datasets, KITTI and NuScenes, achieving state-of-the-art results with considerably less human supervision.
Comments: To appear in CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2104.06730 [cs.CV]
  (or arXiv:2104.06730v2 [cs.CV] for this version)

Submission history

From: Buyu Liu
[v1] Wed, 14 Apr 2021 09:32:29 GMT (40995kb,D)
[v2] Wed, 13 Apr 2022 09:42:20 GMT (26691kb,D)
