We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Semantics-Driven Unsupervised Learning for Monocular Depth and Ego-Motion Estimation

Abstract: We propose a semantics-driven unsupervised learning approach for monocular depth and ego-motion estimation from videos in this paper. Recent unsupervised learning methods employ photometric errors between synthetic view and actual image as a supervision signal for training. In our method, we exploit semantic segmentation information to mitigate the effects of dynamic objects and occlusions in the scene, and to improve depth prediction performance by considering the correlation between depth and semantics. To avoid costly labeling process, we use noisy semantic segmentation results obtained by a pre-trained semantic segmentation network. In addition, we minimize the position error between the corresponding points of adjacent frames to utilize 3D spatial information. Experimental results on the KITTI dataset show that our method achieves good performance in both depth and ego-motion estimation tasks.
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2006.04371 [cs.CV]
  (or arXiv:2006.04371v1 [cs.CV] for this version)

Submission history

From: Xiaobin Wei [view email]
[v1] Mon, 8 Jun 2020 05:55:07 GMT (1104kb,D)

Link back to: arXiv, form interface, contact.