Title: Multi-view Monocular Depth and Uncertainty Prediction with Deep SfM in Dynamic Environments

Abstract: 3D reconstruction of depth and motion from monocular video in dynamic environments is a highly ill-posed problem due to scale ambiguities when projecting to the 2D image domain. In this work, we investigate the performance of current state-of-the-art (SotA) deep multi-view systems in such environments. We find that current supervised methods work surprisingly well despite not modelling individual object motions, but make systematic errors due to a lack of dense ground truth data. To detect such errors during use, we extend the cost-volume-based Deep Video to Depth (DeepV2D) framework \cite{teed2018deepv2d} with a learned uncertainty. Our Deep Video to certain Depth (DeepV2cD) model i) performs on par with or better than the current SotA and ii) achieves a better uncertainty measure than the naive Shannon entropy. Our experiments show that a simple filter strategy based on the uncertainty can significantly reduce systematic errors. This results in cleaner reconstructions on both static and dynamic parts of the scene.
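
Illustrative sketch (not from the paper): the abstract compares the learned uncertainty against a naive Shannon entropy baseline and mentions a simple uncertainty-based filter. The Python below shows only what such a baseline could look like for a per-pixel distribution over depth hypotheses. All names (`softmax`, `shannon_entropy`, `filter_by_entropy`), the depth bins, the scores, and the threshold are hypothetical; it does not reproduce DeepV2D, DeepV2cD, or the learned uncertainty.

```python
import math


def softmax(scores):
    """Turn per-hypothesis matching scores into a probability distribution."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def shannon_entropy(probs):
    """Shannon entropy in nats; zero-probability bins contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def expected_depth(probs, depth_bins):
    """Depth estimate as the expectation over the depth hypotheses."""
    return sum(p * d for p, d in zip(probs, depth_bins))


def filter_by_entropy(cost_rows, depth_bins, max_entropy):
    """Keep a pixel's depth only if its entropy is at most max_entropy.

    cost_rows holds one list of scores per pixel (assumed layout).
    Rejected pixels are returned as None.
    """
    depths = []
    for scores in cost_rows:
        probs = softmax(scores)
        if shannon_entropy(probs) <= max_entropy:
            depths.append(expected_depth(probs, depth_bins))
        else:
            depths.append(None)
    return depths


# Toy data (assumed): four depth hypotheses, two pixels.
depth_bins = [1.0, 2.0, 4.0, 8.0]
cost_rows = [
    [0.0, 10.0, 0.0, 0.0],  # sharply peaked scores: low entropy
    [1.0, 1.0, 1.0, 1.0],   # flat scores: maximal entropy, ln(4)
]
filtered = filter_by_entropy(cost_rows, depth_bins, max_entropy=0.5)
```

With these toy values the peaked pixel keeps a depth close to 2.0, while the flat pixel is rejected. According to the abstract, the learned uncertainty in DeepV2cD gives a better measure than this kind of entropy baseline.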
Comments: 20 pages, 5 figures, 3 tables, submitted to ICPRAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2201.08633 [cs.CV]
  (or arXiv:2201.08633v1 [cs.CV] for this version)

Submission history

From: Christian Homeyer
[v1] Fri, 21 Jan 2022 10:42:57 GMT (2301kb,D)
