References & Citations
Computer Science > Computer Vision and Pattern Recognition
Title: Self-Supervised Monocular Scene Decomposition and Depth Estimation
(Submitted on 21 Oct 2021)
Abstract: Self-supervised monocular depth estimation approaches either ignore independently moving objects in the scene or need a separate segmentation step to identify them. We propose MonoDepthSeg to jointly estimate depth and segment moving objects from monocular video without using any ground-truth labels. We decompose the scene into a fixed number of components where each component corresponds to a region on the image with its own transformation matrix representing its motion. We estimate both the mask and the motion of each component efficiently with a shared encoder. We evaluate our method on three driving datasets and show that our model clearly improves depth estimation while decomposing the scene into separately moving components.
Submission history
From: Sadra Safadoust [view email][v1] Thu, 21 Oct 2021 17:03:08 GMT (32464kb,D)
Link back to: arXiv, form interface, contact.