References & Citations
Computer Science > Computer Vision and Pattern Recognition
Title: Self-Supervised Attention Learning for Depth and Ego-motion Estimation
(Submitted on 27 Apr 2020 (v1), last revised 5 Dec 2022 (this version, v2))
Abstract: We address the problem of depth and ego-motion estimation from image sequences. Recent advances in the domain propose to train a deep learning model for both tasks using image reconstruction in a self-supervised manner. We revise the assumptions and the limitations of the current approaches and propose two improvements to boost the performance of the depth and ego-motion estimation. We first use Lie group properties to enforce the geometric consistency between images in the sequence and their reconstructions. We then propose a mechanism to pay an attention to image regions where the image reconstruction get corrupted. We show how to integrate the attention mechanism in the form of attention gates in the pipeline and use attention coefficients as a mask. We evaluate the new architecture on the KITTI datasets and compare it to the previous techniques. We show that our approach improves the state-of-the-art results for ego-motion estimation and achieve comparable results for depth estimation.
Submission history
From: Boris Chidlovskii [view email][v1] Mon, 27 Apr 2020 18:19:22 GMT (6846kb,D)
[v2] Mon, 5 Dec 2022 19:51:01 GMT (7381kb,D)
Link back to: arXiv, form interface, contact.