References & Citations
Computer Science > Computer Vision and Pattern Recognition
Title: ResDepth: Learned Residual Stereo Reconstruction
(Submitted on 22 Jan 2020 (v1), last revised 18 Jun 2021 (this version, v3))
Abstract: We propose an embarrassingly simple but very effective scheme for high-quality dense stereo reconstruction: (i) generate an approximate reconstruction with your favourite stereo matcher; (ii) rewarp the input images with that approximate model; (iii) with the initial reconstruction and the warped images as input, train a deep network to enhance the reconstruction by regressing a residual correction; and (iv) if desired, iterate the refinement with the new, improved reconstruction. The strategy to only learn the residual greatly simplifies the learning problem. A standard Unet without bells and whistles is enough to reconstruct even small surface details, like dormers and roof substructures in satellite images. We also investigate residual reconstruction with less information and find that even a single image is enough to greatly improve an approximate reconstruction. Our full model reduces the mean absolute error of state-of-the-art stereo reconstruction systems by >50%, both in our target domain of satellite stereo and on stereo pairs from the ETH3D benchmark.
Submission history
From: Corinne Stucker [view email][v1] Wed, 22 Jan 2020 14:12:43 GMT (9067kb,D)
[v2] Thu, 30 Apr 2020 17:28:05 GMT (9698kb,D)
[v3] Fri, 18 Jun 2021 16:06:20 GMT (33482kb,D)
Link back to: arXiv, form interface, contact.