Title: Geometry-Aware Segmentation of Remote Sensing Images via Implicit Height Estimation

Authors: Xiang Li, Yi Fang
Abstract: Convolutional neural networks have substantially advanced the semantic segmentation of remote sensing images. Recent studies show that additional elevation data (e.g., a digital surface model, DSM) improves the semantic segmentation of aerial images. However, previous methods mostly take the 3D elevation information as an additional input. In many real-world applications, the corresponding DSM is not available, and the spatial resolution of an acquired DSM often does not match that of the aerial images. To alleviate this data constraint while still taking advantage of 3D elevation information, we propose a geometry-aware segmentation model that achieves accurate semantic segmentation of aerial images via implicit height estimation. Instead of using a single-stream encoder-decoder network for semantic labeling, we design a separate decoder branch that predicts the height map, and we use the DSM images as side supervision to train this branch. With this branch, our model distills 3D geometric features from 2D appearance features under the supervision of ground-truth DSM images. Moreover, we develop a new geometry-aware convolution module that fuses the 3D geometric features from the height decoder branch with the 2D contextual features from the semantic segmentation branch. The fused feature embeddings produce geometry-aware segmentation maps with enhanced performance. Experiments on the ISPRS Vaihingen and Potsdam datasets demonstrate the effectiveness of the proposed method for the semantic segmentation of aerial images. The proposed model achieves strong performance on both datasets without using any hand-crafted features or post-processing.
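
The side supervision described in the abstract can be illustrated as a two-term training objective: one term for the semantic labels and one for the height map predicted by the extra decoder branch. The sketch below is only an illustration under stated assumptions, not the authors' implementation. The abstract does not specify the loss functions or their weighting, so the per-pixel cross-entropy, the L1 height error, and every name here (`segmentation_loss`, `height_loss`, `joint_loss`, `height_weight`) are hypothetical choices.

```python
import math


def softmax(logits):
    """Numerically stable softmax over one pixel's class logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def segmentation_loss(pixel_logits, labels):
    """Mean per-pixel cross-entropy (an assumed choice of segmentation loss)."""
    losses = [-math.log(softmax(lg)[y]) for lg, y in zip(pixel_logits, labels)]
    return sum(losses) / len(losses)


def height_loss(pred_heights, dsm_heights):
    """Mean absolute error against DSM heights (an assumed choice of height loss)."""
    errors = [abs(p - t) for p, t in zip(pred_heights, dsm_heights)]
    return sum(errors) / len(errors)


def joint_loss(pixel_logits, labels, pred_heights, dsm_heights, height_weight=1.0):
    """Segmentation loss plus weighted height loss.

    height_weight is a hypothetical hyperparameter; the abstract does not
    say how the two terms are balanced.
    """
    seg = segmentation_loss(pixel_logits, labels)
    hgt = height_loss(pred_heights, dsm_heights)
    return seg + height_weight * hgt


# Toy example: two pixels, two classes, one predicted height per pixel.
logits = [[2.0, 0.5], [0.1, 1.5]]
labels = [0, 1]
predicted_heights = [3.2, 0.4]
dsm_heights = [3.0, 0.0]
print(joint_loss(logits, labels, predicted_heights, dsm_heights, height_weight=0.5))
```

In this sketch the DSM values enter only through the loss, which matches the abstract's point that elevation data serves as supervision during training and is not a required input.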
Comments: 13 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2006.05848 [cs.CV]
  (or arXiv:2006.05848v1 [cs.CV] for this version)

Submission history

From: Yi Fang
[v1] Wed, 10 Jun 2020 14:24:10 GMT (8663kb,D)
[v2] Tue, 22 Sep 2020 01:48:22 GMT (19675kb,D)