xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation

Jaritz, Maximilian; Vu, Tuan-Hung; de Charette, Raoul; Wirbel, Émilie; Pérez, Patrick

(Submitted on 28 Nov 2019 (v1), last revised 30 Mar 2020 (this version, v2))

Abstract: Unsupervised Domain Adaptation (UDA) is crucial to tackle the lack of annotations in a new domain. There are many multi-modal datasets, but most UDA approaches are uni-modal. In this work, we explore how to learn from multi-modality and propose cross-modal UDA (xMUDA) where we assume the presence of 2D images and 3D point clouds for 3D semantic segmentation. This is challenging as the two input spaces are heterogeneous and can be impacted differently by domain shift. In xMUDA, modalities learn from each other through mutual mimicking, disentangled from the segmentation objective, to prevent the stronger modality from adopting false predictions from the weaker one. We evaluate on new UDA scenarios including day-to-night, country-to-country and dataset-to-dataset, leveraging recent autonomous driving datasets. xMUDA brings large improvements over uni-modal UDA on all tested scenarios, and is complementary to state-of-the-art UDA techniques. Code is available at this https URL

Comments:	Accepted at CVPR 2020. For a demo video, see this http URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1911.12676 [cs.CV]
	(or arXiv:1911.12676v2 [cs.CV] for this version)

Submission history

From: Maximilian Jaritz
[v1] Thu, 28 Nov 2019 12:38:05 GMT (7411kb,D)
[v2] Mon, 30 Mar 2020 19:24:04 GMT (6007kb,D)
