Title: Large-scale Unsupervised Semantic Segmentation

Abstract: Powered by the ImageNet dataset, unsupervised learning on large-scale data has made significant advances for classification tasks. Two major challenges stand in the way of extending this attractive learning modality to segmentation tasks: i) a large-scale benchmark for assessing algorithms is missing; ii) unsupervised shape representation learning is difficult. We propose a new problem of large-scale unsupervised semantic segmentation (LUSS), together with a newly created benchmark dataset to track research progress. Based on the ImageNet dataset, we propose the ImageNet-S dataset with 1.2 million training images and 40k high-quality semantic segmentation annotations for evaluation. Our benchmark has high data diversity and a clear task objective. We also present a simple yet effective baseline method that works surprisingly well for LUSS. In addition, we benchmark related un/weakly supervised methods accordingly, identifying the challenges and possible directions of LUSS.
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2106.03149 [cs.CV]
  (or arXiv:2106.03149v1 [cs.CV] for this version)

Submission history

From: Shanghua Gao
[v1] Sun, 6 Jun 2021 15:02:11 GMT (2881kb,D)
[v2] Sun, 30 Jan 2022 13:07:37 GMT (2289kb,D)
[v3] Thu, 3 Nov 2022 12:31:02 GMT (1943kb,D)
