We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Weakly Supervised Semantic Segmentation by Pixel-to-Prototype Contrast

Abstract: Though image-level weakly supervised semantic segmentation (WSSS) has achieved great progress with Class Activation Maps (CAMs) as the cornerstone, the large supervision gap between classification and segmentation still hampers the model to generate more complete and precise pseudo masks for segmentation. In this study, we propose weakly-supervised pixel-to-prototype contrast that can provide pixel-level supervisory signals to narrow the gap. Guided by two intuitive priors, our method is executed across different views and within per single view of an image, aiming to impose cross-view feature semantic consistency regularization and facilitate intra(inter)-class compactness(dispersion) of the feature space. Our method can be seamlessly incorporated into existing WSSS models without any changes to the base networks and does not incur any extra inference burden. Extensive experiments manifest that our method consistently improves two strong baselines by large margins, demonstrating the effectiveness. Specifically, built on top of SEAM, we improve the initial seed mIoU on PASCAL VOC 2012 from 55.4% to 61.5%. Moreover, armed with our method, we increase the segmentation mIoU of EPS from 70.8% to 73.6%, achieving new state-of-the-art.
Comments: 10 pages, 5 figures. Accepted by CVPR'22
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2110.07110 [cs.CV]
  (or arXiv:2110.07110v3 [cs.CV] for this version)

Submission history

From: Ye Du [view email]
[v1] Thu, 14 Oct 2021 01:44:57 GMT (1914kb,D)
[v2] Tue, 16 Nov 2021 01:42:09 GMT (2613kb,D)
[v3] Mon, 14 Mar 2022 01:52:36 GMT (2617kb,D)

Link back to: arXiv, form interface, contact.