Learning structure-aware semantic segmentation with image-level supervision

Liu, Jiawei; Zhang, Jing; Hong, Yicong; Barnes, Nick

doi:10.1109/IJCNN52387.2021.9533846



(Submitted on 15 Apr 2021)

Abstract: Compared with expensive pixel-wise annotations, image-level labels make it possible to learn semantic segmentation in a weakly-supervised manner. Within this pipeline, the class activation map (CAM) is obtained and further processed to serve as a pseudo label for training the semantic segmentation model in a fully-supervised manner. In this paper, we argue that the structure information lost in the CAM limits its application in downstream semantic segmentation, leading to deteriorated predictions. Furthermore, the inconsistent class activation scores inside the same object contradict the common-sense expectation that every region of an object should belong to the same semantic category. To produce sharp predictions with structure information, we introduce an auxiliary semantic boundary detection module, which penalizes the deteriorated predictions. We also adopt a smoothness loss to encourage predictions inside each object to be consistent. Experimental results on the PASCAL-VOC dataset illustrate the effectiveness of the proposed solution.
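
For orientation, the CAM-to-pseudo-label step that the abstract refers to can be sketched as follows. This is a minimal illustration of the standard CAM formulation (classifier weights applied to the final feature maps), not the authors' implementation: the function names, the 0.3 threshold, and the convention of storing class c as label c + 1 with 0 as background are all assumptions made for the example, and the paper's own post-processing may differ.

```python
import numpy as np


def class_activation_maps(features, weights):
    """Compute normalized class activation maps.

    features: (K, H, W) feature maps from the last convolutional layer.
    weights:  (C, K) weights of the linear classifier that follows
              global average pooling.
    Returns a (C, H, W) array with each class map scaled to [0, 1].
    """
    # CAM_c(x, y) = sum_k weights[c, k] * features[k, x, y]
    cams = np.einsum("ck,khw->chw", weights, features)
    cams = np.maximum(cams, 0.0)  # keep positive evidence only
    peak = cams.reshape(cams.shape[0], -1).max(axis=1)
    return cams / (peak[:, None, None] + 1e-8)


def pseudo_label(cams, present, threshold=0.3, background=0):
    """Turn CAMs into a per-pixel pseudo label (illustrative rule).

    present:   indices of the classes named by the image-level label.
    threshold: assumed cut-off below which a pixel is background.
    Class c is stored as c + 1 so that 0 can mean background.
    """
    present = np.asarray(present)
    scores = cams[present]                 # (P, H, W), present classes only
    best = scores.argmax(axis=0)           # strongest present class per pixel
    label = present[best] + 1
    label[scores.max(axis=0) < threshold] = background
    return label
```

A rule of this kind assigns each pixel independently, so nothing ties a label to object boundaries or keeps it consistent across an object's interior, which is the limitation the abstract sets out to address.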

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Journal reference:	2021 International Joint Conference on Neural Networks (IJCNN)
DOI:	10.1109/IJCNN52387.2021.9533846
Cite as:	arXiv:2104.07216 [cs.CV]
	(or arXiv:2104.07216v1 [cs.CV] for this version)

Submission history

From: Jiawei Liu
[v1] Thu, 15 Apr 2021 03:33:20 GMT (12631kb,D)
