Boosting Few-shot Semantic Segmentation with Transformers

Sun, Guolei; Liu, Yun; Liang, Jingyun; Van Gool, Luc

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2108

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Boosting Few-shot Semantic Segmentation with Transformers

Authors: Guolei Sun, Yun Liu, Jingyun Liang, Luc Van Gool

(Submitted on 4 Aug 2021)

Abstract: Due to the fact that fully supervised semantic segmentation methods require sufficient fully-labeled data to work well and can not generalize to unseen classes, few-shot segmentation has attracted lots of research attention. Previous arts extract features from support and query images, which are processed jointly before making predictions on query images. The whole process is based on convolutional neural networks (CNN), leading to the problem that only local information is used. In this paper, we propose a TRansformer-based Few-shot Semantic segmentation method (TRFS). Specifically, our model consists of two modules: Global Enhancement Module (GEM) and Local Enhancement Module (LEM). GEM adopts transformer blocks to exploit global information, while LEM utilizes conventional convolutions to exploit local information, across query and support features. Both GEM and LEM are complementary, helping to learn better feature representations for segmenting query images. Extensive experiments on PASCAL-5i and COCO datasets show that our approach achieves new state-of-the-art performance, demonstrating its effectiveness.

Comments:	Technical report. Code and pretrained models will be available: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.02266 [cs.CV]
	(or arXiv:2108.02266v1 [cs.CV] for this version)

Submission history

From: Guolei Sun [view email]
[v1] Wed, 4 Aug 2021 20:09:21 GMT (6848kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2108.02266

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Boosting Few-shot Semantic Segmentation with Transformers

Submission history