Attention-guided Chained Context Aggregation for Semantic Segmentation

Tang, Quan; Liu, Fagui; Jiang, Jun; Zhang, Yu

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2002

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Attention-guided Chained Context Aggregation for Semantic Segmentation

Authors: Quan Tang, Fagui Liu, Jun Jiang, Yu Zhang

(Submitted on 27 Feb 2020 (this version), latest version 21 May 2021 (v4))

Abstract: Recent breakthroughs in semantic segmentation methods based on Fully Convolutional Networks (FCNs) have aroused great research interest. One of the critical issues is how to aggregate multi-scale contextual information effectively to obtain reliable results. To address this problem, we propose a novel paradigm called the Chained Context Aggregation Module (CAM). CAM gains features of various spatial scales through chain-connected ladder-style information flows. The features are then guided by Flow Guidance Connections to interact and fuse in a two-stage process, which we refer to as pre-fusion and re-fusion. We further adopt attention models in CAM to productively recombine and select those fused features to refine performance. Based on these developments, we construct the Chained Context Aggregation Network (CANet), which employs a two-step decoder to recover precise spatial details of prediction maps. We conduct extensive experiments on three challenging datasets, including Pascal VOC 2012, CamVid and SUN-RGBD. Results evidence that our CANet achieves state-of-the-art performance. Codes will be available on the publication of this paper.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2002.12041 [cs.CV]
	(or arXiv:2002.12041v1 [cs.CV] for this version)

Submission history

From: Quan Tang [view email]
[v1] Thu, 27 Feb 2020 11:26:56 GMT (519kb,D)
[v2] Sun, 17 Jan 2021 07:00:54 GMT (1064kb,D)
[v3] Tue, 19 Jan 2021 08:01:47 GMT (0kb,I)
[v4] Fri, 21 May 2021 03:25:20 GMT (1098kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2002.12041v1

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Attention-guided Chained Context Aggregation for Semantic Segmentation

Submission history