References & Citations
Computer Science > Computer Vision and Pattern Recognition
Title: Hypercorrelation Squeeze for Few-Shot Segmentation
(Submitted on 4 Apr 2021 (v1), last revised 14 Oct 2021 (this version, v3))
Abstract: Few-shot semantic segmentation aims at learning to segment a target object from a query image using only a few annotated support images of the target class. This challenging task requires to understand diverse levels of visual cues and analyze fine-grained correspondence relations between the query and the support images. To address the problem, we propose Hypercorrelation Squeeze Networks (HSNet) that leverages multi-level feature correlation and efficient 4D convolutions. It extracts diverse features from different levels of intermediate convolutional layers and constructs a collection of 4D correlation tensors, i.e., hypercorrelations. Using efficient center-pivot 4D convolutions in a pyramidal architecture, the method gradually squeezes high-level semantic and low-level geometric cues of the hypercorrelation into precise segmentation masks in coarse-to-fine manner. The significant performance improvements on standard few-shot segmentation benchmarks of PASCAL-5i, COCO-20i, and FSS-1000 verify the efficacy of the proposed method.
Submission history
From: Juhong Min [view email][v1] Sun, 4 Apr 2021 05:27:13 GMT (15026kb,D)
[v2] Fri, 20 Aug 2021 21:10:34 GMT (13894kb,D)
[v3] Thu, 14 Oct 2021 18:27:04 GMT (13897kb,D)
Link back to: arXiv, form interface, contact.