We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Simple and effective localized attribute representations for zero-shot learning

Abstract: Zero-shot learning (ZSL) aims to discriminate images from unseen classes by exploiting relations to seen classes via their semantic descriptions. Some recent papers have shown the importance of localized features together with fine-tuning the feature extractor to obtain discriminative and transferable features. However, these methods require complex attention or part detection modules to perform explicit localization in the visual space. In contrast, in this paper we propose localizing representations in the semantic/attribute space, with a simple but effective pipeline where localization is implicit. Focusing on attribute representations, we show that our method obtains state-of-the-art performance on CUB and SUN datasets, and also achieves competitive results on AWA2 dataset, outperforming generally more complex methods with explicit localization in the visual space. Our method can be implemented easily, which can be used as a new baseline for zero shot-learning. In addition, our localized representations are highly interpretable as attribute-specific heatmaps.
Comments: A journal version of the paper is arXiv:arXiv:2103.04704
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2006.05938 [cs.CV]
  (or arXiv:2006.05938v3 [cs.CV] for this version)

Submission history

From: Shiqi Yang [view email]
[v1] Wed, 10 Jun 2020 16:46:12 GMT (3994kb,D)
[v2] Wed, 17 Jun 2020 18:07:23 GMT (1495kb,D)
[v3] Tue, 9 Mar 2021 09:44:15 GMT (1495kb,D)

Link back to: arXiv, form interface, contact.