We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Graph Sampling Based Deep Metric Learning for Generalizable Person Re-Identification

Abstract: Recent studies show that, both explicit deep feature matching as well as large-scale and diverse training data can significantly improve the generalization of person re-identification. However, the efficiency of learning deep matchers on large-scale data has not yet been adequately studied. Though learning with classification parameters or class memory is a popular way, it incurs large memory and computational costs. In contrast, pairwise deep metric learning within mini batches would be a better choice. However, the most popular random sampling method, the well-known PK sampler, is not informative and efficient for deep metric learning. Though online hard example mining has improved the learning efficiency to some extent, the mining in mini batches after random sampling is still limited. This inspires us to explore the use of hard example mining earlier, in the data sampling stage. To do so, in this paper, we propose an efficient mini-batch sampling method, called graph sampling (GS), for large-scale deep metric learning. The basic idea is to build a nearest neighbor relationship graph for all classes at the beginning of each epoch. Then, each mini batch is composed of a randomly selected class and its nearest neighboring classes so as to provide informative and challenging examples for learning. Together with an adapted competitive baseline, we improve the state of the art in generalizable person re-identification significantly, by 25.1% in Rank-1 on MSMT17 when trained on RandPerson. Besides, the proposed method also outperforms the competitive baseline, by 6.8% in Rank-1 on CUHK03-NP when trained on MSMT17. Meanwhile, the training time is significantly reduced, from 25.4 hours to 2 hours when trained on RandPerson with 8,000 identities. Code is available at this https URL
Comments: This paper has been accepted by CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2104.01546 [cs.CV]
  (or arXiv:2104.01546v4 [cs.CV] for this version)

Submission history

From: Shengcai Liao [view email]
[v1] Sun, 4 Apr 2021 06:44:15 GMT (1162kb,D)
[v2] Tue, 6 Apr 2021 05:26:26 GMT (1162kb,D)
[v3] Tue, 7 Dec 2021 18:44:31 GMT (593kb,D)
[v4] Wed, 6 Apr 2022 10:52:30 GMT (719kb,D)

Link back to: arXiv, form interface, contact.