We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Google Landmark Retrieval 2021 Competition Third Place Solution

Abstract: We present our solutions to the Google Landmark Challenges 2021, for both the retrieval and the recognition tracks. Both solutions are ensembles of transformers and ConvNet models based on Sub-center ArcFace with dynamic margins. Since the two tracks share the same training data, we used the same pipeline and training approach, but with different model selections for the ensemble and different post-processing. The key improvement over last year is newer state-of-the-art vision architectures, especially transformers which significantly outperform ConvNets for the retrieval task. We finished third and fourth places for the retrieval and recognition tracks respectively.
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2110.04619 [cs.CV]
  (or arXiv:2110.04619v1 [cs.CV] for this version)

Submission history

From: Bo Liu [view email]
[v1] Sat, 9 Oct 2021 17:56:40 GMT (5kb)

Link back to: arXiv, form interface, contact.