Title: Symmetric Network with Spatial Relationship Modeling for Natural Language-based Vehicle Retrieval

Abstract: Natural language (NL) based vehicle retrieval aims to retrieve a specific vehicle given a text description. Unlike image-based vehicle retrieval, NL-based vehicle retrieval must consider not only vehicle appearance but also the surrounding environment and temporal relations. In this paper, we propose a Symmetric Network with Spatial Relationship Modeling (SSM) method for NL-based vehicle retrieval. Specifically, we design a symmetric network to learn unified cross-modal representations of text descriptions and vehicle images, in which both vehicle appearance details and global vehicle trajectory information are preserved. In addition, to make better use of location information, we propose a spatial relationship modeling method that takes the surrounding environment and the mutual relationships between vehicles into account. Qualitative and quantitative experiments verify the effectiveness of the proposed method. We achieve 43.92% MRR accuracy on the test set of the natural language-based vehicle retrieval track of the 6th AI City Challenge, ranking 1st among all valid submissions on the public leaderboard. The code is available at this https URL
Comments: 8 pages, 3 figures, published at CVPRW
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Journal reference: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022: 3226-3233
Cite as: arXiv:2206.10879 [cs.CV]
  (or arXiv:2206.10879v1 [cs.CV] for this version)
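
The abstract reports its result as MRR (mean reciprocal rank). As a rough illustration of how that metric is commonly computed for text-to-vehicle retrieval, here is a minimal sketch. It is not the authors' evaluation code or the challenge's official scorer. The function name `mean_reciprocal_rank`, the query and track identifiers, and the assumption of exactly one correct track per query are all hypothetical choices made for this example.

```python
from typing import Hashable, Mapping, Sequence


def mean_reciprocal_rank(
    rankings: Mapping[Hashable, Sequence[Hashable]],
    targets: Mapping[Hashable, Hashable],
) -> float:
    """Average of 1/rank of the correct item over all queries.

    rankings: for each query id, candidate ids ordered best-first.
    targets:  for each query id, the single correct candidate id
              (assumption: one ground-truth track per text query).
    A query whose target is missing from its ranking contributes 0.
    """
    if not rankings:
        return 0.0
    total = 0.0
    for query_id, ranked in rankings.items():
        target = targets[query_id]
        for position, candidate in enumerate(ranked, start=1):
            if candidate == target:
                total += 1.0 / position
                break
    return total / len(rankings)


# Hypothetical toy data: three text queries, each ranking three vehicle tracks.
example_rankings = {
    "q1": ["track_a", "track_b", "track_c"],  # target at rank 1
    "q2": ["track_b", "track_a", "track_c"],  # target at rank 2
    "q3": ["track_c", "track_b", "track_a"],  # target at rank 3
}
example_targets = {"q1": "track_a", "q2": "track_a", "q3": "track_a"}
example_mrr = mean_reciprocal_rank(example_rankings, example_targets)
```

On this toy data the reciprocal ranks are 1, 1/2, and 1/3, so the mean is 11/18. Under this definition, a score of 43.92% means that the average reciprocal rank of the correct vehicle track across the test queries is about 0.44.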

Submission history

From: Haobo Chen
[v1] Wed, 22 Jun 2022 07:02:04 GMT (310kb,D)