References & Citations
Computer Science > Computer Vision and Pattern Recognition
Title: CityFlow-NL: Tracking and Retrieval of Vehicles at City Scale by Natural Language Descriptions
(Submitted on 12 Jan 2021 (v1), last revised 5 Apr 2021 (this version, v3))
Abstract: Natural Language (NL) descriptions can be one of the most convenient or the only way to interact with systems built to understand and detect city scale traffic patterns and vehicle-related events. In this paper, we extend the widely adopted CityFlow Benchmark with NL descriptions for vehicle targets and introduce the CityFlow-NL Benchmark. The CityFlow-NL contains more than 5,000 unique and precise NL descriptions of vehicle targets, making it the first multi-target multi-camera tracking with NL descriptions dataset to our knowledge. Moreover, the dataset facilitates research at the intersection of multi-object tracking, retrieval by NL descriptions, and temporal localization of events. In this paper, we focus on two foundational tasks: the Vehicle Retrieval by NL task and the Vehicle Tracking by NL task, which take advantage of the proposed CityFlow-NL benchmark and provide a strong basis for future research on the multi-target multi-camera tracking by NL description task.
Submission history
From: Qi Feng [view email][v1] Tue, 12 Jan 2021 20:26:17 GMT (16242kb,D)
[v2] Thu, 14 Jan 2021 03:51:21 GMT (16242kb,D)
[v3] Mon, 5 Apr 2021 18:38:10 GMT (14495kb,D)
Link back to: arXiv, form interface, contact.