GNN3DMOT: Graph Neural Network for 3D Multi-Object Tracking with Multi-Feature Learning

Weng, Xinshuo; Wang, Yongxin; Man, Yunze; Kitani, Kris

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2006

Computer Science > Computer Vision and Pattern Recognition

Title: GNN3DMOT: Graph Neural Network for 3D Multi-Object Tracking with Multi-Feature Learning

Authors: Xinshuo Weng, Yongxin Wang, Yunze Man, Kris Kitani

(Submitted on 12 Jun 2020)

Abstract: 3D Multi-object tracking (MOT) is crucial to autonomous systems. Recent work uses a standard tracking-by-detection pipeline, where feature extraction is first performed independently for each object in order to compute an affinity matrix. Then the affinity matrix is passed to the Hungarian algorithm for data association. A key process of this standard pipeline is to learn discriminative features for different objects in order to reduce confusion during data association. In this work, we propose two techniques to improve the discriminative feature learning for MOT: (1) instead of obtaining features for each object independently, we propose a novel feature interaction mechanism by introducing the Graph Neural Network. As a result, the feature of one object is informed of the features of other objects so that the object feature can lean towards the object with similar feature (i.e., object probably with a same ID) and deviate from objects with dissimilar features (i.e., object probably with different IDs), leading to a more discriminative feature for each object; (2) instead of obtaining the feature from either 2D or 3D space in prior work, we propose a novel joint feature extractor to learn appearance and motion features from 2D and 3D space simultaneously. As features from different modalities often have complementary information, the joint feature can be more discriminate than feature from each individual modality. To ensure that the joint feature extractor does not heavily rely on one modality, we also propose an ensemble training paradigm. Through extensive evaluation, our proposed method achieves state-of-the-art performance on KITTI and nuScenes 3D MOT benchmarks. Our code will be made available at this https URL

Comments:	CVPR 2020. My website for all my research works: this http URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2006.07327 [cs.CV]
	(or arXiv:2006.07327v1 [cs.CV] for this version)

Submission history

From: Xinshuo Weng [view email]
[v1] Fri, 12 Jun 2020 17:08:14 GMT (1164kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2006.07327

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: GNN3DMOT: Graph Neural Network for 3D Multi-Object Tracking with Multi-Feature Learning

Submission history