Two is a crowd: tracking relations in videos

Moskalev, Artem; Sosnovik, Ivan; Smeulders, Arnold

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2108

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Two is a crowd: tracking relations in videos

Authors: Artem Moskalev, Ivan Sosnovik, Arnold Smeulders

(Submitted on 11 Aug 2021)

Abstract: Tracking multiple objects individually differs from tracking groups of related objects. When an object is a part of the group, its trajectory depends on the trajectories of the other group members. Most of the current state-of-the-art trackers follow the approach of tracking each object independently, with the mechanism to handle the overlapping trajectories where necessary. Such an approach does not take inter-object relations into account, which may cause unreliable tracking for the members of the groups, especially in crowded scenarios, where individual cues become unreliable due to occlusions. To overcome these limitations and to extend such trackers to crowded scenes, we propose a plug-in Relation Encoding Module (REM). REM encodes relations between tracked objects by running a message passing over a corresponding spatio-temporal graph, computing relation embeddings for the tracked objects. Our experiments on MOT17 and MOT20 demonstrate that the baseline tracker improves its results after a simple extension with REM. The proposed module allows for tracking severely or even fully occluded objects by utilizing relational cues.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.05331 [cs.CV]
	(or arXiv:2108.05331v1 [cs.CV] for this version)

Submission history

From: Artem Moskalev [view email]
[v1] Wed, 11 Aug 2021 17:19:34 GMT (13511kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2108.05331

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Two is a crowd: tracking relations in videos

Submission history