An End-to-End Trainable Video Panoptic Segmentation Method usingTransformers

Ryu, Jeongwon; Yoon, Kwangjin

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2110

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: An End-to-End Trainable Video Panoptic Segmentation Method usingTransformers

Authors: Jeongwon Ryu, Kwangjin Yoon

(Submitted on 8 Oct 2021)

Abstract: In this paper, we present an algorithm to tackle a video panoptic segmentation problem, a newly emerging area of research. The video panoptic segmentation is a task that unifies the typical task of panoptic segmentation and multi-object tracking. In other words, it requires generating the instance tracking IDs along with panoptic segmentation results across video sequences. Our proposed video panoptic segmentation algorithm uses the transformer and it can be trained in end-to-end with an input of multiple video frames. We test our method on the STEP dataset and report its performance with recently proposed STQ metric. The method archived 57.81\% on the KITTI-STEP dataset and 31.8\% on the MOTChallenge-STEP dataset.

Comments:	This contains a brief abstract
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2110.04009 [cs.CV]
	(or arXiv:2110.04009v1 [cs.CV] for this version)

Submission history

From: Kwangjin Yoon [view email]
[v1] Fri, 8 Oct 2021 10:13:37 GMT (3194kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2110.04009

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: An End-to-End Trainable Video Panoptic Segmentation Method usingTransformers

Submission history