Title: An End-to-End Trainable Video Panoptic Segmentation Method using Transformers

Abstract: In this paper, we present an algorithm for video panoptic segmentation, a newly emerging area of research. Video panoptic segmentation unifies panoptic segmentation and multi-object tracking: it requires generating instance tracking IDs along with panoptic segmentation results across video sequences. Our proposed algorithm uses a transformer and can be trained end-to-end with multiple video frames as input. We test our method on the STEP dataset and report its performance with the recently proposed STQ metric. The method achieved 57.81% on the KITTI-STEP dataset and 31.8% on the MOTChallenge-STEP dataset.
Comments: This contains a brief abstract
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2110.04009 [cs.CV]
  (or arXiv:2110.04009v1 [cs.CV] for this version)

Submission history

From: Kwangjin Yoon
[v1] Fri, 8 Oct 2021 10:13:37 GMT (3194 KB)
