EfficientPS: Efficient Panoptic Segmentation

Mohan, Rohit; Valada, Abhinav

(Submitted on 5 Apr 2020 (v1), last revised 1 Feb 2021 (this version, v3))

Abstract: Understanding the scene in which an autonomous robot operates is critical for its competent functioning. Such scene comprehension requires recognizing instances of traffic participants along with general scene semantics, which can be effectively addressed by the panoptic segmentation task. In this paper, we introduce the Efficient Panoptic Segmentation (EfficientPS) architecture, which consists of a shared backbone that efficiently encodes and fuses semantically rich multi-scale features. We incorporate a new semantic head that aggregates fine and contextual features coherently and a new variant of Mask R-CNN as the instance head. We also propose a novel panoptic fusion module that congruously integrates the output logits from both heads of our EfficientPS architecture to yield the final panoptic segmentation output. Additionally, we introduce the KITTI panoptic segmentation dataset, which contains panoptic annotations for the popular and challenging KITTI benchmark. Extensive evaluations on Cityscapes, KITTI, Mapillary Vistas and the Indian Driving Dataset demonstrate that our proposed architecture consistently sets a new state of the art on all four benchmarks while being the most efficient and fastest panoptic segmentation architecture to date.

Comments:	Ranked #1 on Cityscapes panoptic segmentation benchmark, ranked #2 among the published methods on Cityscapes semantic segmentation benchmark, and ranked #2 among the published methods on Cityscapes instance segmentation benchmark. Demo, code and models are available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
Journal reference:	International Journal of Computer Vision (IJCV), vol. 129, no. 5, pp. 1551-1579, 2021
Cite as:	arXiv:2004.02307 [cs.CV]
	(or arXiv:2004.02307v3 [cs.CV] for this version)

Submission history

From: Abhinav Valada
[v1] Sun, 5 Apr 2020 20:15:59 GMT (8670kb,D)
[v2] Tue, 19 May 2020 00:23:26 GMT (8669kb,D)
[v3] Mon, 1 Feb 2021 09:33:18 GMT (8991kb,D)
