Boosting Video Object Segmentation based on Scale Inconsistency

Wang, Hengyi; Oh, Changjae

(Submitted on 2 May 2022)

Abstract: We present a refinement framework to boost the performance of pre-trained semi-supervised video object segmentation (VOS) models. Our work is based on scale inconsistency, which is motivated by the observation that existing VOS models generate inconsistent predictions from input frames with different sizes. We use the scale inconsistency as a clue to devise a pixel-level attention module that aggregates the advantages of the predictions from different-size inputs. The scale inconsistency is also used to regularize the training based on a pixel-level variance measured by an uncertainty estimation. We further present a self-supervised online adaptation, tailored for test-time optimization, that bootstraps the predictions without ground-truth masks based on the scale inconsistency. Experiments on DAVIS 16 and DAVIS 17 datasets show that our framework can be generically applied to various VOS models and improve their performance.
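The notion of scale inconsistency described in the abstract can be illustrated with a small sketch: run the same model on several resized copies of a frame, map the predictions back to the original resolution, and measure how much they disagree per pixel. This is only an illustration under stated assumptions, not the authors' implementation. The function names, the scale set, the nearest-neighbour resizer, and the toy model are all hypothetical, and the per-pixel variance here is a simple stand-in for the paper's uncertainty estimation, which the abstract does not specify.

```python
import numpy as np

def resize_nearest(img, out_h, out_w):
    """Nearest-neighbour resize of a 2-D array (stand-in for a real resizer)."""
    in_h, in_w = img.shape
    rows = np.arange(out_h) * in_h // out_h
    cols = np.arange(out_w) * in_w // out_w
    return img[rows[:, None], cols[None, :]]

def scale_inconsistency(predict, frame, scales=(0.5, 1.0, 1.5)):
    """Return per-pixel mean and variance of predictions across input scales.

    `predict` is a hypothetical callable that maps a 2-D frame to a
    foreground-probability map of the same shape.
    """
    h, w = frame.shape
    preds = []
    for s in scales:
        sh, sw = max(1, round(h * s)), max(1, round(w * s))
        prob = predict(resize_nearest(frame, sh, sw))
        # Bring every prediction back to the original resolution.
        preds.append(resize_nearest(prob, h, w))
    stack = np.stack(preds)  # shape: (num_scales, h, w)
    return stack.mean(axis=0), stack.var(axis=0)

def toy_predict(frame):
    """Toy 'model' (an assumption): average each pixel with its right neighbour.

    The kernel has a fixed pixel width, so its footprint changes relative to
    the object when the input is resized, which makes predictions near
    object boundaries depend on the input scale.
    """
    f = frame.astype(float)
    shifted = np.concatenate([f[:, 1:], f[:, -1:]], axis=1)
    return 0.5 * (f + shifted)

# Usage: a frame with a vertical step edge between background and object.
frame = np.zeros((8, 12))
frame[:, 6:] = 1.0
mean_map, var_map = scale_inconsistency(toy_predict, frame)
```

In this sketch, `var_map` is zero wherever the predictions from all scales agree and positive where they differ, so it could serve as a per-pixel weight or regularization signal in the spirit of the abstract.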

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2205.01197 [cs.CV]
	(or arXiv:2205.01197v1 [cs.CV] for this version)

Submission history

From: Hengyi Wang
[v1] Mon, 2 May 2022 20:22:29 GMT (13683kb,D)
