Current browse context:
cs.CV
Change to browse by:
References & Citations
Computer Science > Computer Vision and Pattern Recognition
Title: SynthRef: Generation of Synthetic Referring Expressions for Object Segmentation
(Submitted on 8 Jun 2021 (v1), last revised 9 Jun 2021 (this version, v2))
Abstract: Recent advances in deep learning have brought significant progress in visual grounding tasks such as language-guided video object segmentation. However, collecting large datasets for these tasks is expensive in terms of annotation time, which represents a bottleneck. To this end, we propose a novel method, namely SynthRef, for generating synthetic referring expressions for target objects in an image (or video frame), and we also present and disseminate the first large-scale dataset with synthetic referring expressions for video object segmentation. Our experiments demonstrate that by training with our synthetic referring expressions one can improve the ability of a model to generalize across different datasets, without any additional annotation cost. Moreover, our formulation allows its application to any object detection or segmentation dataset.
Submission history
From: Xavier Giró-i-Nieto [view email][v1] Tue, 8 Jun 2021 14:28:13 GMT (6742kb,D)
[v2] Wed, 9 Jun 2021 05:39:51 GMT (6742kb,D)
Link back to: arXiv, form interface, contact.