SynthRef: Generation of Synthetic Referring Expressions for Object Segmentation

Kazakos, Ioannis; Ventura, Carles; Bellver, Miriam; Silberer, Carina; Giro-i-Nieto, Xavier

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2106

Computer Science > Computer Vision and Pattern Recognition

Title: SynthRef: Generation of Synthetic Referring Expressions for Object Segmentation

Authors: Ioannis Kazakos, Carles Ventura, Miriam Bellver, Carina Silberer, Xavier Giro-i-Nieto

(Submitted on 8 Jun 2021 (v1), last revised 9 Jun 2021 (this version, v2))

Abstract: Recent advances in deep learning have brought significant progress in visual grounding tasks such as language-guided video object segmentation. However, collecting large datasets for these tasks is expensive in terms of annotation time, which represents a bottleneck. To this end, we propose a novel method, namely SynthRef, for generating synthetic referring expressions for target objects in an image (or video frame), and we also present and disseminate the first large-scale dataset with synthetic referring expressions for video object segmentation. Our experiments demonstrate that by training with our synthetic referring expressions one can improve the ability of a model to generalize across different datasets, without any additional annotation cost. Moreover, our formulation allows its application to any object detection or segmentation dataset.

Comments:	Accepted as poster at the NAACL 2021 Visually Grounded Interaction and Language (ViGIL) Workshop. 4 pages. Project website: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
Cite as:	arXiv:2106.04403 [cs.CV]
	(or arXiv:2106.04403v2 [cs.CV] for this version)

Submission history

From: Xavier Giró-i-Nieto [view email]
[v1] Tue, 8 Jun 2021 14:28:13 GMT (6742kb,D)
[v2] Wed, 9 Jun 2021 05:39:51 GMT (6742kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2106.04403

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: SynthRef: Generation of Synthetic Referring Expressions for Object Segmentation

Submission history