Learning Visual Classifiers using Human-centric Annotations

Misra, Ishan; Zitnick, C. Lawrence; Mitchell, Margaret; Girshick, Ross

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 1512

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Learning Visual Classifiers using Human-centric Annotations

Authors: Ishan Misra, C. Lawrence Zitnick, Margaret Mitchell, Ross Girshick

(Submitted on 22 Dec 2015 (this version), latest version 12 Apr 2016 (v2))

Abstract: Many datasets contain human-centric annotations that are the result of humans applying their own subjective judgements on what to describe and what to ignore. Examples include image tags and keywords found on photo sharing sites, or in datasets containing image captions. In this paper, we explore the use of human-centric annotations for learning image classifiers. Due to human reporting bias, these annotations miss a significant amount of the information present in an image. We propose an algorithm to decouple the human reporting bias from the correct visually grounded labels. Our algorithm provides results that are highly interpretable for reporting "what's in the image" versus "what's worth saying." We show improvements over traditional learning algorithms for both image classification and image captioning, and evaluate the algorithm's efficacy along a variety of metrics and datasets, including MS COCO and Yahoo Flickr 100M.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1512.06974 [cs.CV]
	(or arXiv:1512.06974v1 [cs.CV] for this version)

Submission history

From: Ishan Misra [view email]
[v1] Tue, 22 Dec 2015 07:28:06 GMT (5337kb,D)
[v2] Tue, 12 Apr 2016 19:58:29 GMT (2324kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1512.06974v1

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Learning Visual Classifiers using Human-centric Annotations

Submission history