Learning Models for Actions and Person-Object Interactions with Transfer to Question Answering

Mallya, Arun; Lazebnik, Svetlana

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 1604

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Learning Models for Actions and Person-Object Interactions with Transfer to Question Answering

Authors: Arun Mallya, Svetlana Lazebnik

(Submitted on 16 Apr 2016 (v1), last revised 28 Jul 2016 (this version, v2))

Abstract: This paper proposes deep convolutional network models that utilize local and global context to make human activity label predictions in still images, achieving state-of-the-art performance on two recent datasets with hundreds of labels each. We use multiple instance learning to handle the lack of supervision on the level of individual person instances, and weighted loss to handle unbalanced training data. Further, we show how specialized features trained on these datasets can be used to improve accuracy on the Visual Question Answering (VQA) task, in the form of multiple choice fill-in-the-blank questions (Visual Madlibs). Specifically, we tackle two types of questions on person activity and person-object relationship and show improvements over generic features trained on the ImageNet classification task.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1604.04808 [cs.CV]
	(or arXiv:1604.04808v2 [cs.CV] for this version)

Submission history

From: Arun Mallya [view email]
[v1] Sat, 16 Apr 2016 22:54:05 GMT (9614kb,D)
[v2] Thu, 28 Jul 2016 04:44:36 GMT (8825kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1604.04808

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Learning Models for Actions and Person-Object Interactions with Transfer to Question Answering

Submission history