Subjects and Their Objects: Localizing Interactees for a Person-Centric View of Importance

Chen, Chao-Yeh; Grauman, Kristen

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 1604

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Subjects and Their Objects: Localizing Interactees for a Person-Centric View of Importance

Authors: Chao-Yeh Chen, Kristen Grauman

(Submitted on 17 Apr 2016)

Abstract: Understanding images with people often entails understanding their \emph{interactions} with other objects or people. As such, given a novel image, a vision system ought to infer which other objects/people play an important role in a given person's activity. However, existing methods are limited to learning action-specific interactions (e.g., how the pose of a tennis player relates to the position of his racquet when serving the ball) for improved recognition, making them unequipped to reason about novel interactions with actions or objects unobserved in the training data.
We propose to predict the "interactee" in novel images---that is, to localize the \emph{object} of a person's action. Given an arbitrary image with a detected person, the goal is to produce a saliency map indicating the most likely positions and scales where that person's interactee would be found. To that end, we explore ways to learn the generic, action-independent connections between (a) representations of a person's pose, gaze, and scene cues and (b) the interactee object's position and scale. We provide results on a newly collected UT Interactee dataset spanning more than 10,000 images from SUN, PASCAL, and COCO. We show that the proposed interaction-informed saliency metric has practical utility for four tasks: contextual object detection, image retargeting, predicting object importance, and data-driven natural language scene description. All four scenarios reveal the value in linking the subject to its object in order to understand the story of an image.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1604.04842 [cs.CV]
	(or arXiv:1604.04842v1 [cs.CV] for this version)

Submission history

From: Chao-Yeh Chen [view email]
[v1] Sun, 17 Apr 2016 08:26:31 GMT (21995kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1604.04842

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Subjects and Their Objects: Localizing Interactees for a Person-Centric View of Importance

Submission history