References & Citations
Computer Science > Computer Vision and Pattern Recognition
Title: Learning Goals from Failure
(Submitted on 28 Jun 2020 (v1), last revised 13 Dec 2020 (this version, v2))
Abstract: We introduce a framework that predicts the goals behind observable human action in video. Motivated by evidence in developmental psychology, we leverage video of unintentional action to learn video representations of goals without direct supervision. Our approach models videos as contextual trajectories that represent both low-level motion and high-level action features. Experiments and visualizations show our trained model is able to predict the underlying goals in video of unintentional action. We also propose a method to "automatically correct" unintentional action by leveraging gradient signals of our model to adjust latent trajectories. Although the model is trained with minimal supervision, it is competitive with or outperforms baselines trained on large (supervised) datasets of successfully executed goals, showing that observing unintentional action is crucial to learning about goals in video. Project page: this https URL
Submission history
From: Dave Epstein [view email][v1] Sun, 28 Jun 2020 17:16:49 GMT (9406kb,D)
[v2] Sun, 13 Dec 2020 01:44:08 GMT (41888kb,D)
Link back to: arXiv, form interface, contact.