Multi-Object Representation Learning with Iterative Variational Inference

Greff, Klaus; Kaufman, Raphaël Lopez; Kabra, Rishabh; Watters, Nick; Burgess, Chris; Zoran, Daniel; Matthey, Loic; Botvinick, Matthew; Lerchner, Alexander

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1903

Computer Science > Machine Learning

Title: Multi-Object Representation Learning with Iterative Variational Inference

Authors: Klaus Greff, Raphaël Lopez Kaufman, Rishabh Kabra, Nick Watters, Chris Burgess, Daniel Zoran, Loic Matthey, Matthew Botvinick, Alexander Lerchner

(Submitted on 1 Mar 2019 (v1), last revised 27 Jul 2020 (this version, v3))

Abstract: Human perception is structured around objects which form the basis for our higher-level cognition and impressive systematic generalization abilities. Yet most work on representation learning focuses on feature learning without even considering multiple objects, or treats segmentation as an (often supervised) preprocessing step. Instead, we argue for the importance of learning to segment and represent objects jointly. We demonstrate that, starting from the simple assumption that a scene is composed of multiple entities, it is possible to learn to segment images into interpretable objects with disentangled representations. Our method learns -- without supervision -- to inpaint occluded parts, and extrapolates to scenes with more objects and to unseen objects with novel feature combinations. We also show that, due to the use of iterative variational inference, our system is able to learn multi-modal posteriors for ambiguous inputs and extends naturally to sequences.

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Journal reference:	ICML 2019 (PMLR 97:2424-2433)
Cite as:	arXiv:1903.00450 [cs.LG]
	(or arXiv:1903.00450v3 [cs.LG] for this version)

Submission history

From: Klaus Greff [view email]
[v1] Fri, 1 Mar 2019 18:21:02 GMT (18525kb,D)
[v2] Wed, 15 May 2019 23:21:01 GMT (16337kb,D)
[v3] Mon, 27 Jul 2020 19:55:14 GMT (25946kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1903.00450

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Multi-Object Representation Learning with Iterative Variational Inference

Submission history