Title: VAE with a VampPrior

Abstract: Many different methods to train deep generative models have been introduced in the past. In this paper, we propose to extend the variational auto-encoder (VAE) framework with a new type of prior which we call the "Variational Mixture of Posteriors" prior, or VampPrior for short. The VampPrior consists of a mixture distribution (e.g., a mixture of Gaussians) with components given by variational posteriors conditioned on learnable pseudo-inputs. We further extend this prior to a two-layer hierarchical model and show that this architecture, with a coupled prior and posterior, learns significantly better models. The model also avoids the usual local optima issues related to useless latent dimensions that plague VAEs. We provide empirical studies on six datasets, namely, static and binary MNIST, OMNIGLOT, Caltech 101 Silhouettes, Frey Faces and Histopathology patches, and show that applying the hierarchical VampPrior delivers state-of-the-art results on all datasets in the unsupervised permutation-invariant setting, and results that are the best or comparable to state-of-the-art methods for the approach with convolutional networks.
Comments: 16 pages, final version, AISTATS 2018
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as: arXiv:1705.07120 [cs.LG]
  (or arXiv:1705.07120v5 [cs.LG] for this version)
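
The abstract describes the VampPrior as a mixture whose components are variational posteriors conditioned on learnable pseudo-inputs. The sketch below illustrates that idea in NumPy under several assumptions that are not taken from this page: the mixture weights are uniform, the posterior is a diagonal Gaussian, and the encoder is a hypothetical linear map (`toy_encoder`, `w_mean`, `w_log_var` are illustrative names, and the pseudo-inputs are random here rather than learned). It is a minimal illustration of evaluating log p(z), not the authors' implementation.

```python
import numpy as np


def log_normal_diag(z, mean, log_var):
    """Log-density of a diagonal Gaussian, summed over the last axis."""
    return -0.5 * np.sum(
        np.log(2.0 * np.pi) + log_var + (z - mean) ** 2 / np.exp(log_var),
        axis=-1,
    )


def toy_encoder(x, w_mean, w_log_var):
    """Hypothetical linear encoder: returns mean and log-variance of q(z | x)."""
    return x @ w_mean, x @ w_log_var


def vamp_prior_log_prob(z, pseudo_inputs, w_mean, w_log_var):
    """log p(z) = log((1/K) * sum_k q(z | u_k)) for a batch of latents z.

    Uniform mixture weights are an assumption of this sketch.
    """
    # Run the K pseudo-inputs through the encoder: one component each, shape (K, D).
    mean, log_var = toy_encoder(pseudo_inputs, w_mean, w_log_var)
    # Broadcast z (B, 1, D) against the components (1, K, D) to get (B, K).
    log_q = log_normal_diag(z[:, None, :], mean[None], log_var[None])
    # Numerically stable log-mean-exp over the K components.
    m = log_q.max(axis=1, keepdims=True)
    return (m + np.log(np.mean(np.exp(log_q - m), axis=1, keepdims=True)))[:, 0]


rng = np.random.default_rng(0)
K, input_dim, latent_dim = 5, 8, 2
pseudo_inputs = rng.uniform(size=(K, input_dim))  # would be trained jointly with the model
w_mean = rng.normal(size=(input_dim, latent_dim))
w_log_var = 0.1 * rng.normal(size=(input_dim, latent_dim))

z = rng.normal(size=(3, latent_dim))
log_p = vamp_prior_log_prob(z, pseudo_inputs, w_mean, w_log_var)
print(log_p.shape)
```

Because the components reuse the encoder, the prior and posterior share parameters, which is the coupling the abstract refers to. In a real model the pseudo-inputs and encoder weights would be optimized together through the training objective.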

Submission history

From: Jakub Tomczak Ph.D.
[v1] Fri, 19 May 2017 10:07:00 GMT (921kb,D)
[v2] Mon, 24 Jul 2017 14:14:08 GMT (1480kb,D)
[v3] Mon, 21 Aug 2017 12:21:47 GMT (2445kb,D)
[v4] Fri, 13 Oct 2017 17:54:28 GMT (2679kb,D)
[v5] Mon, 26 Feb 2018 15:23:53 GMT (2681kb,D)
