Current browse context:
stat.ML
Change to browse by:
References & Citations
Statistics > Machine Learning
Title: Learning Deep Generative Models with Short Run Inference Dynamics
(Submitted on 4 Dec 2019 (v1), revised 8 Dec 2019 (this version, v2), latest version 17 Jul 2020 (v5))
Abstract: This paper studies the fundamental problem of learning deep generative models that consist of one or more layers of latent variables organized in top-down architectures. Learning such a generative model requires inferring the latent variables for each training example based on the posterior distribution of these latent variables. The inference typically requires Markov chain Monte Caro (MCMC) that can be time consuming. In this paper, we propose to use short run inference dynamics guided by the log-posterior, such as finite-step gradient descent algorithm initialized from the prior distribution of the latent variables, as an approximate sampler of the posterior distribution, where the step size of the gradient descent dynamics is optimized by minimizing the Kullback-Leibler divergence between the distribution produced by the short run inference dynamics and the posterior distribution. Our experiments show that the proposed method outperforms variational auto-encoder (VAE) in terms of reconstruction error and synthesis quality. The advantage of the proposed method is that it is natural and automatic, even for models with multiple layers of latent variables.
Submission history
From: Erik Nijkamp [view email][v1] Wed, 4 Dec 2019 11:42:14 GMT (1868kb,D)
[v2] Sun, 8 Dec 2019 20:14:18 GMT (1868kb,D)
[v3] Sat, 14 Dec 2019 21:20:30 GMT (1869kb,D)
[v4] Thu, 18 Jun 2020 10:16:11 GMT (2979kb,D)
[v5] Fri, 17 Jul 2020 22:54:26 GMT (2958kb,D)
Link back to: arXiv, form interface, contact.