Statistics > Machine Learning
Title: Learning Multi-layer Latent Variable Model with Short Run MCMC Inference Dynamics
(Submitted on 4 Dec 2019 (v1), revised 18 Jun 2020 (this version, v4), latest version 17 Jul 2020 (v5))
Abstract: This paper studies the fundamental problem of learning deep generative models that consist of multiple layers of latent variables organized in top-down architectures. Such models have high expressivity and allow for learning hierarchical representations. Learning such a generative model requires inferring the latent variables for each training example based on the posterior distribution of these latent variables. The inference typically requires Markov chain Monte Carlo (MCMC), which can be time-consuming. In this paper, we propose to use short run MCMC inference dynamics, such as a finite-step Langevin algorithm initialized from the prior distribution of the latent variables, as an approximate sampler of the posterior distribution. The step size of the Langevin dynamics is optimized by minimizing the Kullback-Leibler divergence between the distribution produced by the short run MCMC inference dynamics and the posterior distribution. Our experiments show that the proposed method outperforms the variational auto-encoder (VAE) in terms of reconstruction error and synthesis quality. The advantage of the proposed method is that it is simple and automatic, with no need to design an inference model.
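The inference step described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a single-layer linear Gaussian model, x = W z + noise with prior z ~ N(0, I), chosen only because its exact posterior is known in closed form. The function name `short_run_langevin`, the matrix `W`, and all parameter values are hypothetical. The step size is fixed by hand here, whereas the paper optimizes it by minimizing a KL divergence.

```python
import numpy as np

def short_run_langevin(x, W, sigma, step_size, n_steps, n_chains, rng):
    """Run a fixed number of Langevin steps on z, starting from the prior.

    Toy model (an assumption for illustration): x = W z + N(0, sigma^2 I),
    with prior z ~ N(0, I).
    """
    d = W.shape[1]
    # Initialize every chain from the prior distribution of z.
    z = rng.standard_normal((n_chains, d))
    for _ in range(n_steps):
        residual = x - z @ W.T                 # shape (n_chains, D)
        # Gradient of log p(x, z) with respect to z:
        # likelihood term plus standard normal prior term.
        grad = residual @ W / sigma**2 - z
        noise = rng.standard_normal(z.shape)
        z = z + step_size * grad + np.sqrt(2.0 * step_size) * noise
    return z

# Hypothetical toy setup: 5 observed dimensions, 2 latent dimensions.
W = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [1.0, 1.0],
              [1.0, -1.0],
              [0.5, 0.5]])
sigma = 0.5
x = W @ np.array([1.0, -1.0])                  # noise-free observation

rng = np.random.default_rng(0)
z_samples = short_run_langevin(x, W, sigma, step_size=0.05,
                               n_steps=30, n_chains=2000, rng=rng)

# Exact posterior mean of the linear Gaussian model, for comparison.
precision = W.T @ W / sigma**2 + np.eye(2)
posterior_mean = np.linalg.solve(precision, W.T @ x / sigma**2)

print("short-run sample mean:", z_samples.mean(axis=0))
print("exact posterior mean: ", posterior_mean)
```

In this toy case the average of the short-run samples can be compared directly against the closed-form posterior mean. In the multi-layer nonlinear models the paper targets, no such closed form exists, and the gradient would come from backpropagation through the top-down network rather than from the explicit formula used above.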
Submission history
From: Erik Nijkamp
[v1] Wed, 4 Dec 2019 11:42:14 GMT (1868kb,D)
[v2] Sun, 8 Dec 2019 20:14:18 GMT (1868kb,D)
[v3] Sat, 14 Dec 2019 21:20:30 GMT (1869kb,D)
[v4] Thu, 18 Jun 2020 10:16:11 GMT (2979kb,D)
[v5] Fri, 17 Jul 2020 22:54:26 GMT (2958kb,D)