Reparameterization Gradients through Acceptance-Rejection Sampling Algorithms

Naesseth, Christian A.; Ruiz, Francisco J. R.; Linderman, Scott W.; Blei, David M.

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 1610

Statistics > Machine Learning

Title: Reparameterization Gradients through Acceptance-Rejection Sampling Algorithms

Authors: Christian A. Naesseth, Francisco J. R. Ruiz, Scott W. Linderman, David M. Blei

(Submitted on 18 Oct 2016 (v1), last revised 12 Feb 2020 (this version, v3))

Abstract: Variational inference using the reparameterization trick has enabled large-scale approximate Bayesian inference in complex probabilistic models, leveraging stochastic optimization to sidestep intractable expectations. The reparameterization trick is applicable when we can simulate a random variable by applying a differentiable deterministic function on an auxiliary random variable whose distribution is fixed. For many distributions of interest (such as the gamma or Dirichlet), simulation of random variables relies on acceptance-rejection sampling. The discontinuity introduced by the accept-reject step means that standard reparameterization tricks are not applicable. We propose a new method that lets us leverage reparameterization gradients even when variables are outputs of a acceptance-rejection sampling algorithm. Our approach enables reparameterization on a larger class of variational distributions. In several studies of real and synthetic data, we show that the variance of the estimator of the gradient is significantly lower than other state-of-the-art methods. This leads to faster convergence of stochastic gradient variational inference.

Comments:	An error in the von Mises distribution reparameterization in Table 2 has been corrected
Subjects:	Machine Learning (stat.ML); Methodology (stat.ME)
Cite as:	arXiv:1610.05683 [stat.ML]
	(or arXiv:1610.05683v3 [stat.ML] for this version)

Submission history

From: Christian A. Naesseth [view email]
[v1] Tue, 18 Oct 2016 15:55:08 GMT (778kb,D)
[v2] Fri, 10 Mar 2017 14:16:52 GMT (960kb,D)
[v3] Wed, 12 Feb 2020 15:01:15 GMT (960kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:1610.05683

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: Reparameterization Gradients through Acceptance-Rejection Sampling Algorithms

Submission history