Title: Reparameterization Gradients through Acceptance-Rejection Sampling Algorithms

Abstract: Variational inference using the reparameterization trick has enabled large-scale approximate Bayesian inference in complex probabilistic models, leveraging stochastic optimization to sidestep intractable expectations. The reparameterization trick is applicable when we can simulate a random variable by applying a differentiable deterministic function to an auxiliary random variable whose distribution is fixed. For many distributions of interest (such as the gamma or Dirichlet), simulation of random variables relies on acceptance-rejection sampling. The discontinuity introduced by the accept-reject step means that standard reparameterization tricks are not applicable. We propose a new method that lets us leverage reparameterization gradients even when variables are outputs of an acceptance-rejection sampling algorithm. Our approach enables reparameterization on a larger class of variational distributions. In several studies of real and synthetic data, we show that the variance of the estimator of the gradient is significantly lower than that of other state-of-the-art methods. This leads to faster convergence of stochastic gradient variational inference.
Comments: An error in the von Mises distribution reparameterization in Table 2 has been corrected
Subjects: Machine Learning (stat.ML); Methodology (stat.ME)
Cite as: arXiv:1610.05683 [stat.ML]
  (or arXiv:1610.05683v3 [stat.ML] for this version)
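
The two ingredients the abstract contrasts can be sketched in a few lines of Python. This is an illustrative sketch only, not the paper's gradient estimator: the function names are hypothetical, the Gaussian example and the test function z^2 are chosen for convenience, and the Marsaglia-Tsang gamma sampler is assumed here as a representative acceptance-rejection algorithm. The first part shows a standard reparameterization gradient; the second shows how the accept-reject loop wraps an otherwise smooth transformation in a discontinuous decision.

```python
import math
import random


def gaussian_reparam(mu, sigma, eps):
    """Differentiable map z = mu + sigma * eps, with eps ~ N(0, 1) fixed."""
    return mu + sigma * eps


def grad_mu_estimate(mu, sigma, n, rng):
    """Monte Carlo estimate of d/dmu E[z^2] via the reparameterization trick.

    Since z = mu + sigma * eps, d(z^2)/dmu = 2 * z * dz/dmu = 2 * z.
    The exact answer is 2 * mu.
    """
    total = 0.0
    for _ in range(n):
        z = gaussian_reparam(mu, sigma, rng.gauss(0.0, 1.0))
        total += 2.0 * z
    return total / n


def gamma_accept_reject(alpha, rng):
    """Marsaglia-Tsang sampler for Gamma(alpha, 1), assuming alpha >= 1.

    Returns (sample, accepted eps, number of proposals). The proposal
    d * (1 + c * eps)**3 is smooth in alpha, but the accept/reject
    decision is not, which is the obstacle the abstract describes.
    """
    d = alpha - 1.0 / 3.0
    c = 1.0 / math.sqrt(9.0 * d)
    tries = 0
    while True:
        tries += 1
        eps = rng.gauss(0.0, 1.0)
        v = (1.0 + c * eps) ** 3
        if v <= 0.0:
            continue  # proposal outside the support: reject
        u = 1.0 - rng.random()  # uniform on (0, 1], avoids log(0)
        if math.log(u) < 0.5 * eps * eps + d - d * v + d * math.log(v):
            return d * v, eps, tries
```

For example, `grad_mu_estimate(1.5, 0.5, 20000, random.Random(0))` should land close to the exact value 2 * 1.5 = 3, while `gamma_accept_reject(3.0, random.Random(0))` returns a positive draw together with the auxiliary `eps` that was accepted.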

Submission history

From: Christian A. Naesseth
[v1] Tue, 18 Oct 2016 15:55:08 GMT (778kb,D)
[v2] Fri, 10 Mar 2017 14:16:52 GMT (960kb,D)
[v3] Wed, 12 Feb 2020 15:01:15 GMT (960kb,D)
