Title: User-friendly guarantees for the Langevin Monte Carlo with inaccurate gradient

Abstract: In this paper, we study the problem of sampling from a given probability density function that is known to be smooth and strongly log-concave. We analyze several methods of approximate sampling based on discretizations of the (highly overdamped) Langevin diffusion and establish guarantees on their error measured in the Wasserstein-2 distance. Our guarantees improve or extend the state-of-the-art results in three directions. First, we provide an upper bound on the error of the first-order Langevin Monte Carlo (LMC) algorithm with optimized varying step-size. This result has the advantage of being horizon-free (the target precision need not be known in advance) and of improving by a logarithmic factor on the corresponding result for the constant step-size. Second, we study the case where accurate evaluations of the gradient of the log-density are unavailable, but approximations of this gradient are accessible. In this situation, we consider both deterministic and stochastic approximations of the gradient and provide an upper bound on the sampling error of the first-order LMC that quantifies the impact of the gradient evaluation inaccuracies. Third, we establish upper bounds for two versions of the second-order LMC, which leverage the Hessian of the log-density. We provide nonasymptotic guarantees on the sampling error of these second-order LMCs. These guarantees reveal that the second-order LMC algorithms improve on the first-order LMC in ill-conditioned settings.
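
As an illustration of the first-order LMC update mentioned in the abstract, the following is a minimal sketch and not the paper's implementation. It assumes a target density proportional to exp(-f), a constant step-size (not the optimized varying step-size analyzed in the paper), and, optionally, independent Gaussian noise added to each gradient coordinate as one possible model of a stochastic gradient approximation. All function names, the standard Gaussian target, and the parameter values are hypothetical choices made for the example.

```python
import math
import random


def lmc_final_iterate(grad_f, theta0, step, n_iters, rng, grad_noise_std=0.0):
    """Run first-order LMC with a constant step-size; return the last iterate.

    Update: theta <- theta - step * g + sqrt(2 * step) * xi,
    where xi is standard Gaussian and g is grad_f(theta), optionally
    perturbed by Gaussian noise (an assumed model of gradient inaccuracy).
    """
    theta = list(theta0)
    scale = math.sqrt(2.0 * step)
    for _ in range(n_iters):
        grad = grad_f(theta)
        theta = [
            t
            - step * (g + rng.gauss(0.0, grad_noise_std))
            + scale * rng.gauss(0.0, 1.0)
            for t, g in zip(theta, grad)
        ]
    return theta


def gaussian_grad(theta):
    """Gradient of f(theta) = ||theta||^2 / 2 (standard Gaussian target)."""
    return list(theta)


def run_chains(n_chains, n_iters, step, seed, grad_noise_std=0.0):
    """Run independent chains from (5, 5); collect the first coordinate."""
    rng = random.Random(seed)
    return [
        lmc_final_iterate(
            gaussian_grad, [5.0, 5.0], step, n_iters, rng, grad_noise_std
        )[0]
        for _ in range(n_chains)
    ]


if __name__ == "__main__":
    samples = run_chains(n_chains=1000, n_iters=100, step=0.1, seed=0)
    mean = sum(samples) / len(samples)
    var = sum((s - mean) ** 2 for s in samples) / len(samples)
    print("empirical mean:", mean)
    print("empirical variance:", var)
```

For this particular Gaussian target with exact gradients, the chain with step-size h has stationary variance 1 / (1 - h / 2) per coordinate rather than 1, which gives a small concrete instance of the discretization error that guarantees of this kind control. Passing a positive `grad_noise_std` inflates the variance further.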
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Probability (math.PR); Computation (stat.CO); Machine Learning (stat.ML)
Journal reference: Stochastic Processes and their Applications, Volume 129, Issue 12, December 2019, Pages 5278-5311
DOI: 10.1016/j.spa.2019.02.016
Cite as: arXiv:1710.00095 [math.ST]
  (or arXiv:1710.00095v4 [math.ST] for this version)

Submission history

From: Arnak Dalalyan S.
[v1] Fri, 29 Sep 2017 21:15:03 GMT (115kb)
[v2] Mon, 6 Nov 2017 21:01:23 GMT (117kb)
[v3] Mon, 10 Sep 2018 11:46:52 GMT (119kb)
[v4] Fri, 23 Feb 2024 17:39:40 GMT (169kb,D)
