User-friendly guarantees for the Langevin Monte Carlo with inaccurate gradient

Dalalyan, Arnak S.; Karagulyan, Avetik G.

doi:10.1016/j.spa.2019.02.016

(Submitted on 29 Sep 2017 (v1), last revised 23 Feb 2024 (this version, v4))

Abstract: In this paper, we study the problem of sampling from a given probability density function that is known to be smooth and strongly log-concave. We analyze several methods of approximate sampling based on discretizations of the (highly overdamped) Langevin diffusion and establish guarantees on their error measured in the Wasserstein-2 distance. Our guarantees improve or extend the state-of-the-art results in three directions. First, we provide an upper bound on the error of the first-order Langevin Monte Carlo (LMC) algorithm with optimized varying step-size. This result has the advantage of being horizon free (we do not need to know the target precision in advance) and of improving by a logarithmic factor on the corresponding result for the constant step-size. Second, we study the case where accurate evaluations of the gradient of the log-density are unavailable, but one has access to approximations of this gradient. In this situation, we consider both deterministic and stochastic approximations of the gradient and provide an upper bound on the sampling error of the first-order LMC that quantifies the impact of the gradient evaluation inaccuracies. Third, we establish upper bounds for two versions of the second-order LMC, which leverage the Hessian of the log-density. We provide nonasymptotic guarantees on the sampling error of these second-order LMCs. These guarantees reveal that the second-order LMC algorithms improve on the first-order LMC in ill-conditioned settings.
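To make the setting of the abstract concrete, the following is a minimal sketch of first-order LMC with a stochastically perturbed gradient. It is not taken from the paper. It assumes the standard update theta_{k+1} = theta_k - h * g_k + sqrt(2h) * xi_k for a target density proportional to exp(-f), a constant step-size h rather than the optimized varying one, and a gradient estimate g_k equal to the exact gradient plus zero-mean Gaussian noise. The function `lmc`, the parameter `grad_noise_std`, and the two-dimensional Gaussian target are hypothetical names and choices made only for illustration.

```python
import numpy as np


def lmc(grad_f, theta0, step, n_iter, rng, grad_noise_std=0.0):
    """Run first-order LMC with constant step-size and return the final iterate.

    theta0 may have shape (n_chains, d); each row is then an independent chain.
    """
    theta = np.array(theta0, dtype=float)
    for _ in range(n_iter):
        # Inaccurate gradient: exact gradient plus zero-mean Gaussian noise
        # (an illustrative noise model, assumed here, not the paper's).
        g = grad_f(theta) + grad_noise_std * rng.standard_normal(theta.shape)
        # Standard Gaussian innovation of the discretized Langevin diffusion.
        xi = rng.standard_normal(theta.shape)
        theta = theta - step * g + np.sqrt(2.0 * step) * xi
    return theta


# Hypothetical target: centered Gaussian with f(theta) = 0.5 * sum(a_i * theta_i^2),
# which is strongly log-concave (m = 1) with a Lipschitz gradient (M = 4).
a = np.array([1.0, 4.0])


def grad_f(theta):
    return theta * a


rng = np.random.default_rng(0)
theta0 = np.full((5000, 2), 3.0)  # 5000 independent chains, all started at (3, 3)
samples = lmc(grad_f, theta0, step=0.05, n_iter=400, rng=rng, grad_noise_std=0.5)

# The target variances are 1 / a_i = (1.0, 0.25); the empirical values differ
# slightly because of the discretization and the gradient noise.
empirical_var = samples.var(axis=0)
print(empirical_var)
```

For this toy target, the empirical variances should land near 1 and 0.25, with a small upward bias that grows with both the step-size and the gradient noise level. This is the kind of effect the bounds described in the abstract quantify in Wasserstein-2 distance.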

Subjects:	Statistics Theory (math.ST); Machine Learning (cs.LG); Probability (math.PR); Computation (stat.CO); Machine Learning (stat.ML)
Journal reference:	Stochastic Processes and their Applications, Volume 129, Issue 12, December 2019, Pages 5278-5311
DOI:	10.1016/j.spa.2019.02.016
Cite as:	arXiv:1710.00095 [math.ST]
	(or arXiv:1710.00095v4 [math.ST] for this version)

Submission history

From: Arnak Dalalyan S.
[v1] Fri, 29 Sep 2017 21:15:03 GMT (115kb)
[v2] Mon, 6 Nov 2017 21:01:23 GMT (117kb)
[v3] Mon, 10 Sep 2018 11:46:52 GMT (119kb)
[v4] Fri, 23 Feb 2024 17:39:40 GMT (169kb,D)