Non-Convex Stochastic Optimization via Non-Reversible Stochastic Gradient Langevin Dynamics

Hu, Yuanhan; Wang, Xiaoyu; Gao, Xuefeng; Gurbuzbalaban, Mert; Zhu, Lingjiong

Full-text links:

Download:

Current browse context:

math.OC

< prev | next >

new | recent | 2004

Mathematics > Optimization and Control

Title: Non-Convex Stochastic Optimization via Non-Reversible Stochastic Gradient Langevin Dynamics

Authors: Yuanhan Hu, Xiaoyu Wang, Xuefeng Gao, Mert Gurbuzbalaban, Lingjiong Zhu

(Submitted on 6 Apr 2020 (this version), latest version 2 Jun 2020 (v2))

Abstract: Stochastic gradient Langevin dynamics (SGLD) is a poweful algorithm for optimizing a non-convex objective, where a controlled and properly scaled Gaussian noise is added to the stochastic gradients to steer the iterates towards a global minimum. SGLD is based on the overdamped Langevin diffusion which is reversible in time. By adding an anti-symmetric matrix to the drift term of the overdamped Langevin diffusion, one gets a non-reversible diffusion that converges to the same stationary distribution with a faster convergence rate. In this paper, we study the non-reversible stochastic gradient Langevin dynamics (NSGLD) which is based on discretization of the non-reversible Langevin diffusion. We provide finite time performance bounds for the global convergence of NSGLD for solving stochastic non-convex optimization problems. Our results lead to non-asymptotic guarantees for both population and empirical risk minimization problems. Numerical experiments for a simple polynomial function optimization, Bayesian independent component analysis and neural network models show that NSGLD can outperform SGLD with proper choices of the anti-symmetric matrix.

Comments:	49 pages
Subjects:	Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2004.02823 [math.OC]
	(or arXiv:2004.02823v1 [math.OC] for this version)

Submission history

From: Lingjiong Zhu [view email]
[v1] Mon, 6 Apr 2020 17:11:03 GMT (657kb,D)
[v2] Tue, 2 Jun 2020 20:49:37 GMT (685kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> math > arXiv:2004.02823v1

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Mathematics > Optimization and Control

Title: Non-Convex Stochastic Optimization via Non-Reversible Stochastic Gradient Langevin Dynamics

Submission history