Variance reduction for Riemannian non-convex optimization with batch size adaptation

Han, Andi; Gao, Junbin

(Submitted on 3 Jul 2020)

Abstract: Variance reduction techniques are widely used to accelerate gradient descent and stochastic gradient descent for optimization problems defined on both Euclidean spaces and Riemannian manifolds. In this paper, we further improve existing variance reduction methods for non-convex Riemannian optimization, including R-SVRG and R-SRG/R-SPIDER, by adapting the batch size. We show that this strategy can achieve lower total complexities for optimizing both general non-convex and gradient-dominated functions under both finite-sum and online settings. As a result, we also provide a simpler convergence analysis for R-SVRG and improved complexity bounds for R-SRG under the finite-sum setting. Specifically, we prove that R-SRG achieves the same near-optimal complexity as R-SPIDER without requiring a small step size. Empirical experiments on a variety of tasks demonstrate the effectiveness of the proposed adaptive batch size scheme.

Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2007.01494 [math.OC]
	(or arXiv:2007.01494v1 [math.OC] for this version)

Submission history

From: Andi Han
[v1] Fri, 3 Jul 2020 04:34:39 GMT (234kb,D)
