On the Convergence of SARAH and Beyond

Li, Bingcong; Ma, Meng; Giannakis, Georgios B.


(Submitted on 5 Jun 2019 (v1), last revised 16 Jan 2020 (this version, v2))

Abstract: The main theme of this work is a unifying algorithm, LoopLess SARAH (L2S), for problems formulated as a summation of $n$ individual loss functions. L2S broadens a recently developed variance reduction method known as SARAH. To find an $\epsilon$-accurate solution, L2S enjoys a complexity of ${\cal O}\big( (n+\kappa) \ln (1/\epsilon)\big)$ for strongly convex problems. For convex problems, when adopting an $n$-dependent step size, the complexity of L2S is ${\cal O}(n+ \sqrt{n}/\epsilon)$, while for the more frequently adopted $n$-independent step size, the complexity is ${\cal O}(n+ n/\epsilon)$. Distinct from SARAH, our theoretical findings support an $n$-independent step size in convex problems without extra assumptions. For nonconvex problems, the complexity of L2S is ${\cal O}(n+ \sqrt{n}/\epsilon)$. Our numerical tests on neural networks suggest that L2S can have better generalization properties than SARAH. Along with L2S, our side results include the linear convergence of the last iterate of SARAH in strongly convex problems.
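The abstract does not spell out the L2S update, so the following is only a minimal sketch of what a loopless SARAH-style method might look like, not the authors' exact algorithm. It assumes the standard SARAH recursive gradient estimate and assumes that "loopless" means the fixed-length inner loop is replaced by a random restart, where a full gradient is recomputed with some probability at each step. The least-squares loss and the names `grad_i`, `full_grad`, `loopless_sarah`, and `restart_prob` are hypothetical choices made for illustration.

```python
import random

def grad_i(x, a, b):
    """Gradient of the single loss 0.5 * (a.x - b)^2 with respect to x."""
    r = sum(aj * xj for aj, xj in zip(a, x)) - b
    return [r * aj for aj in a]

def full_grad(x, A, y):
    """Average of the n per-sample gradients."""
    n = len(A)
    g = [0.0] * len(x)
    for a, b in zip(A, y):
        gi = grad_i(x, a, b)
        g = [gj + gij / n for gj, gij in zip(g, gi)]
    return g

def loopless_sarah(A, y, x0, step, restart_prob, iters, seed=0):
    """Sketch of a SARAH-type method with random restarts (an assumption,
    not taken from the abstract): with probability restart_prob the
    estimate v is reset to the full gradient, otherwise it is updated
    recursively from one sampled loss."""
    rng = random.Random(seed)
    x_prev = list(x0)
    v = full_grad(x_prev, A, y)  # start from an exact gradient
    x = [xj - step * vj for xj, vj in zip(x_prev, v)]
    for _ in range(iters):
        if rng.random() < restart_prob:
            # Restart: recompute the full gradient at the current point.
            v = full_grad(x, A, y)
        else:
            # SARAH recursion: v <- grad_i(x) - grad_i(x_prev) + v.
            i = rng.randrange(len(A))
            g_new = grad_i(x, A[i], y[i])
            g_old = grad_i(x_prev, A[i], y[i])
            v = [gn - go + vj for gn, go, vj in zip(g_new, g_old, v)]
        x_prev, x = x, [xj - step * vj for xj, vj in zip(x, v)]
    return x

if __name__ == "__main__":
    # Toy problem with n = 4 losses; the data are illustrative only.
    A = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [1.0, -1.0]]
    y = [2.0, -1.0, 1.0, 3.0]
    print(loopless_sarah(A, y, [0.0, 0.0], step=0.1, restart_prob=0.25, iters=3000))
```

In this sketch `restart_prob` plays the role that the inner-loop length plays in SARAH: a value near $1/n$ keeps the expected number of per-sample gradient evaluations between restarts on the order of $n$. The step size here is a fixed, hand-picked constant and is not one of the step-size choices analyzed in the paper.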

Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:1906.02351 [cs.LG]
	(or arXiv:1906.02351v2 [cs.LG] for this version)

Submission history

From: Bingcong Li
[v1] Wed, 5 Jun 2019 23:02:46 GMT (579kb,D)
[v2] Thu, 16 Jan 2020 17:54:07 GMT (688kb,D)
