Asynchronous Stochastic Gradient Descent with Variance Reduction for Non-Convex Optimization

Huo, Zhouyuan; Huang, Heng

Full-text links:

Download:

Computer Science > Machine Learning

Title: Asynchronous Stochastic Gradient Descent with Variance Reduction for Non-Convex Optimization

Authors: Zhouyuan Huo, Heng Huang

(Submitted on 12 Apr 2016 (v1), last revised 20 Dec 2016 (this version, v4))

Abstract: We provide the first theoretical analysis on the convergence rate of the asynchronous stochastic variance reduced gradient (SVRG) descent algorithm on non-convex optimization. Recent studies have shown that the asynchronous stochastic gradient descent (SGD) based algorithms with variance reduction converge with a linear convergent rate on convex problems. However, there is no work to analyze asynchronous SGD with variance reduction technique on non-convex problem. In this paper, we study two asynchronous parallel implementations of SVRG: one is on a distributed memory system and the other is on a shared memory system. We provide the theoretical analysis that both algorithms can obtain a convergence rate of $O(1/T)$, and linear speed up is achievable if the number of workers is upper bounded. V1,v2,v3 have been withdrawn due to reference issue, please refer the newest version v4.

Comments:	V1,v2,v3 have been withdrawn due to reference issue, because arXiv policy, we can't delete them. Please refer the newest version v4
Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:1604.03584 [cs.LG]
	(or arXiv:1604.03584v4 [cs.LG] for this version)

Submission history

From: Zhouyuan Huo [view email]
[v1] Tue, 12 Apr 2016 21:02:38 GMT (12kb)
[v2] Thu, 14 Apr 2016 02:54:36 GMT (12kb)
[v3] Thu, 19 May 2016 16:58:17 GMT (93kb,D)
[v4] Tue, 20 Dec 2016 05:44:39 GMT (93kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1604.03584

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Asynchronous Stochastic Gradient Descent with Variance Reduction for Non-Convex Optimization

Submission history