Without-Replacement Sampling for Stochastic Gradient Methods: Convergence Results and Application to Distributed Optimization

Shamir, Ohad

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1603

Computer Science > Machine Learning

Title: Without-Replacement Sampling for Stochastic Gradient Methods: Convergence Results and Application to Distributed Optimization

Authors: Ohad Shamir

(Submitted on 2 Mar 2016 (v1), last revised 17 Oct 2016 (this version, v3))

Abstract: Stochastic gradient methods for machine learning and optimization problems are usually analyzed assuming data points are sampled \emph{with} replacement. In practice, however, sampling \emph{without} replacement is very common, easier to implement in many cases, and often performs better. In this paper, we provide competitive convergence guarantees for without-replacement sampling, under various scenarios, for three types of algorithms: Any algorithm with online regret guarantees, stochastic gradient descent, and SVRG. A useful application of our SVRG analysis is a nearly-optimal algorithm for regularized least squares in a distributed setting, in terms of both communication complexity and runtime complexity, when the data is randomly partitioned and the condition number can be as large as the data size per machine (up to logarithmic factors). Our proof techniques combine ideas from stochastic optimization, adversarial online learning, and transductive learning theory, and can potentially be applied to other stochastic optimization and learning problems.

Comments:	Fixed a few minor typos, and slightly tightened Corollary 1
Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:1603.00570 [cs.LG]
	(or arXiv:1603.00570v3 [cs.LG] for this version)

Submission history

From: Ohad Shamir [view email]
[v1] Wed, 2 Mar 2016 04:02:57 GMT (32kb)
[v2] Fri, 29 Apr 2016 00:29:34 GMT (32kb)
[v3] Mon, 17 Oct 2016 03:58:41 GMT (32kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1603.00570

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Without-Replacement Sampling for Stochastic Gradient Methods: Convergence Results and Application to Distributed Optimization

Submission history