We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Faster Coordinate Descent via Adaptive Importance Sampling

Abstract: Coordinate descent methods employ random partial updates of decision variables in order to solve huge-scale convex optimization problems. In this work, we introduce new adaptive rules for the random selection of their updates. By adaptive, we mean that our selection rules are based on the dual residual or the primal-dual gap estimates and can change at each iteration. We theoretically characterize the performance of our selection rules and demonstrate improvements over the state-of-the-art, and extend our theory and algorithms to general convex objectives. Numerical evidence with hinge-loss support vector machines and Lasso confirm that the practice follows the theory.
Comments: appearing at AISTATS 2017
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC); Computation (stat.CO); Machine Learning (stat.ML)
ACM classes: G.1.6
Cite as: arXiv:1703.02518 [cs.LG]
  (or arXiv:1703.02518v1 [cs.LG] for this version)

Submission history

From: Martin Jaggi [view email]
[v1] Tue, 7 Mar 2017 18:36:55 GMT (396kb)

Link back to: arXiv, form interface, contact.