Sub-sampled Cubic Regularization for Non-convex Optimization

Kohler, Jonas Moritz; Lucchi, Aurelien

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1705

Computer Science > Machine Learning

Title: Sub-sampled Cubic Regularization for Non-convex Optimization

Authors: Jonas Moritz Kohler, Aurelien Lucchi

(Submitted on 16 May 2017 (v1), last revised 1 Jul 2017 (this version, v3))

Abstract: We consider the minimization of non-convex functions that typically arise in machine learning. Specifically, we focus our attention on a variant of trust region methods known as cubic regularization. This approach is particularly attractive because it escapes strict saddle points and it provides stronger convergence guarantees than first- and second-order as well as classical trust region methods. However, it suffers from a high computational complexity that makes it impractical for large-scale learning. Here, we propose a novel method that uses sub-sampling to lower this computational cost. By the use of concentration inequalities we provide a sampling scheme that gives sufficiently accurate gradient and Hessian approximations to retain the strong global and local convergence guarantees of cubically regularized methods. To the best of our knowledge this is the first work that gives global convergence guarantees for a sub-sampled variant of cubic regularization on non-convex functions. Furthermore, we provide experimental results supporting our theory.

Comments:	Proceedings of the 34th International Conference on Machine Learning
Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:1705.05933 [cs.LG]
	(or arXiv:1705.05933v3 [cs.LG] for this version)

Submission history

From: Aurelien Lucchi [view email]
[v1] Tue, 16 May 2017 21:44:44 GMT (1031kb,D)
[v2] Fri, 23 Jun 2017 11:49:57 GMT (3667kb,D)
[v3] Sat, 1 Jul 2017 11:17:59 GMT (3667kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1705.05933

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Sub-sampled Cubic Regularization for Non-convex Optimization

Submission history