We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.OC

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Optimization and Control

Title: Convergence Rates for Deterministic and Stochastic Subgradient Methods Without Lipschitz Continuity

Abstract: We extend the classic convergence rate theory for subgradient methods to apply to non-Lipschitz functions. For the deterministic projected subgradient method, we present a global $O(1/\sqrt{T})$ convergence rate for any convex function which is locally Lipschitz around its minimizers. This approach is based on Shor's classic subgradient analysis and implies generalizations of the standard convergence rates for gradient descent on functions with Lipschitz or H\"older continuous gradients. Further, we show a $O(1/\sqrt{T})$ convergence rate for the stochastic projected subgradient method on convex functions with at most quadratic growth, which improves to $O(1/T)$ under either strong convexity or a weaker quadratic lower bound condition.
Comments: Update 2/26/18: Major revision improving the convergence results to no longer need an exponential upper bound on function growth in the convex case. Now local Lipschitz continuity around a minimizer suffices for a global convergence rate. Update 12/21/17: Added three more references on weakening strong convexity and minorly changed some wording. 16 pages
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
MSC classes: 65K05, 65K10, 90C25, 90C15, 90C30
Cite as: arXiv:1712.04104 [math.OC]
  (or arXiv:1712.04104v3 [math.OC] for this version)

Submission history

From: Benjamin Grimmer [view email]
[v1] Tue, 12 Dec 2017 02:51:59 GMT (17kb)
[v2] Thu, 21 Dec 2017 17:23:39 GMT (18kb)
[v3] Mon, 26 Feb 2018 22:27:18 GMT (15kb)

Link back to: arXiv, form interface, contact.