We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.OC

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Optimization and Control

Title: Stochastic Gradient Methods with Preconditioned Updates

Abstract: This work considers the non-convex finite sum minimization problem. There are several algorithms for such problems, but existing methods often work poorly when the problem is badly scaled and/or ill-conditioned, and a primary goal of this work is to introduce methods that alleviate this issue. Thus, here we include a preconditioner based on Hutchinson's approach to approximating the diagonal of the Hessian, and couple it with several gradient-based methods to give new scaled algorithms: Scaled SARAH and Scaled L-SVRG. Theoretical complexity guarantees under smoothness assumptions are presented. We prove linear convergence when both smoothness and the PL condition are assumed. Our adaptively scaled methods use approximate partial second-order curvature information and, therefore, can better mitigate the impact of badly scaled problems. This improved practical performance is demonstrated in the numerical experiments also presented in this work.
Comments: 40 pages, 2 new algorithms, 20 figures, 4 tables
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
DOI: 10.1007/s10957-023-02365-3
Cite as: arXiv:2206.00285 [math.OC]
  (or arXiv:2206.00285v2 [math.OC] for this version)

Submission history

From: Aleksandr Beznosikov [view email]
[v1] Wed, 1 Jun 2022 07:38:08 GMT (38062kb,D)
[v2] Sun, 14 Jan 2024 17:18:32 GMT (21390kb,D)

Link back to: arXiv, form interface, contact.