We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Statistics Theory

Title: Asymptotic Analysis of Conditioned Stochastic Gradient Descent

Abstract: In this paper, we investigate a general class of stochastic gradient descent (SGD) algorithms, called Conditioned SGD, based on a preconditioning of the gradient direction. Using a discrete-time approach with martingale tools, we establish under mild assumptions the weak convergence of the rescaled sequence of iterates for a broad class of conditioning matrices including stochastic first-order and second-order methods. Almost sure convergence results, which may be of independent interest, are also presented. Interestingly, the asymptotic normality result consists in a stochastic equicontinuity property so when the conditioning matrix is an estimate of the inverse Hessian, the algorithm is asymptotically optimal.
Comments: Accepted to Transactions on Machine Learning Research 2023
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG)
MSC classes: 62L20 (Primary) 60F05, 60G46, 68W40 (Secondary)
Journal reference: Transactions on Machine Learning Research (2023)
Cite as: arXiv:2006.02745 [math.ST]
  (or arXiv:2006.02745v5 [math.ST] for this version)

Submission history

From: Remi Leluc [view email]
[v1] Thu, 4 Jun 2020 10:08:05 GMT (364kb,D)
[v2] Thu, 1 Oct 2020 08:53:50 GMT (360kb,D)
[v3] Fri, 9 Jul 2021 14:43:08 GMT (364kb,D)
[v4] Thu, 20 Oct 2022 14:36:38 GMT (724kb,D)
[v5] Sun, 15 Oct 2023 13:23:07 GMT (243kb,D)

Link back to: arXiv, form interface, contact.