Versatile Single-Loop Method for Gradient Estimator: First and Second Order Optimality, and its Application to Federated Learning

Oko, Kazusato; Akiyama, Shunta; Murata, Tomoya; Suzuki, Taiji

Full-text links:

Download:

Current browse context:

math.OC

< prev | next >

new | recent | 2209

Computer Science > Machine Learning

Title: Versatile Single-Loop Method for Gradient Estimator: First and Second Order Optimality, and its Application to Federated Learning

Authors: Kazusato Oko, Shunta Akiyama, Tomoya Murata, Taiji Suzuki

(Submitted on 1 Sep 2022 (v1), last revised 4 Oct 2022 (this version, v2))

Abstract: While variance reduction methods have shown great success in solving large scale optimization problems, many of them suffer from accumulated errors and, therefore, should periodically require the full gradient computation. In this paper, we present a single-loop algorithm named SLEDGE (Single-Loop mEthoD for Gradient Estimator) for finite-sum nonconvex optimization, which does not require periodic refresh of the gradient estimator but achieves nearly optimal gradient complexity. Unlike existing methods, SLEDGE has the advantage of versatility; (i) second-order optimality, (ii) exponential convergence in the PL region, and (iii) smaller complexity under less heterogeneity of data.
We build an efficient federated learning algorithm by exploiting these favorable properties. We show the first and second-order optimality of the output and also provide analysis under PL conditions. When the local budget is sufficiently large and clients are less (Hessian-)~heterogeneous, the algorithm requires fewer communication rounds then existing methods such as FedAvg, SCAFFOLD, and Mime. The superiority of our method is verified in numerical experiments.

Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2209.00361 [cs.LG]
	(or arXiv:2209.00361v2 [cs.LG] for this version)

Submission history

From: Kazusato Oko [view email]
[v1] Thu, 1 Sep 2022 11:05:26 GMT (6040kb,D)
[v2] Tue, 4 Oct 2022 08:04:10 GMT (6090kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2209.00361

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Computer Science > Machine Learning

Title: Versatile Single-Loop Method for Gradient Estimator: First and Second Order Optimality, and its Application to Federated Learning

Submission history