Adaptive Gradient Sparsification for Efficient Federated Learning: An Online Learning Approach

Han, Pengchao; Wang, Shiqiang; Leung, Kin K.

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2001

Computer Science > Machine Learning

Title: Adaptive Gradient Sparsification for Efficient Federated Learning: An Online Learning Approach

Authors: Pengchao Han, Shiqiang Wang, Kin K. Leung

(Submitted on 14 Jan 2020 (v1), last revised 20 Mar 2020 (this version, v3))

Abstract: Federated learning (FL) is an emerging technique for training machine learning models using geographically dispersed data collected by local entities. It includes local computation and synchronization steps. To reduce the communication overhead and improve the overall efficiency of FL, gradient sparsification (GS) can be applied, where instead of the full gradient, only a small subset of important elements of the gradient is communicated. Existing work on GS uses a fixed degree of gradient sparsity for i.i.d.-distributed data within a datacenter. In this paper, we consider adaptive degree of sparsity and non-i.i.d. local datasets. We first present a fairness-aware GS method which ensures that different clients provide a similar amount of updates. Then, with the goal of minimizing the overall training time, we propose a novel online learning formulation and algorithm for automatically determining the near-optimal communication and computation trade-off that is controlled by the degree of gradient sparsity. The online learning algorithm uses an estimated sign of the derivative of the objective function, which gives a regret bound that is asymptotically equal to the case where exact derivative is available. Experiments with real datasets confirm the benefits of our proposed approaches, showing up to $40\%$ improvement in model accuracy for a finite training time.

Comments:	Accepted at IEEE ICDCS 2020
Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2001.04756 [cs.LG]
	(or arXiv:2001.04756v3 [cs.LG] for this version)

Submission history

From: Shiqiang Wang [view email]
[v1] Tue, 14 Jan 2020 13:09:23 GMT (1412kb,D)
[v2] Thu, 16 Jan 2020 17:56:09 GMT (1413kb,D)
[v3] Fri, 20 Mar 2020 16:34:48 GMT (1413kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2001.04756v3

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Adaptive Gradient Sparsification for Efficient Federated Learning: An Online Learning Approach

Submission history