A Non-parametric View of FedAvg and FedProx: Beyond Stationary Points

Su, Lili; Xu, Jiaming; Yang, Pengkun

Full-text links:

Download:

Current browse context:

cs.DC

< prev | next >

new | recent | 2106

Statistics > Machine Learning

Title: A Non-parametric View of FedAvg and FedProx: Beyond Stationary Points

Authors: Lili Su, Jiaming Xu, Pengkun Yang

(Submitted on 29 Jun 2021 (v1), last revised 15 Feb 2022 (this version, v2))

Abstract: Federated Learning (FL) is a promising decentralized learning framework and has great potentials in privacy preservation and in lowering the computation load at the cloud. Recent work showed that FedAvg and FedProx - the two widely-adopted FL algorithms - fail to reach the stationary points of the global optimization objective even for homogeneous linear regression problems. Further, it is concerned that the common model learned might not generalize well locally at all in the presence of heterogeneity.
In this paper, we analyze the convergence and statistical efficiency of FedAvg and FedProx, addressing the above two concerns. Our analysis is based on the standard non-parametric regression in a reproducing kernel Hilbert space (RKHS), and allows for heterogeneous local data distributions and unbalanced local datasets. We prove that the estimation errors, measured in either the empirical norm or the RKHS norm, decay with a rate of 1/t in general and exponentially for finite-rank kernels. In certain heterogeneous settings, these upper bounds also imply that both FedAvg and FedProx achieve the optimal error rate. To further analytically quantify the impact of the heterogeneity at each client, we propose and characterize a novel notion-federation gain, defined as the reduction of the estimation error for a client to join the FL. We discover that when the data heterogeneity is moderate, a client with limited local data can benefit from a common model with a large federation gain. Numerical experiments further corroborate our theoretical findings.

Subjects:	Machine Learning (stat.ML); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:2106.15216 [stat.ML]
	(or arXiv:2106.15216v2 [stat.ML] for this version)

Submission history

From: Pengkun Yang [view email]
[v1] Tue, 29 Jun 2021 09:59:43 GMT (814kb,D)
[v2] Tue, 15 Feb 2022 05:49:09 GMT (141kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:2106.15216

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Statistics > Machine Learning

Title: A Non-parametric View of FedAvg and FedProx: Beyond Stationary Points

Submission history