High-Dimensional Inference over Networks: Linear Convergence and Statistical Guarantees

Sun, Ying; Maros, Marie; Scutari, Gesualdo; Cheng, Guang

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2201

Computer Science > Machine Learning

Title: High-Dimensional Inference over Networks: Linear Convergence and Statistical Guarantees

Authors: Ying Sun, Marie Maros, Gesualdo Scutari, Guang Cheng

(Submitted on 21 Jan 2022)

Abstract: We study sparse linear regression over a network of agents, modeled as an undirected graph and no server node. The estimation of the $s$-sparse parameter is formulated as a constrained LASSO problem wherein each agent owns a subset of the $N$ total observations. We analyze the convergence rate and statistical guarantees of a distributed projected gradient tracking-based algorithm under high-dimensional scaling, allowing the ambient dimension $d$ to grow with (and possibly exceed) the sample size $N$. Our theory shows that, under standard notions of restricted strong convexity and smoothness of the loss functions, suitable conditions on the network connectivity and algorithm tuning, the distributed algorithm converges globally at a {\it linear} rate to an estimate that is within the centralized {\it statistical precision} of the model, $O(s\log d/N)$. When $s\log d/N=o(1)$, a condition necessary for statistical consistency, an $\varepsilon$-optimal solution is attained after $\mathcal{O}(\kappa \log (1/\varepsilon))$ gradient computations and $O (\kappa/(1-\rho) \log (1/\varepsilon))$ communication rounds, where $\kappa$ is the restricted condition number of the loss function and $\rho$ measures the network connectivity. The computation cost matches that of the centralized projected gradient algorithm despite having data distributed; whereas the communication rounds reduce as the network connectivity improves. Overall, our study reveals interesting connections between statistical efficiency, network connectivity \& topology, and convergence rate in high dimensions.

Comments:	50 pages, 7 figures
Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Statistics Theory (math.ST)
Cite as:	arXiv:2201.08507 [cs.LG]
	(or arXiv:2201.08507v1 [cs.LG] for this version)

Submission history

From: Gesualdo Scutari [view email]
[v1] Fri, 21 Jan 2022 01:26:08 GMT (969kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2201.08507v1

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: High-Dimensional Inference over Networks: Linear Convergence and Statistical Guarantees

Submission history