Statistics > Machine Learning
Title: Communication-efficient distributed statistical learning
(Submitted on 25 May 2016 (v1), revised 3 Nov 2016 (this version, v2), latest version 6 Nov 2016 (v3))
Abstract: We present the Communication-efficient Surrogate Likelihood (CSL) framework for solving distributed statistical learning problems. CSL provides a communication-efficient surrogate to the global likelihood that can be used for low-dimensional estimation, high-dimensional regularized estimation and Bayesian inference. For low-dimensional estimation, CSL provably improves upon the averaging schemes and facilitates the construction of confidence intervals. For high-dimensional regularized estimation, CSL leads to a minimax optimal estimator with minimal communication cost. For Bayesian inference, CSL can be used to form a communication-efficient quasi-posterior distribution that converges to the true posterior. This quasi-posterior procedure significantly improves the computational efficiency of MCMC algorithms even in a non-distributed setting. The methods are illustrated through empirical studies.
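The abstract does not spell out the surrogate itself. As a rough illustration only, the sketch below assumes a gradient-corrected surrogate of the form L_1(theta) - <grad L_1(theta_bar) - grad L_N(theta_bar), theta>, where L_1 is the loss on the first machine, L_N is the average loss over all machines, and theta_bar is an initial estimate. It is specialized to least squares, where the surrogate minimizer has a closed form. The names `local_gradient`, `csl_one_step`, and `demo`, the equal shard sizes, and the synthetic data are all assumptions made for this example, not the authors' implementation.

```python
import numpy as np


def local_gradient(X, y, theta):
    """Gradient of the average squared-error loss 1/(2n) * ||y - X theta||^2."""
    n = X.shape[0]
    return X.T @ (X @ theta - y) / n


def csl_one_step(shards, theta_bar):
    """One round of surrogate minimization for least squares (illustrative).

    shards: list of (X_k, y_k) pairs of equal size; shards[0] plays the
    role of the first machine. Each machine communicates a single
    d-dimensional vector: its local gradient evaluated at theta_bar.
    """
    # With equal shard sizes, the mean of local gradients is the global gradient.
    global_grad = np.mean(
        [local_gradient(X, y, theta_bar) for X, y in shards], axis=0
    )
    X1, _ = shards[0]
    H1 = X1.T @ X1 / X1.shape[0]  # local Hessian on the first machine
    # Setting the gradient of the assumed surrogate to zero gives
    # H1 (theta - theta_bar) + global_grad = 0 for a quadratic loss.
    return theta_bar - np.linalg.solve(H1, global_grad)


def demo(seed=0, machines=8, n=500, d=5):
    """Compare local, one-step surrogate, and pooled least-squares estimates."""
    rng = np.random.default_rng(seed)
    theta_true = rng.normal(size=d)
    shards = []
    for _ in range(machines):
        X = rng.normal(size=(n, d))
        y = X @ theta_true + 0.1 * rng.normal(size=n)
        shards.append((X, y))
    X1, y1 = shards[0]
    # Initial estimate: ordinary least squares on the first machine only.
    theta_bar = np.linalg.lstsq(X1, y1, rcond=None)[0]
    theta_csl = csl_one_step(shards, theta_bar)
    # Pooled estimate, for reference; needs all data in one place.
    X_all = np.vstack([X for X, _ in shards])
    y_all = np.concatenate([y for _, y in shards])
    theta_global = np.linalg.lstsq(X_all, y_all, rcond=None)[0]
    return theta_true, theta_bar, theta_csl, theta_global
```

Calling `demo()` returns the true parameter together with the three estimates, so the distance of each estimate from the pooled solution can be compared directly. In this sketch, only gradients cross machine boundaries; the raw data stay local.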
Submission history
From: Jason Lee
[v1] Wed, 25 May 2016 00:12:06 GMT (246kb)
[v2] Thu, 3 Nov 2016 05:31:41 GMT (248kb)
[v3] Sun, 6 Nov 2016 00:37:39 GMT (248kb)