We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Randomized Block-Diagonal Preconditioning for Parallel Learning

Abstract: We study preconditioned gradient-based optimization methods where the preconditioning matrix has block-diagonal form. Such a structural constraint comes with the advantage that the update computation is block-separable and can be parallelized across multiple independent tasks. Our main contribution is to demonstrate that the convergence of these methods can significantly be improved by a randomization technique which corresponds to repartitioning coordinates across tasks during the optimization procedure. We provide a theoretical analysis that accurately characterizes the expected convergence gains of repartitioning and validate our findings empirically on various traditional machine learning tasks. From an implementation perspective, block-separable models are well suited for parallelization and, when shared memory is available, randomization can be implemented on top of existing methods very efficiently to improve convergence.
Comments: improvement in Theorem 3 compared to ICML 2020 version
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
Journal reference: PMLR 119:6841-6851 (2020)
Cite as: arXiv:2006.13591 [cs.LG]
  (or arXiv:2006.13591v2 [cs.LG] for this version)

Submission history

From: Celestine Mendler-Dünner [view email]
[v1] Wed, 24 Jun 2020 10:12:36 GMT (1850kb,D)
[v2] Mon, 7 Dec 2020 09:33:02 GMT (1713kb,D)

Link back to: arXiv, form interface, contact.