Current browse context:
cs.DC
Change to browse by:
References & Citations
Computer Science > Distributed, Parallel, and Cluster Computing
Title: Randomized algorithms for distributed computation of principal component analysis and singular value decomposition
(Submitted on 27 Dec 2016 (v1), revised 31 Dec 2016 (this version, v2), latest version 1 Jan 2018 (v4))
Abstract: As illustrated via numerical experiments with an implementation in Spark (the popular platform for distributed computation), randomized algorithms solve two ubiquitous problems: (1) calculating a full principal component analysis or singular value decomposition of a highly rectangular matrix, and (2) calculating a low-rank approximation in the form of a singular value decomposition to an arbitrary matrix. Several optimizations to recently introduced methods yield results that are uniformly superior to those of the stock implementations.
Submission history
From: Mark Tygert [view email][v1] Tue, 27 Dec 2016 19:06:13 GMT (13kb)
[v2] Sat, 31 Dec 2016 22:06:19 GMT (13kb)
[v3] Wed, 31 May 2017 23:04:43 GMT (29kb)
[v4] Mon, 1 Jan 2018 20:24:15 GMT (41kb,D)
Link back to: arXiv, form interface, contact.