Randomized Block Krylov Methods for Stronger and Faster Approximate Singular Value Decomposition

Musco, Cameron; Musco, Christopher

(Submitted on 21 Apr 2015 (v1), last revised 30 Oct 2015 (this version, v4))

Abstract: Since being analyzed by Rokhlin, Szlam, and Tygert and popularized by Halko, Martinsson, and Tropp, randomized Simultaneous Power Iteration has become the method of choice for approximate singular value decomposition. It is more accurate than simpler sketching algorithms, yet still converges quickly for any matrix, independently of singular value gaps. After $\tilde{O}(1/\epsilon)$ iterations, it gives a low-rank approximation within $(1+\epsilon)$ of optimal for spectral norm error.
We give the first provable runtime improvement on Simultaneous Iteration: a simple randomized block Krylov method, closely related to the classic Block Lanczos algorithm, gives the same guarantees in just $\tilde{O}(1/\sqrt{\epsilon})$ iterations and performs substantially better experimentally. Although Krylov subspace methods have a long history, our analysis is the first that does not depend on singular value gaps, which are unreliable in practice.
Furthermore, while it is a simple accuracy benchmark, even $(1+\epsilon)$ error for spectral norm low-rank approximation does not imply that an algorithm returns high quality principal components, a major issue for data applications. We address this problem for the first time by showing that both Block Krylov Iteration and a minor modification of Simultaneous Iteration give nearly optimal PCA for any matrix. This result further justifies their strength over non-iterative sketching methods.
Finally, we give insight beyond the worst case, justifying why both algorithms can run much faster in practice than predicted. We clarify how simple techniques can take advantage of common matrix properties to significantly improve runtime.
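
The abstract does not spell out the algorithm, so the following is only a minimal NumPy sketch of one common way to realize a randomized block Krylov iteration for rank-$k$ approximation. The function name `block_krylov_svd`, the parameter `q` (number of block power steps), the Gaussian starting block, and the per-block re-orthonormalization are illustrative assumptions, not details taken from this page.

```python
import numpy as np


def block_krylov_svd(A, k, q, seed=0):
    """Sketch: rank-k approximate SVD of A from a randomized block Krylov subspace.

    Assumed (hypothetical) parameters: k is the target rank, q the number of
    block power steps, seed the random seed for the Gaussian starting block.
    Returns (Z, s, Vt): approximate top-k left singular vectors, singular
    values, and right singular vectors.
    """
    rng = np.random.default_rng(seed)
    n, d = A.shape

    # Random Gaussian starting block with k columns, mapped through A.
    block = A @ rng.standard_normal((d, k))

    blocks = []
    for _ in range(q + 1):
        # Orthonormalize each block for numerical stability. In exact
        # arithmetic this leaves the span of every block unchanged.
        block, _ = np.linalg.qr(block)
        blocks.append(block)
        # Next block: apply (A A^T) once more.
        block = A @ (A.T @ block)

    # Krylov subspace spanned by [A Pi, (A A^T) A Pi, ..., (A A^T)^q A Pi].
    K = np.hstack(blocks)
    Q, _ = np.linalg.qr(K)

    # Rayleigh-Ritz step: best rank-k approximation of A within span(Q),
    # computed from the SVD of the small projected matrix Q^T A.
    U_hat, s, Vt = np.linalg.svd(Q.T @ A, full_matrices=False)
    Z = Q @ U_hat[:, :k]
    return Z, s[:k], Vt[:k]
```

Under these assumptions, `Z @ (Z.T @ A)` is the rank-$k$ approximation whose spectral norm error would be compared against the optimal value $\sigma_{k+1}$. Dropping all blocks except the last one from `K` gives a Simultaneous Iteration style method, which is the baseline the abstract compares against.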

Comments:	Neural Information Processing Systems 2015
Subjects:	Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Numerical Analysis (math.NA)
Cite as:	arXiv:1504.05477 [cs.DS]
	(or arXiv:1504.05477v4 [cs.DS] for this version)

Submission history

From: Christopher Musco
[v1] Tue, 21 Apr 2015 15:48:44 GMT (34kb)
[v2] Sat, 6 Jun 2015 23:43:50 GMT (50kb)
[v3] Wed, 1 Jul 2015 03:55:11 GMT (50kb)
[v4] Fri, 30 Oct 2015 19:35:08 GMT (54kb)