Optimal Sketching for Trace Estimation

Jiang, Shuli; Pham, Hai; Woodruff, David P.; Qiuyi; Zhang

Full-text links:

Download:

Computer Science > Data Structures and Algorithms

Title: Optimal Sketching for Trace Estimation

Authors: Shuli Jiang, Hai Pham, David P. Woodruff, Qiuyi (Richard) Zhang

(Submitted on 1 Nov 2021)

Abstract: Matrix trace estimation is ubiquitous in machine learning applications and has traditionally relied on Hutchinson's method, which requires $O(\log(1/\delta)/\epsilon^2)$ matrix-vector product queries to achieve a $(1 \pm \epsilon)$-multiplicative approximation to $\text{tr}(A)$ with failure probability $\delta$ on positive-semidefinite input matrices $A$. Recently, the Hutch++ algorithm was proposed, which reduces the number of matrix-vector queries from $O(1/\epsilon^2)$ to the optimal $O(1/\epsilon)$, and the algorithm succeeds with constant probability. However, in the high probability setting, the non-adaptive Hutch++ algorithm suffers an extra $O(\sqrt{\log(1/\delta)})$ multiplicative factor in its query complexity. Non-adaptive methods are important, as they correspond to sketching algorithms, which are mergeable, highly parallelizable, and provide low-memory streaming algorithms as well as low-communication distributed protocols. In this work, we close the gap between non-adaptive and adaptive algorithms, showing that even non-adaptive algorithms can achieve $O(\sqrt{\log(1/\delta)}/\epsilon + \log(1/\delta))$ matrix-vector products. In addition, we prove matching lower bounds demonstrating that, up to a $\log \log(1/\delta)$ factor, no further improvement in the dependence on $\delta$ or $\epsilon$ is possible by any non-adaptive algorithm. Finally, our experiments demonstrate the superior performance of our sketch over the adaptive Hutch++ algorithm, which is less parallelizable, as well as over the non-adaptive Hutchinson's method.

Comments:	31 pages, 5 figures. Proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia
Subjects:	Data Structures and Algorithms (cs.DS); Numerical Analysis (math.NA)
Cite as:	arXiv:2111.00664 [cs.DS]
	(or arXiv:2111.00664v1 [cs.DS] for this version)

Submission history

From: Shuli Jiang [view email]
[v1] Mon, 1 Nov 2021 02:36:18 GMT (398kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2111.00664

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Data Structures and Algorithms

Title: Optimal Sketching for Trace Estimation

Submission history