Tensor cumulants for statistical inference on invariant distributions

Kunisky, Dmitriy; Moore, Cristopher; Wein, Alexander S.

Full-text links:

Download:

Current browse context:

math.ST

< prev | next >

new | recent | 2404

Mathematics > Statistics Theory

Title: Tensor cumulants for statistical inference on invariant distributions

Authors: Dmitriy Kunisky, Cristopher Moore, Alexander S. Wein

(Submitted on 29 Apr 2024)

Abstract: Many problems in high-dimensional statistics appear to have a statistical-computational gap: a range of values of the signal-to-noise ratio where inference is information-theoretically possible, but (conjecturally) computationally intractable. A canonical such problem is Tensor PCA, where we observe a tensor $Y$ consisting of a rank-one signal plus Gaussian noise. Multiple lines of work suggest that Tensor PCA becomes computationally hard at a critical value of the signal's magnitude. In particular, below this transition, no low-degree polynomial algorithm can detect the signal with high probability; conversely, various spectral algorithms are known to succeed above this transition. We unify and extend this work by considering tensor networks, orthogonally invariant polynomials where multiple copies of $Y$ are "contracted" to produce scalars, vectors, matrices, or other tensors. We define a new set of objects, tensor cumulants, which provide an explicit, near-orthogonal basis for invariant polynomials of a given degree. This basis lets us unify and strengthen previous results on low-degree hardness, giving a combinatorial explanation of the hardness transition and of a continuum of subexponential-time algorithms that work below it, and proving tight lower bounds against low-degree polynomials for recovering rather than just detecting the signal. It also lets us analyze a new problem of distinguishing between different tensor ensembles, such as Wigner and Wishart tensors, establishing a sharp computational threshold and giving evidence of a new statistical-computational gap in the Central Limit Theorem for random tensors. Finally, we believe these cumulants are valuable mathematical objects in their own right: they generalize the free cumulants of free probability theory from matrices to tensors, and share many of their properties, including additivity under additive free convolution.

Comments:	72 pages, 12 figures
Subjects:	Statistics Theory (math.ST); Computational Complexity (cs.CC); Data Structures and Algorithms (cs.DS); Probability (math.PR); Machine Learning (stat.ML)
Cite as:	arXiv:2404.18735 [math.ST]
	(or arXiv:2404.18735v1 [math.ST] for this version)

Submission history

From: Dmitriy Kunisky [view email]
[v1] Mon, 29 Apr 2024 14:33:24 GMT (588kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> math > arXiv:2404.18735

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Mathematics > Statistics Theory

Title: Tensor cumulants for statistical inference on invariant distributions

Submission history