We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.ST

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Mathematics > Statistics Theory

Title: Near optimal sample complexity for matrix and tensor normal models via geodesic convexity

Abstract: The matrix normal model, the family of Gaussian matrix-variate distributions whose covariance matrix is the Kronecker product of two lower dimensional factors, is frequently used to model matrix-variate data. The tensor normal model generalizes this family to Kronecker products of three or more factors. We study the estimation of the Kronecker factors of the covariance matrix in the matrix and tensor models. We show nonasymptotic bounds for the error achieved by the maximum likelihood estimator (MLE) in several natural metrics. In contrast to existing bounds, our results do not rely on the factors being well-conditioned or sparse. For the matrix normal model, all our bounds are minimax optimal up to logarithmic factors, and for the tensor normal model our bound for the largest factor and overall covariance matrix are minimax optimal up to constant factors provided there are enough samples for any estimator to obtain constant Frobenius error. In the same regimes as our sample complexity bounds, we show that an iterative procedure to compute the MLE known as the flip-flop algorithm converges linearly with high probability. Our main tool is geodesic strong convexity in the geometry on positive-definite matrices induced by the Fisher information metric. This strong convexity is determined by the expansion of certain random quantum channels. We also provide numerical evidence that combining the flip-flop algorithm with a simple shrinkage estimator can improve performance in the undersampled regime.
Comments: Measured computation time on more instances
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Quantum Physics (quant-ph)
MSC classes: Primary: 62F12, Secondary: 62F30
Cite as: arXiv:2110.07583 [math.ST]
  (or arXiv:2110.07583v2 [math.ST] for this version)

Submission history

From: Cole Franks [view email]
[v1] Thu, 14 Oct 2021 17:47:00 GMT (1229kb,D)
[v2] Thu, 11 Nov 2021 22:22:10 GMT (1226kb,D)

Link back to: arXiv, form interface, contact.