We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Statistics Theory

Title: Subspace Perspective on Canonical Correlation Analysis: Dimension Reduction and Minimax Rates

Abstract: Canonical correlation analysis (CCA) is a fundamental statistical tool for exploring the correlation structure between two sets of random variables. In this paper, motivated by recent success of applying CCA to learn low dimensional representations of high dimensional objects, we propose to quantify the estimation loss of CCA by the excess prediction loss defined through a prediction-after-dimension-reduction framework. Such framework suggests viewing CCA estimation as estimating the subspaces spanned by the canonical variates. Interestedly, the proposed error metrics derived from the excess prediction loss turn out to be closely related to the principal angles between the subspaces spanned by the population and sample canonical variates respectively.
We characterize the non-asymptotic minimax rates under the proposed metrics, especially the dependency of the minimax rates on the key quantities including the dimensions, the condition number of the covariance matrices, the canonical correlations and the eigen-gap, with minimal assumptions on the joint covariance matrix. To the best of our knowledge, this is the first finite sample result that captures the effect of the canonical correlations on the minimax rates.
Subjects: Statistics Theory (math.ST); Machine Learning (stat.ML)
Cite as: arXiv:1605.03662 [math.ST]
  (or arXiv:1605.03662v2 [math.ST] for this version)

Submission history

From: Zhuang Ma [view email]
[v1] Thu, 12 May 2016 03:09:28 GMT (53kb)
[v2] Sun, 21 Jan 2018 03:53:44 GMT (72kb)

Link back to: arXiv, form interface, contact.