We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.ST

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Statistics Theory

Title: Bayesian inference for spectral projectors of the covariance matrix

Abstract: Let $X_1, \ldots, X_n$ be i.i.d. sample in $\mathbb{R}^p$ with zero mean and the covariance matrix $\mathbf{\Sigma^*}$. The classical PCA approach recovers the projector $\mathbf{P^*_{\mathcal{J}}}$ onto the principal eigenspace of $\mathbf{\Sigma^*}$ by its empirical counterpart $\mathbf{\widehat{P}_{\mathcal{J}}}$. Recent paper [Koltchinskii, Lounici (2017)] investigated the asymptotic distribution of the Frobenius distance between the projectors $\| \mathbf{\widehat{P}_{\mathcal{J}}} - \mathbf{P^*_{\mathcal{J}}} \|_2$, while [Naumov et al. (2017)] offered a bootstrap procedure to measure uncertainty in recovering this subspace $\mathbf{P^*_{\mathcal{J}}}$ even in a finite sample setup. The present paper considers this problem from a Bayesian perspective and suggests to use the credible sets of the pseudo-posterior distribution on the space of covariance matrices induced by the conjugated Inverse Wishart prior as sharp confidence sets. This yields a numerically efficient procedure. Moreover, we theoretically justify this method and derive finite sample bounds on the corresponding coverage probability. Contrary to [Koltchinskii, Lounici (2017), Naumov et al. (2017)], the obtained results are valid for non-Gaussian data: the main assumption that we impose is the concentration of the sample covariance $\mathbf{\widehat{\Sigma}}$ in a vicinity of $\mathbf{\Sigma^*}$. Numerical simulations illustrate good performance of the proposed procedure even on non-Gaussian data in a rather challenging regime.
Comments: 40 pages, 2 figures, accepted version
Subjects: Statistics Theory (math.ST)
MSC classes: 62F15, 62H25, 62G20 (primary), 62F25 (secondary)
Journal reference: Electronic Journal of Statistics, Vol. 12 (2018), 1948--1987
DOI: 10.1214/18-EJS1451
Cite as: arXiv:1711.11532 [math.ST]
  (or arXiv:1711.11532v3 [math.ST] for this version)

Submission history

From: Igor Silin [view email]
[v1] Thu, 30 Nov 2017 17:42:10 GMT (170kb,D)
[v2] Sun, 10 Dec 2017 09:20:17 GMT (170kb,D)
[v3] Wed, 26 Jun 2019 21:23:07 GMT (472kb,D)

Link back to: arXiv, form interface, contact.