We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Ancillary-file links:

Ancillary files (details):

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: On the number of principal components in high dimensions

Abstract: We consider the problem of how many components to retain in the application of principal component analysis when the dimension is much higher than the number of observations. To estimate the number of components, we propose to sequentially test skewness of the squared lengths of residual scores that are obtained by removing leading principal components. The residual lengths are asymptotically left-skewed if all principal components with diverging variances are removed, and right-skewed if not. The proposed estimator is shown to be consistent, performs well in high-dimensional simulation studies, and provides reasonable estimates in a number of real data examples.
Subjects: Methodology (stat.ME)
Journal reference: Biometrika 105 (2018) 389-402
DOI: 10.1093/biomet/asy010
Cite as: arXiv:1708.04981 [stat.ME]
  (or arXiv:1708.04981v1 [stat.ME] for this version)

Submission history

From: Sungkyu Jung [view email]
[v1] Wed, 16 Aug 2017 17:13:13 GMT (468kb,A)

Link back to: arXiv, form interface, contact.