Eigenvectors from Eigenvalues Sparse Principal Component Analysis (EESPCA)

Frost, H. Robert

doi:10.1080/10618600.2021.1987254

Full-text links:

Download:

Current browse context:

stat.ME

< prev | next >

new | recent | 2006

Statistics > Methodology

Title: Eigenvectors from Eigenvalues Sparse Principal Component Analysis (EESPCA)

Authors: H. Robert Frost

(Submitted on 2 Jun 2020 (v1), last revised 23 Sep 2021 (this version, v3))

Abstract: We present a novel technique for sparse principal component analysis. This method, named Eigenvectors from Eigenvalues Sparse Principal Component Analysis (EESPCA), is based on the formula for computing squared eigenvector loadings of a Hermitian matrix from the eigenvalues of the full matrix and associated sub-matrices. We explore two versions of the EESPCA method: a version that uses a fixed threshold for inducing sparsity and a version that selects the threshold via cross-validation. Relative to the state-of-the-art sparse PCA methods of Witten et al., Yuan & Zhang and Tan et al., the fixed threshold EESPCA technique offers an order-of-magnitude improvement in computational speed, does not require estimation of tuning parameters via cross-validation, and can more accurately identify true zero principal component loadings across a range of data matrix sizes and covariance structures. Importantly, the EESPCA method achieves these benefits while maintaining out-of-sample reconstruction error and PC estimation error close to the lowest error generated by all evaluated approaches. EESPCA is a practical and effective technique for sparse PCA with particular relevance to computationally demanding statistical problems such as the analysis of high-dimensional data sets or application of statistical techniques like resampling that involve the repeated calculation of sparse PCs.

Subjects:	Methodology (stat.ME); Quantitative Methods (q-bio.QM)
DOI:	10.1080/10618600.2021.1987254
Cite as:	arXiv:2006.01924 [stat.ME]
	(or arXiv:2006.01924v3 [stat.ME] for this version)

Submission history

From: H Frost [view email]
[v1] Tue, 2 Jun 2020 20:14:55 GMT (75kb,D)
[v2] Mon, 28 Dec 2020 15:42:40 GMT (992kb,D)
[v3] Thu, 23 Sep 2021 21:31:05 GMT (4321kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:2006.01924

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Methodology

Title: Eigenvectors from Eigenvalues Sparse Principal Component Analysis (EESPCA)

Submission history