Current browse context:
math.ST
Change to browse by:
References & Citations
Mathematics > Statistics Theory
Title: On Principal Component Regression in a High-Dimensional Error-in-Variables Setting
(Submitted on 27 Oct 2020 (this version), latest version 25 Aug 2023 (v5))
Abstract: We analyze the classical method of Principal Component Regression (PCR) in the high-dimensional error-in-variables setting. Here, the observed covariates are not only noisy and contain missing data, but the number of covariates can also exceed the sample size. Under suitable conditions, we establish that PCR identifies the unique model parameter with minimum $\ell_2$-norm, and derive non-asymptotic $\ell_2$-rates of convergence that show its consistency. We further provide non-asymptotic out-of-sample prediction performance guarantees that again prove consistency, even in the presence of corrupted unseen data. Notably, our results do not require the out-of-samples covariates to follow the same distribution as that of the in-sample covariates, but rather that they obey a simple linear algebraic constraint. We finish by presenting simulations that illustrate our theoretical results.
Submission history
From: Dennis Shen [view email][v1] Tue, 27 Oct 2020 17:07:36 GMT (721kb,D)
[v2] Wed, 30 Dec 2020 17:50:20 GMT (718kb,D)
[v3] Wed, 21 Apr 2021 23:12:13 GMT (802kb,D)
[v4] Thu, 4 Aug 2022 19:57:45 GMT (1672kb,D)
[v5] Fri, 25 Aug 2023 17:33:22 GMT (876kb,D)
Link back to: arXiv, form interface, contact.