We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.ST

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Statistics Theory

Title: On Principal Component Regression in a High-Dimensional Error-in-Variables Setting

Abstract: We analyze the classical method of Principal Component Regression (PCR) in the high-dimensional error-in-variables setting. Here, the observed covariates are not only noisy and contain missing data, but the number of covariates can also exceed the sample size. Under suitable conditions, we establish that PCR identifies the unique model parameter with minimum $\ell_2$-norm, and derive non-asymptotic $\ell_2$-rates of convergence that show its consistency. We further provide non-asymptotic out-of-sample prediction performance guarantees that again prove consistency, even in the presence of corrupted unseen data. Notably, our results do not require the out-of-samples covariates to follow the same distribution as that of the in-sample covariates, but rather that they obey a simple linear algebraic constraint. We finish by presenting simulations that illustrate our theoretical results.
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:2010.14449 [math.ST]
  (or arXiv:2010.14449v1 [math.ST] for this version)

Submission history

From: Dennis Shen [view email]
[v1] Tue, 27 Oct 2020 17:07:36 GMT (721kb,D)
[v2] Wed, 30 Dec 2020 17:50:20 GMT (718kb,D)
[v3] Wed, 21 Apr 2021 23:12:13 GMT (802kb,D)
[v4] Thu, 4 Aug 2022 19:57:45 GMT (1672kb,D)
[v5] Fri, 25 Aug 2023 17:33:22 GMT (876kb,D)

Link back to: arXiv, form interface, contact.