References & Citations
Mathematics > Statistics Theory
Title: Dense Signals, Linear Estimators, and Out-of-Sample Prediction for High-Dimensional Linear Models
(Submitted on 15 Feb 2011 (v1), last revised 20 Mar 2012 (this version, v3))
Abstract: Motivated by questions about dense (non-sparse) signals in high-dimensional data analysis, we study the unconditional out-of-sample prediction error (predictive risk) associated with three popular linear estimators for high-dimensional linear models: ridge regression estimators, scalar multiples of the ordinary least squares (OLS) estimator (referred to as James-Stein shrinkage estimators), and marginal regression estimators. The results in this paper require no assumptions about sparsity and imply: (i) if prior information about the population predictor covariance is available, then the ridge estimator outperforms the OLS, James-Stein, and marginal estimators; (ii) if little is known about the population predictor covariance, then the James-Stein estimator may be an effective alternative to the ridge estimator; and (iii) the marginal estimator has serious deficiencies for out-of-sample prediction. Both finite sample and asymptotic properties of the estimators are studied in this paper. Though various asymptotic regimes are considered, we focus on the setting where the number of predictors is roughly proportional to the number of observations. Ultimately, the results presented here provide new and detailed practical guidance regarding several well-known non-sparse methods for high-dimensional linear models.
Submission history
From: Lee Dicker [view email][v1] Tue, 15 Feb 2011 02:59:36 GMT (61kb)
[v2] Sat, 25 Feb 2012 16:47:10 GMT (333kb)
[v3] Tue, 20 Mar 2012 19:59:29 GMT (331kb)
Link back to: arXiv, form interface, contact.