We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.ST

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Statistics Theory

Title: Inference in Linear Regression Models with Many Covariates and Heteroskedasticity

Abstract: The linear regression model is widely used in empirical work in Economics, Statistics, and many other disciplines. Researchers often include many covariates in their linear model specification in an attempt to control for confounders. We give inference methods that allow for many covariates and heteroskedasticity. Our results are obtained using high-dimensional approximations, where the number of included covariates are allowed to grow as fast as the sample size. We find that all of the usual versions of Eicker-White heteroskedasticity consistent standard error estimators for linear models are inconsistent under this asymptotics. We then propose a new heteroskedasticity consistent standard error formula that is fully automatic and robust to both (conditional)\ heteroskedasticity of unknown form and the inclusion of possibly many covariates. We apply our findings to three settings: parametric linear models with many covariates, linear panel models with many fixed effects, and semiparametric semi-linear models with many technical regressors. Simulation evidence consistent with our theoretical results is also provided. The proposed methods are also illustrated with an empirical application.
Subjects: Statistics Theory (math.ST); Econometrics (econ.EM); Methodology (stat.ME)
Cite as: arXiv:1507.02493 [math.ST]
  (or arXiv:1507.02493v2 [math.ST] for this version)

Submission history

From: Matias Cattaneo [view email]
[v1] Thu, 9 Jul 2015 13:13:47 GMT (29kb)
[v2] Mon, 16 Jan 2017 16:25:20 GMT (30kb)

Link back to: arXiv, form interface, contact.