Current browse context:
stat.ML
Change to browse by:
References & Citations
Statistics > Machine Learning
Title: Machine Learning for Set-Identified Linear Models
(Submitted on 28 Dec 2017 (v1), revised 6 Dec 2019 (this version, v3), latest version 13 Dec 2022 (v6))
Abstract: This paper provides estimation and inference methods for an identified set where the selection among a very large number of covariates is based on modern machine learning tools. I characterize the boundary of the identified set (i.e., support function) using a semiparametric moment condition. Combining Neyman-orthogonality and sample splitting ideas, I construct a root-N consistent, uniformly asymptotically Gaussian estimator of the support function and propose a weighted bootstrap procedure to conduct inference about the identified set. I provide a general method to construct a Neyman-orthogonal moment condition for the support function. Applying my method to Lee (2008)'s endogenous selection model, I provide the asymptotic theory for the sharp (i.e., the tightest possible) bounds on the Average Treatment Effect in the presence of high-dimensional covariates. Furthermore, I relax the conventional monotonicity assumption and allow the sign of the treatment effect on the selection (e.g., employment) to be determined by covariates. Using JobCorps data set with very rich baseline characteristics, I substantially tighten the bounds on the JobCorps effect on wages under weakened monotonicity assumption.
Submission history
From: Vira Semenova [view email][v1] Thu, 28 Dec 2017 19:04:28 GMT (11kb)
[v2] Tue, 6 Nov 2018 02:12:00 GMT (368kb,D)
[v3] Fri, 6 Dec 2019 21:48:51 GMT (1115kb,D)
[v4] Sat, 11 Sep 2021 16:19:18 GMT (394kb)
[v5] Sun, 11 Dec 2022 04:39:43 GMT (1534kb)
[v6] Tue, 13 Dec 2022 05:56:26 GMT (1558kb)
Link back to: arXiv, form interface, contact.