References & Citations
Statistics > Methodology
Title: Bayesian regression tree models for causal inference: regularization, confounding, and heterogeneous effects
(Submitted on 29 Jun 2017 (v1), last revised 13 Nov 2019 (this version, v4))
Abstract: This paper presents a novel nonlinear regression model for estimating heterogeneous treatment effects from observational data, geared specifically towards situations with small effect sizes, heterogeneous effects, and strong confounding. Standard nonlinear regression models, which may work quite well for prediction, have two notable weaknesses when used to estimate heterogeneous treatment effects. First, they can yield badly biased estimates of treatment effects when fit to data with strong confounding. The Bayesian causal forest model presented in this paper avoids this problem by directly incorporating an estimate of the propensity function in the specification of the response model, implicitly inducing a covariate-dependent prior on the regression function. Second, standard approaches to response surface modeling do not provide adequate control over the strength of regularization over effect heterogeneity. The Bayesian causal forest model permits treatment effect heterogeneity to be regularized separately from the prognostic effect of control variables, making it possible to informatively "shrink to homogeneity". We illustrate these benefits via the reanalysis of an observational study assessing the causal effects of smoking on medical expenditures as well as extensive simulation studies.
Submission history
From: P. Richard Hahn [view email][v1] Thu, 29 Jun 2017 00:20:37 GMT (329kb,D)
[v2] Thu, 12 Jul 2018 04:50:27 GMT (1286kb,D)
[v3] Thu, 23 May 2019 07:30:20 GMT (1287kb,D)
[v4] Wed, 13 Nov 2019 06:16:38 GMT (1680kb,D)
Link back to: arXiv, form interface, contact.