What Makes Forest-Based Heterogeneous Treatment Effect Estimators Work?

Dandl, Susanne; Hothorn, Torsten; Seibold, Heidi; Sverdrup, Erik; Wager, Stefan; Zeileis, Achim

Full-text links:

Download:

Current browse context:

stat.ME

< prev | next >

new | recent | 2206

Statistics > Methodology

Title: What Makes Forest-Based Heterogeneous Treatment Effect Estimators Work?

Authors: Susanne Dandl, Torsten Hothorn, Heidi Seibold, Erik Sverdrup, Stefan Wager, Achim Zeileis

(Submitted on 21 Jun 2022 (v1), last revised 20 Dec 2023 (this version, v2))

Abstract: Estimation of heterogeneous treatment effects (HTE) is of prime importance in many disciplines, ranging from personalized medicine to economics among many others. Random forests have been shown to be a flexible and powerful approach to HTE estimation in both randomized trials and observational studies. In particular "causal forests", introduced by Athey, Tibshirani and Wager (2019), along with the R implementation in package grf were rapidly adopted. A related approach, called "model-based forests", that is geared towards randomized trials and simultaneously captures effects of both prognostic and predictive variables, was introduced by Seibold, Zeileis and Hothorn (2018) along with a modular implementation in the R package model4you.
Here, we present a unifying view that goes beyond the theoretical motivations and investigates which computational elements make causal forests so successful and how these can be blended with the strengths of model-based forests. To do so, we show that both methods can be understood in terms of the same parameters and model assumptions for an additive model under L2 loss. This theoretical insight allows us to implement several flavors of "model-based causal forests" and dissect their different elements in silico.
The original causal forests and model-based forests are compared with the new blended versions in a benchmark study exploring both randomized trials and observational settings. In the randomized setting, both approaches performed akin. If confounding was present in the data generating process, we found local centering of the treatment indicator with the corresponding propensities to be the main driver for good performance. Local centering of the outcome was less important, and might be replaced or enhanced by simultaneous split selection with respect to both prognostic and predictive effects.

Comments:	Contribution has been accepted for publication in the Annals of Applied Statistics
Subjects:	Methodology (stat.ME); Machine Learning (stat.ML)
Cite as:	arXiv:2206.10323 [stat.ME]
	(or arXiv:2206.10323v2 [stat.ME] for this version)

Submission history

From: Torsten Hothorn [view email]
[v1] Tue, 21 Jun 2022 12:45:07 GMT (1199kb,D)
[v2] Wed, 20 Dec 2023 14:20:51 GMT (897kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:2206.10323

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Methodology

Title: What Makes Forest-Based Heterogeneous Treatment Effect Estimators Work?

Submission history