Comparing the Performance of Statistical Adjustment Methods By Recovering the Experimental Benchmark from the REFLUX Trial

Keele, Luke; O'Neill, Stephen; Grieve, Richard

Full-text links:

Download:

Current browse context:

stat.AP

< prev | next >

new | recent | 2001

Statistics > Applications

Title: Comparing the Performance of Statistical Adjustment Methods By Recovering the Experimental Benchmark from the REFLUX Trial

Authors: Luke Keele, Stephen O'Neill, Richard Grieve

(Submitted on 22 Jan 2020)

Abstract: Much evidence in comparative effectiveness research is based on observational studies. Researchers who conduct observational studies typically assume that there are no unobservable differences between the treated and control groups. Treatment effects are estimated after adjusting for observed differences between treated and controls. However, treatment effect estimates may be biased due to model misspecification. That is, if the method of treatment effect estimation imposes unduly strong functional form assumptions, treatment effect estimates may be significantly biased. In this study, we compare the performance of a wide variety of treatment effect estimation methods. We do so within the context of the REFLUX study from the UK. In REFLUX, after study qualification, participants were enrolled in either a randomized trial arm or patient preference arm. In the randomized trial, patients were randomly assigned to either surgery or medical management. In the patient preference arm, participants selected to either have surgery or medical management. We attempt to recover the treatment effect estimate from the randomized trial arm using the data from the patient preference arm of the study. We vary the method of treatment effect estimation and record which methods are successful and which are not. We apply over 20 different methods including standard regression models as well as advanced machine learning methods. We find that simple propensity score matching methods perform the worst. We also find significant variation in performance across methods. The wide variation in performance suggests analysts should use multiple methods of estimation as a robustness check.

Subjects:	Applications (stat.AP)
Cite as:	arXiv:2001.08170 [stat.AP]
	(or arXiv:2001.08170v1 [stat.AP] for this version)

Submission history

From: Luke Keele [view email]
[v1] Wed, 22 Jan 2020 17:37:48 GMT (35kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:2001.08170

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Applications

Title: Comparing the Performance of Statistical Adjustment Methods By Recovering the Experimental Benchmark from the REFLUX Trial

Submission history