We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Causal Transfer Random Forest: Combining Logged Data and Randomized Experiments for Robust Prediction

Abstract: It is often critical for prediction models to be robust to distributional shifts between training and testing data. From a causal perspective, the challenge is to distinguish the stable causal relationships from the unstable spurious correlations across shifts. We describe a causal transfer random forest (CTRF) that combines existing training data with a small amount of data from a randomized experiment to train a model which is robust to the feature shifts and therefore transfers to a new targeting distribution. Theoretically, we justify the robustness of the approach against feature shifts with the knowledge from causal learning. Empirically, we evaluate the CTRF using both synthetic data experiments and real-world experiments in the Bing Ads platform, including a click prediction task and in the context of an end-to-end counterfactual optimization system. The proposed CTRF produces robust predictions and outperforms most baseline methods compared in the presence of feature shifts.
Comments: 9 pages, 7 figures, 2 tables, accepted to WSDM 2021
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:2010.08710 [cs.LG]
  (or arXiv:2010.08710v2 [cs.LG] for this version)

Submission history

From: Shuxi Zeng [view email]
[v1] Sat, 17 Oct 2020 03:54:37 GMT (235kb,D)
[v2] Thu, 14 Jan 2021 16:29:07 GMT (246kb,D)

Link back to: arXiv, form interface, contact.