Statistics > Machine Learning
Title: A comparative study of counterfactual estimators
(Submitted on 3 Apr 2017 (v1), last revised 29 Jan 2019 (this version, v3))
Abstract: We provide a comparative study of several widely used off-policy estimators (Empirical Average, Basic Importance Sampling and Normalized Importance Sampling), detailing the different regimes where they are individually suboptimal. We then exhibit properties optimal estimators should possess. In the case where examples have been gathered using multiple policies, we show that fused estimators dominate basic ones but can still be improved.
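The three estimators named in the abstract can be illustrated with a minimal sketch. It assumes a context-free bandit setting with logged actions drawn from a known logging policy, and it uses common textbook definitions rather than the paper's exact formulations. The function names, the array-based policy representation, and the reading of "Empirical Average" as a per-action mean reward weighted by the target policy are all assumptions made for illustration.

```python
import numpy as np


def importance_weights(actions, target_probs, logging_probs):
    # Ratio pi(a_i) / mu(a_i) for each logged action a_i.
    return target_probs[actions] / logging_probs[actions]


def basic_importance_sampling(actions, rewards, target_probs, logging_probs):
    # Mean of importance-weighted rewards; unbiased when the logging
    # policy puts positive probability on every action the target uses.
    w = importance_weights(actions, target_probs, logging_probs)
    return float(np.mean(w * rewards))


def normalized_importance_sampling(actions, rewards, target_probs, logging_probs):
    # Same weighted sum, divided by the sum of weights instead of n.
    w = importance_weights(actions, target_probs, logging_probs)
    return float(np.sum(w * rewards) / np.sum(w))


def empirical_average(actions, rewards, target_probs):
    # Assumed reading: average the observed reward per action, then
    # weight each average by the target policy's probability.
    # Actions never observed in the log contribute nothing here.
    estimate = 0.0
    for action, prob in enumerate(target_probs):
        mask = actions == action
        if mask.any():
            estimate += prob * rewards[mask].mean()
    return float(estimate)


if __name__ == "__main__":
    # Hypothetical logged data: two actions, four samples.
    actions = np.array([0, 0, 1, 1])
    rewards = np.array([1.0, 0.0, 1.0, 1.0])
    logging_probs = np.array([0.75, 0.25])
    target_probs = np.array([0.5, 0.5])

    print(empirical_average(actions, rewards, target_probs))
    print(basic_importance_sampling(actions, rewards, target_probs, logging_probs))
    print(normalized_importance_sampling(actions, rewards, target_probs, logging_probs))
```

On the same log, the three functions generally return different values, which is the kind of disagreement the comparative study examines.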
Submission history
From: Thomas Nedelec
[v1] Mon, 3 Apr 2017 19:16:06 GMT (390kb,D)
[v2] Tue, 2 May 2017 08:00:07 GMT (392kb,D)
[v3] Tue, 29 Jan 2019 13:58:22 GMT (403kb,D)