We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations

DBLP - CS Bibliography


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Machine Learning

Title: Comment: Entropy Learning for Dynamic Treatment Regimes

Authors: Nathan Kallus
Abstract: I congratulate Profs. Binyan Jiang, Rui Song, Jialiang Li, and Donglin Zeng (JSLZ) for an exciting development in conducting inferences on optimal dynamic treatment regimes (DTRs) learned via empirical risk minimization using the entropy loss as a surrogate. JSLZ's approach leverages a rejection-and-importance-sampling estimate of the value of a given decision rule based on inverse probability weighting (IPW) and its interpretation as a weighted (or cost-sensitive) classification. Their use of smooth classification surrogates enables their careful approach to analyzing asymptotic distributions. However, even for evaluation purposes, the IPW estimate is problematic as it leads to weights that discard most of the data and are extremely variable on whatever remains. In this comment, I discuss an optimization-based alternative to evaluating DTRs, review several connections, and suggest directions forward. This extends the balanced policy evaluation approach of Kallus (2018a) to the longitudinal setting.
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Journal reference: Statistica Sinica 29.4 (2019): 1697-1705
DOI: 10.5705/ss.202019.0115
Cite as: arXiv:2004.02778 [stat.ML]
  (or arXiv:2004.02778v1 [stat.ML] for this version)

Submission history

From: Nathan Kallus [view email]
[v1] Mon, 6 Apr 2020 16:11:05 GMT (11kb)

Link back to: arXiv, form interface, contact.