Comment: Entropy Learning for Dynamic Treatment Regimes

Kallus, Nathan

doi:10.5705/ss.202019.0115

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2004

Statistics > Machine Learning

Title: Comment: Entropy Learning for Dynamic Treatment Regimes

Authors: Nathan Kallus

(Submitted on 6 Apr 2020)

Abstract: I congratulate Profs. Binyan Jiang, Rui Song, Jialiang Li, and Donglin Zeng (JSLZ) for an exciting development in conducting inferences on optimal dynamic treatment regimes (DTRs) learned via empirical risk minimization using the entropy loss as a surrogate. JSLZ's approach leverages a rejection-and-importance-sampling estimate of the value of a given decision rule based on inverse probability weighting (IPW) and its interpretation as a weighted (or cost-sensitive) classification. Their use of smooth classification surrogates enables their careful approach to analyzing asymptotic distributions. However, even for evaluation purposes, the IPW estimate is problematic as it leads to weights that discard most of the data and are extremely variable on whatever remains. In this comment, I discuss an optimization-based alternative to evaluating DTRs, review several connections, and suggest directions forward. This extends the balanced policy evaluation approach of Kallus (2018a) to the longitudinal setting.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Journal reference:	Statistica Sinica 29.4 (2019): 1697-1705
DOI:	10.5705/ss.202019.0115
Cite as:	arXiv:2004.02778 [stat.ML]
	(or arXiv:2004.02778v1 [stat.ML] for this version)

Submission history

From: Nathan Kallus [view email]
[v1] Mon, 6 Apr 2020 16:11:05 GMT (11kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:2004.02778

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Statistics > Machine Learning

Title: Comment: Entropy Learning for Dynamic Treatment Regimes

Submission history