We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Machine Learning

Title: Survival-Supervised Topic Modeling with Anchor Words: Characterizing Pancreatitis Outcomes

Abstract: We introduce a new approach for topic modeling that is supervised by survival analysis. Specifically, we build on recent work on unsupervised topic modeling with so-called anchor words by providing supervision through an elastic-net regularized Cox proportional hazards model. In short, an anchor word being present in a document provides strong indication that the document is partially about a specific topic. For example, by seeing "gallstones" in a document, we are fairly certain that the document is partially about medicine. Our proposed method alternates between learning a topic model and learning a survival model to find a local minimum of a block convex optimization problem. We apply our proposed approach to predicting how long patients with pancreatitis admitted to an intensive care unit (ICU) will stay in the ICU. Our approach is as accurate as the best of a variety of baselines while being more interpretable than any of the baselines.
Comments: NIPS Workshop on Machine Learning for Health 2017, fixed some equation typos, some minor wording edits
Subjects: Machine Learning (stat.ML)
Cite as: arXiv:1712.00535 [stat.ML]
  (or arXiv:1712.00535v2 [stat.ML] for this version)

Submission history

From: George Chen [view email]
[v1] Sat, 2 Dec 2017 01:57:35 GMT (25kb)
[v2] Thu, 7 Dec 2017 06:46:10 GMT (25kb)

Link back to: arXiv, form interface, contact.