We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Quantitative Biology > Quantitative Methods

Title: On Classifying Sepsis Heterogeneity in the ICU: Insight Using Machine Learning

Abstract: Current machine learning models aiming to predict sepsis from Electronic Health Records (EHR) do not account for the heterogeneity of the condition, despite its emerging importance in prognosis and treatment. This work demonstrates the added value of stratifying the types of organ dysfunction observed in patients who develop sepsis in the ICU in improving the ability to recognise patients at risk of sepsis from their EHR data. Using an ICU dataset of 13,728 records, we identify clinically significant sepsis subpopulations with distinct organ dysfunction patterns. Classification experiments using Random Forest, Gradient Boost Trees and Support Vector Machines, aiming to distinguish patients who develop sepsis in the ICU from those who do not, show that features selected using sepsis subpopulations as background knowledge yield a superior performance regardless of the classification model used. Our findings can steer machine learning efforts towards more personalised models for complex conditions including sepsis.
Comments: 3 Figures and 2 tables. Accepted for publication at the Journal of American Medical Informatics Association
Subjects: Quantitative Methods (q-bio.QM); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1912.00672 [q-bio.QM]
  (or arXiv:1912.00672v2 [q-bio.QM] for this version)

Submission history

From: Zina Ibrahim [view email]
[v1] Mon, 2 Dec 2019 10:32:40 GMT (195kb)
[v2] Tue, 3 Dec 2019 12:42:51 GMT (1559kb)

Link back to: arXiv, form interface, contact.