
Title: Multitask learning and benchmarking with clinical time series data

Abstract: Health care is one of the most exciting frontiers in data mining and machine learning. Successful adoption of electronic health records (EHRs) created an explosion in digital clinical data available for analysis, but progress in machine learning for healthcare research has been difficult to measure because of the absence of publicly available benchmark data sets. To address this problem, we propose four clinical prediction benchmarks using data derived from the publicly available Medical Information Mart for Intensive Care (MIMIC-III) database. These tasks cover a range of clinical problems including modeling risk of mortality, forecasting length of stay, detecting physiologic decline, and phenotype classification. We propose strong linear and neural baselines for all four tasks and evaluate the effect of deep supervision, multitask training and data-specific architectural modifications on the performance of neural models.
Comments: This version of the paper adds details about the generation of the benchmark tasks and describes improved neural baselines
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Journal reference: Scientific Data 6 (2019) 96
DOI: 10.1038/s41597-019-0103-9
Cite as: arXiv:1703.07771 [stat.ML]
  (or arXiv:1703.07771v3 [stat.ML] for this version)

Submission history

From: Hrayr Harutyunyan
[v1] Wed, 22 Mar 2017 17:53:27 GMT (1857kb,D)
[v2] Fri, 21 Dec 2018 21:56:38 GMT (2895kb,D)
[v3] Fri, 9 Aug 2019 19:21:40 GMT (597kb,D)
