We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Machine Learning

Title: Canonical Correlation Analysis for Analyzing Sequences of Medical Billing Codes

Abstract: We propose using canonical correlation analysis (CCA) to generate features from sequences of medical billing codes. Applying this novel use of CCA to a database of medical billing codes for patients with diverticulitis, we first demonstrate that the CCA embeddings capture meaningful relationships among the codes. We then generate features from these embeddings and establish their usefulness in predicting future elective surgery for diverticulitis, an important marker in efforts for reducing costs in healthcare.
Comments: Accepted at NIPS 2016 Workshop on Machine Learning for Health
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:1612.00516 [stat.ML]
  (or arXiv:1612.00516v2 [stat.ML] for this version)

Submission history

From: Corinne Jones [view email]
[v1] Thu, 1 Dec 2016 23:38:34 GMT (11kb,D)
[v2] Fri, 6 Jan 2017 16:42:36 GMT (11kb,D)

Link back to: arXiv, form interface, contact.