Title: Predictive Hierarchical Clustering: Learning clusters of CPT codes for improving surgical outcomes

Abstract: We develop Predictive Hierarchical Clustering (PHC), a novel algorithm for agglomerative hierarchical clustering of Current Procedural Terminology (CPT) codes. PHC clusters subgroups found within the data, rather than individual observations, so that the discovered clusters yield optimal performance of a classification model. Merges are therefore chosen by a Bayesian hypothesis test that selects the pairings of subgroups giving the best model fit, as measured by held-out predictive likelihoods. We place a Dirichlet prior on the probability of merging clusters, which allows us to adjust the size and sparsity of the clusters. The motivation is to predict patient-specific surgical outcomes using data from ACS NSQIP (the American College of Surgeons National Surgical Quality Improvement Program). An important predictor of surgical outcomes is the surgical procedure actually performed, as described by a CPT code. We use PHC to cluster CPT codes, represented as subgroups, in a way that lets us predict patient-specific outcomes better than the currently used clusters, which are based on clinical judgment.
Comments: Accepted at MLHC 2017; to appear in JMLR
Subjects: Methodology (stat.ME); Applications (stat.AP)
Cite as: arXiv:1604.07031 [stat.ME]
  (or arXiv:1604.07031v2 [stat.ME] for this version)

Submission history

From: Elizabeth Lorenzi
[v1] Sun, 24 Apr 2016 13:49:23 GMT (661kb,D)
[v2] Tue, 1 Aug 2017 19:02:01 GMT (604kb,D)