We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Tree Space Prototypes: Another Look at Making Tree Ensembles Interpretable

Abstract: Ensembles of decision trees perform well on many problems, but are not interpretable. In contrast to existing approaches in interpretability that focus on explaining relationships between features and predictions, we propose an alternative approach to interpret tree ensemble classifiers by surfacing representative points for each class -- prototypes. We introduce a new distance for Gradient Boosted Tree models, and propose new, adaptive prototype selection methods with theoretical guarantees, with the flexibility to choose a different number of prototypes in each class. We demonstrate our methods on random forests and gradient boosted trees, showing that the prototypes can perform as well as or even better than the original tree ensemble when used as a nearest-prototype classifier. In a user study, humans were better at predicting the output of a tree ensemble classifier when using prototypes than when using Shapley values, a popular feature attribution method. Hence, prototypes present a viable alternative to feature-based explanations for tree ensembles.
Comments: Camera-ready version for ACM-IMS FODS 2020. A short version was presented at NIPS 2016 Workshop on Interpretable Machine Learning for Complex Systems
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:1611.07115 [stat.ML]
  (or arXiv:1611.07115v3 [stat.ML] for this version)

Submission history

From: Sarah Tan [view email]
[v1] Tue, 22 Nov 2016 00:53:29 GMT (168kb,D)
[v2] Sun, 24 Nov 2019 22:56:22 GMT (5670kb,D)
[v3] Tue, 25 Aug 2020 08:01:26 GMT (3941kb,D)

Link back to: arXiv, form interface, contact.