Accurate Shapley Values for explaining tree-based models

Amoukou, Salim I.; Brunel, Nicolas J-B.; Salaün, Tangi

(Submitted on 7 Jun 2021 (v1), last revised 31 May 2023 (this version, v3))

Abstract: Shapley Values (SV) are widely used in explainable AI, but their estimation and interpretation can be challenging, leading to inaccurate inferences and explanations. As a starting point, we recall an invariance principle for SV and derive the correct approach for computing the SV of categorical variables, which are particularly sensitive to the encoding used. For tree-based models, we introduce two estimators of Shapley Values that exploit the tree structure efficiently and are more accurate than state-of-the-art methods. Simulations and comparisons with state-of-the-art algorithms show the practical gain of our approach. Finally, we discuss the limitations of Shapley Values as a local explanation. These methods are available as a Python package.

Comments:	Accepted at the 25th International Conference on Artificial Intelligence and Statistics (AISTATS), 2022. V2: The section on Active Shapley Values has been removed in this updated version.
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Journal reference:	AISTATS 2022
Cite as:	arXiv:2106.03820 [stat.ML]
	(or arXiv:2106.03820v3 [stat.ML] for this version)

Submission history

From: Salim I. Amoukou
[v1] Mon, 7 Jun 2021 17:35:54 GMT (1034kb,D)
[v2] Thu, 24 Mar 2022 16:37:58 GMT (2143kb,D)
[v3] Wed, 31 May 2023 17:19:43 GMT (2148kb,D)
