Computer Science > Machine Learning
Title: Counterfactual Shapley Additive Explanations
(Submitted on 27 Oct 2021 (v1), last revised 16 May 2022 (this version, v4))
Abstract: Feature attributions are a common paradigm for model explanations due to their simplicity in assigning a single numeric score for each input feature to a model. In the actionable recourse setting, wherein the goal of the explanations is to improve outcomes for model consumers, it is often unclear how feature attributions should be correctly used. With this work, we aim to strengthen and clarify the link between actionable recourse and feature attributions. Concretely, we propose a variant of SHAP, Counterfactual SHAP (CF-SHAP), that incorporates counterfactual information to produce a background dataset for use within the marginal (a.k.a. interventional) Shapley value framework. Using numerous synthetic examples, we motivate the need, within the actionable recourse setting, for careful consideration of background datasets when using Shapley values for feature attributions. Moreover, we demonstrate the efficacy of CF-SHAP by proposing and justifying a quantitative score for feature attributions, counterfactual-ability, showing that as measured by this metric, CF-SHAP is superior to existing methods when evaluated on public datasets using tree ensembles.
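The core idea in the abstract, computing marginal Shapley values against a background dataset built from counterfactuals, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a toy linear scorer in place of a tree ensemble, exact subset enumeration in place of a tree-specific algorithm, and a simple nearest-neighbour rule as the counterfactual generator. The function names `marginal_shapley` and `knn_counterfactuals`, the weights `w`, and the synthetic data are all hypothetical.

```python
import itertools
import math

import numpy as np


def marginal_shapley(model, x, background):
    """Exact marginal (interventional) Shapley values of `model` at `x`.

    Features outside a coalition are replaced by the rows of `background`,
    and the model output is averaged over those rows.
    """
    n = x.shape[0]

    def value(coalition):
        z = background.copy()
        idx = np.array(coalition, dtype=int)
        z[:, idx] = x[idx]  # coalition features take the values of x
        return model(z).mean()

    phi = np.zeros(n)
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Standard Shapley weight for a coalition of this size.
            weight = (math.factorial(size) * math.factorial(n - size - 1)
                      / math.factorial(n))
            for s in itertools.combinations(others, size):
                phi[i] += weight * (value(s + (i,)) - value(s))
    return phi


def knn_counterfactuals(model, x, data, k=5):
    """Assumed counterfactual generator: the k rows of `data` nearest to `x`
    whose decision (score > 0) differs from the decision for `x`."""
    decision_x = model(x[None, :])[0] > 0
    flipped = data[(model(data) > 0) != decision_x]
    dist = np.linalg.norm(flipped - x, axis=1)
    return flipped[np.argsort(dist)[:k]]


# Toy setup (all values are illustrative assumptions).
w = np.array([1.0, -2.0, 0.5])


def model(z):
    return z @ w - 1.0  # decision is positive when the score exceeds 0


rng = np.random.default_rng(0)
data = rng.normal(size=(200, 3))
x = np.array([-1.0, 1.0, 0.0])  # receives a negative decision

background = knn_counterfactuals(model, x, data, k=5)
phi = marginal_shapley(model, x, background)
```

With this choice of background, the attributions sum to the gap between the score of `x` and the mean score of its counterfactuals, so each value describes a feature's share of the distance to a changed outcome rather than to the population average.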
Submission history
From: Emanuele Albini
[v1] Wed, 27 Oct 2021 08:44:53 GMT (4130kb,D)
[v2] Tue, 2 Nov 2021 14:59:28 GMT (4127kb,D)
[v3] Thu, 17 Mar 2022 23:27:56 GMT (4003kb,D)
[v4] Mon, 16 May 2022 14:28:42 GMT (2924kb,D)