We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Artificial Intelligence

Title: Causal Shapley Values: Exploiting Causal Knowledge to Explain Individual Predictions of Complex Models

Abstract: Shapley values underlie one of the most popular model-agnostic methods within explainable artificial intelligence. These values are designed to attribute the difference between a model's prediction and an average baseline to the different features used as input to the model. Being based on solid game-theoretic principles, Shapley values uniquely satisfy several desirable properties, which is why they are increasingly used to explain the predictions of possibly complex and highly non-linear machine learning models. Shapley values are well calibrated to a user's intuition when features are independent, but may lead to undesirable, counterintuitive explanations when the independence assumption is violated.
In this paper, we propose a novel framework for computing Shapley values that generalizes recent work that aims to circumvent the independence assumption. By employing Pearl's do-calculus, we show how these 'causal' Shapley values can be derived for general causal graphs without sacrificing any of their desirable properties. Moreover, causal Shapley values enable us to separate the contribution of direct and indirect effects. We provide a practical implementation for computing causal Shapley values based on causal chain graphs when only partial information is available and illustrate their utility on a real-world example.
Comments: Accepted at 34th Conference on Neural Information Processing Systems (NeurIPS 2020)
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
ACM classes: I.2.6
Cite as: arXiv:2011.01625 [cs.AI]
  (or arXiv:2011.01625v1 [cs.AI] for this version)

Submission history

From: Tom Heskes [view email]
[v1] Tue, 3 Nov 2020 11:11:36 GMT (603kb,D)

Link back to: arXiv, form interface, contact.