We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Rational Shapley Values

Abstract: Explaining the predictions of opaque machine learning algorithms is an important and challenging task, especially as complex models are increasingly used to assist in high-stakes decisions such as those arising in healthcare and finance. Most popular tools for post-hoc explainable artificial intelligence (XAI) are either insensitive to context (e.g., feature attributions) or difficult to summarize (e.g., counterfactuals). In this paper, I introduce $\textit{rational Shapley values}$, a novel XAI method that synthesizes and extends these seemingly incompatible approaches in a rigorous, flexible manner. I leverage tools from decision theory and causal modeling to formalize and implement a pragmatic approach that resolves a number of known challenges in XAI. By pairing the distribution of random variables with the appropriate reference class for a given explanation task, I illustrate through theory and experiments how user goals and knowledge can inform and constrain the solution set in an iterative fashion. The method compares favorably to state of the art XAI tools in a range of quantitative and qualitative comparisons.
Comments: To be presented at the 2022 ACM FAccT Conference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Journal reference: 2022 ACM Conference on Fairness, Accountability, and Transparency
DOI: 10.1145/3531146.3533170
Cite as: arXiv:2106.10191 [cs.LG]
  (or arXiv:2106.10191v2 [cs.LG] for this version)

Submission history

From: David Watson [view email]
[v1] Fri, 18 Jun 2021 15:45:21 GMT (1377kb,D)
[v2] Mon, 16 May 2022 14:46:19 GMT (1547kb,D)

Link back to: arXiv, form interface, contact.