Abstraction, Validation, and Generalization for Explainable Artificial Intelligence

Yang, Scott Cheng-Hsin; Folke, Tomas; Shafto, Patrick

Full-text links:

Download:

Current browse context:

cs.AI

< prev | next >

new | recent | 2105

Change to browse by:

Computer Science > Artificial Intelligence

Title: Abstraction, Validation, and Generalization for Explainable Artificial Intelligence

Authors: Scott Cheng-Hsin Yang, Tomas Folke, Patrick Shafto

(Submitted on 16 May 2021 (v1), last revised 12 Oct 2021 (this version, v2))

Abstract: Neural network architectures are achieving superhuman performance on an expanding range of tasks. To effectively and safely deploy these systems, their decision-making must be understandable to a wide range of stakeholders. Methods to explain AI have been proposed to answer this challenge, but a lack of theory impedes the development of systematic abstractions which are necessary for cumulative knowledge gains. We propose Bayesian Teaching as a framework for unifying explainable AI (XAI) by integrating machine learning and human learning. Bayesian Teaching formalizes explanation as a communication act of an explainer to shift the beliefs of an explainee. This formalization decomposes any XAI method into four components: (1) the inference to be explained, (2) the explanatory medium, (3) the explainee model, and (4) the explainer model. The abstraction afforded by Bayesian Teaching to decompose any XAI method elucidates the invariances among them. The decomposition of XAI systems enables modular validation, as each of the first three components listed can be tested semi-independently. This decomposition also promotes generalization through recombination of components from different XAI systems, which facilitates the generation of novel variants. These new variants need not be evaluated one by one provided that each component has been validated, leading to an exponential decrease in development time. Finally, by making the goal of explanation explicit, Bayesian Teaching helps developers to assess how suitable an XAI system is for its intended real-world use case. Thus, Bayesian Teaching provides a theoretical framework that encourages systematic, scientific investigation of XAI.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2105.07508 [cs.AI]
	(or arXiv:2105.07508v2 [cs.AI] for this version)

Submission history

From: Scott Cheng-Hsin Yang [view email]
[v1] Sun, 16 May 2021 20:40:23 GMT (26kb)
[v2] Tue, 12 Oct 2021 18:32:56 GMT (31kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2105.07508

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Artificial Intelligence

Title: Abstraction, Validation, and Generalization for Explainable Artificial Intelligence

Submission history