T-Explainer: A Model-Agnostic Explainability Framework Based on Gradients

Ortigossa, Evandro S.; Dias, Fábio F.; Barr, Brian; Silva, Claudio T.; Nonato, Luis Gustavo

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2404

Change to browse by:

Computer Science > Machine Learning

Title: T-Explainer: A Model-Agnostic Explainability Framework Based on Gradients

Authors: Evandro S. Ortigossa, Fábio F. Dias, Brian Barr, Claudio T. Silva, Luis Gustavo Nonato

(Submitted on 25 Apr 2024)

Abstract: The development of machine learning applications has increased significantly in recent years, motivated by the remarkable ability of learning-powered systems to discover and generalize intricate patterns hidden in massive datasets. Modern learning models, while powerful, often exhibit a level of complexity that renders them opaque black boxes, resulting in a notable lack of transparency that hinders our ability to decipher their decision-making processes. Opacity challenges the interpretability and practical application of machine learning, especially in critical domains where understanding the underlying reasons is essential for informed decision-making. Explainable Artificial Intelligence (XAI) rises to meet that challenge, unraveling the complexity of black boxes by providing elucidating explanations. Among the various XAI approaches, feature attribution/importance XAI stands out for its capacity to delineate the significance of input features in the prediction process. However, most existing attribution methods have limitations, such as instability, when divergent explanations may result from similar or even the same instance. In this work, we introduce T-Explainer, a novel local additive attribution explainer based on Taylor expansion endowed with desirable properties, such as local accuracy and consistency, while stable over multiple runs. We demonstrate T-Explainer's effectiveness through benchmark experiments with well-known attribution methods. In addition, T-Explainer is developed as a comprehensive XAI framework comprising quantitative metrics to assess and visualize attribution explanations.

Comments:	15 pages and 4 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2404.16495 [cs.LG]
	(or arXiv:2404.16495v1 [cs.LG] for this version)

Submission history

From: Evandro S. Ortigossa [view email]
[v1] Thu, 25 Apr 2024 10:40:49 GMT (197kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2404.16495

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: T-Explainer: A Model-Agnostic Explainability Framework Based on Gradients

Submission history