On the Granularity of Explanations in Model Agnostic NLP Interpretability

Rychener, Yves; Renard, Xavier; Seddah, Djamé; Frossard, Pascal; Detyniecki, Marcin

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2012

Computer Science > Computation and Language

Title: On the Granularity of Explanations in Model Agnostic NLP Interpretability

Authors: Yves Rychener, Xavier Renard, Djamé Seddah, Pascal Frossard, Marcin Detyniecki

(Submitted on 24 Dec 2020 (v1), last revised 8 Aug 2022 (this version, v3))

Abstract: Current methods for Black-Box NLP interpretability, like LIME or SHAP, are based on altering the text to interpret by removing words and modeling the Black-Box response. In this paper, we outline limitations of this approach when using complex BERT-based classifiers: The word-based sampling produces texts that are out-of-distribution for the classifier and further gives rise to a high-dimensional search space, which can't be sufficiently explored when time or computation power is limited. Both of these challenges can be addressed by using segments as elementary building blocks for NLP interpretability. As illustration, we show that the simple choice of sentences greatly improves on both of these challenges. As a consequence, the resulting explainer attains much better fidelity on a benchmark classification task.

Comments:	accepted for the ECML PKDD 2022 International Workshop on eXplainable Knowledge Discovery in Data Mining (XKDD 2022), Grenoble, France
Subjects:	Computation and Language (cs.CL); Machine Learning (stat.ML)
Cite as:	arXiv:2012.13189 [cs.CL]
	(or arXiv:2012.13189v3 [cs.CL] for this version)

Submission history

From: Yves Rychener [view email]
[v1] Thu, 24 Dec 2020 10:32:41 GMT (8752kb,D)
[v2] Sun, 27 Dec 2020 17:54:38 GMT (8752kb,D)
[v3] Mon, 8 Aug 2022 11:04:42 GMT (6887kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2012.13189

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: On the Granularity of Explanations in Model Agnostic NLP Interpretability

Submission history