Computer Science > Computation and Language
Title: On the Granularity of Explanations in Model Agnostic NLP Interpretability
(Submitted on 24 Dec 2020 (v1), last revised 8 Aug 2022 (this version, v3))
Abstract: Current methods for black-box NLP interpretability, such as LIME or SHAP, alter the text to be interpreted by removing words and then model the black box's response. In this paper, we outline the limitations of this approach when using complex BERT-based classifiers: word-based sampling produces texts that are out-of-distribution for the classifier and gives rise to a high-dimensional search space that cannot be sufficiently explored when time or computation power is limited. Both challenges can be addressed by using segments as the elementary building blocks for NLP interpretability. As an illustration, we show that the simple choice of sentences as segments greatly mitigates both challenges. As a consequence, the resulting explainer attains much better fidelity on a benchmark classification task.
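The difference in search-space size between word-level and sentence-level perturbation can be illustrated with a small sketch. It is not the paper's implementation: the splitter is a naive regular expression, the example text is invented, and the helper names (`split_sentences`, `split_words`, `sample_perturbations`) are hypothetical. It only assumes the general removal-based sampling scheme described in the abstract.

```python
import random
import re


def split_sentences(text):
    # Naive splitter (an assumption): break after '.', '!' or '?' followed by whitespace.
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def split_words(text):
    # Whitespace tokenization, standing in for word-level segmentation.
    return text.split()


def sample_perturbations(segments, n_samples, seed=0):
    # Draw random keep/remove masks and rebuild the text from the kept segments.
    rng = random.Random(seed)
    samples = []
    for _ in range(n_samples):
        mask = [rng.random() < 0.5 for _ in segments]
        kept = " ".join(seg for seg, keep in zip(segments, mask) if keep)
        samples.append((mask, kept))
    return samples


# Invented example text, for illustration only.
text = "The plot was thin. The acting was superb! Would I watch it again? Probably yes."

sentences = split_sentences(text)
words = split_words(text)

# Each segment is either kept or removed, so d segments give 2**d possible masks.
n_sentence_masks = 2 ** len(sentences)
n_word_masks = 2 ** len(words)

sentence_samples = sample_perturbations(sentences, n_samples=5)

if __name__ == "__main__":
    print(len(sentences), "sentences ->", n_sentence_masks, "masks")
    print(len(words), "words ->", n_word_masks, "masks")
    for mask, kept in sentence_samples:
        print(mask, repr(kept))
```

In a removal-based explainer of this kind, each perturbed text would then be passed to the classifier and a simple surrogate model fitted on the masks and the responses. With sentences as segments, every perturbed text still consists of complete sentences, and the number of possible masks is far smaller than with words.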
Submission history
From: Yves Rychener
[v1] Thu, 24 Dec 2020 10:32:41 GMT (8752kb,D)
[v2] Sun, 27 Dec 2020 17:54:38 GMT (8752kb,D)
[v3] Mon, 8 Aug 2022 11:04:42 GMT (6887kb,D)