We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: On the Granularity of Explanations in Model Agnostic NLP Interpretability

Abstract: Current methods for Black-Box NLP interpretability, like LIME or SHAP, are based on altering the text to interpret by removing words and modeling the Black-Box response. In this paper, we outline limitations of this approach when using complex BERT-based classifiers: The word-based sampling produces texts that are out-of-distribution for the classifier and further gives rise to a high-dimensional search space, which can't be sufficiently explored when time or computation power is limited. Both of these challenges can be addressed by using segments as elementary building blocks for NLP interpretability. As illustration, we show that the simple choice of sentences greatly improves on both of these challenges. As a consequence, the resulting explainer attains much better fidelity on a benchmark classification task.
Comments: accepted for the ECML PKDD 2022 International Workshop on eXplainable Knowledge Discovery in Data Mining (XKDD 2022), Grenoble, France
Subjects: Computation and Language (cs.CL); Machine Learning (stat.ML)
Cite as: arXiv:2012.13189 [cs.CL]
  (or arXiv:2012.13189v3 [cs.CL] for this version)

Submission history

From: Yves Rychener [view email]
[v1] Thu, 24 Dec 2020 10:32:41 GMT (8752kb,D)
[v2] Sun, 27 Dec 2020 17:54:38 GMT (8752kb,D)
[v3] Mon, 8 Aug 2022 11:04:42 GMT (6887kb,D)

Link back to: arXiv, form interface, contact.