An Attention Matrix for Every Decision: Faithfulness-based Arbitration Among Multiple Attention-Based Interpretations of Transformers in Text Classification

Mylonas, Nikolaos; Mollas, Ioannis; Tsoumakas, Grigorios

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2209

Computer Science > Computation and Language

Title: An Attention Matrix for Every Decision: Faithfulness-based Arbitration Among Multiple Attention-Based Interpretations of Transformers in Text Classification

Authors: Nikolaos Mylonas, Ioannis Mollas, Grigorios Tsoumakas

(Submitted on 22 Sep 2022 (v1), last revised 28 Nov 2022 (this version, v2))

Abstract: Transformers are widely used in natural language processing, where they consistently achieve state-of-the-art performance. This is mainly due to their attention-based architecture, which allows them to model rich linguistic relations between (sub)words. However, transformers are difficult to interpret. Being able to provide reasoning for its decisions is an important property for a model in domains where human lives are affected. With transformers finding wide use in such fields, the need for interpretability techniques tailored to them arises. We propose a new technique that selects the most faithful attention-based interpretation among the several ones that can be obtained by combining different head, layer and matrix operations. In addition, two variations are introduced towards (i) reducing the computational complexity, thus being faster and friendlier to the environment, and (ii) enhancing the performance in multi-label data. We further propose a new faithfulness metric that is more suitable for transformer models and exhibits high correlation with the area under the precision-recall curve based on ground truth rationales. We validate the utility of our contributions with a series of quantitative and qualitative experiments on seven datasets.

Comments:	16 pages, 7 figures, 5 tables, Submitted to DAMI Journal (ECMLPKDD2023 Special Issue)
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2209.10876 [cs.CL]
	(or arXiv:2209.10876v2 [cs.CL] for this version)

Submission history

From: Ioannis Mollas [view email]
[v1] Thu, 22 Sep 2022 09:19:22 GMT (852kb,D)
[v2] Mon, 28 Nov 2022 11:37:33 GMT (3162kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2209.10876

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: An Attention Matrix for Every Decision: Faithfulness-based Arbitration Among Multiple Attention-Based Interpretations of Transformers in Text Classification

Submission history