The Irrationality of Neural Rationale Models

Zheng, Yiming; Booth, Serena; Shah, Julie; Zhou, Yilun

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2110

Change to browse by:

Computer Science > Computation and Language

Title: The Irrationality of Neural Rationale Models

Authors: Yiming Zheng, Serena Booth, Julie Shah, Yilun Zhou

(Submitted on 14 Oct 2021 (v1), last revised 24 Jul 2022 (this version, v2))

Abstract: Neural rationale models are popular for interpretable predictions of NLP tasks. In these, a selector extracts segments of the input text, called rationales, and passes these segments to a classifier for prediction. Since the rationale is the only information accessible to the classifier, it is plausibly defined as the explanation. Is such a characterization unconditionally correct? In this paper, we argue to the contrary, with both philosophical perspectives and empirical evidence suggesting that rationale models are, perhaps, less rational and interpretable than expected. We call for more rigorous and comprehensive evaluations of these models to ensure desired properties of interpretability are indeed achieved. The code can be found at this https URL

Comments:	NAACL Workshop on Trustworthy Natural Language Processing (TrustNLP) 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2110.07550 [cs.CL]
	(or arXiv:2110.07550v2 [cs.CL] for this version)

Submission history

From: Yiming Zheng [view email]
[v1] Thu, 14 Oct 2021 17:22:10 GMT (715kb,D)
[v2] Sun, 24 Jul 2022 02:59:31 GMT (6693kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2110.07550

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: The Irrationality of Neural Rationale Models

Submission history