Making a (Counterfactual) Difference One Rationale at a Time

Plyler, Mitchell; Green, Michael; Chi, Min

(Submitted on 13 Jan 2022)

Abstract: Rationales, snippets of extracted text that explain an inference, have emerged as a popular framework for interpretable natural language processing (NLP). Rationale models typically consist of two cooperating modules, a selector and a classifier, with the goal of maximizing the mutual information (MMI) between the "selected" text and the document label. Despite their promise, MMI-based methods often pick up on spurious text patterns and result in models with nonsensical behaviors. In this work, we investigate whether counterfactual data augmentation (CDA), without human assistance, can improve the performance of the selector by lowering the mutual information between spurious signals and the document label. Our counterfactuals are produced in an unsupervised fashion using class-dependent generative models. Through an information-theoretic lens, we derive properties of the unaugmented dataset for which our CDA approach would succeed. The effectiveness of CDA is empirically evaluated by comparing against several baselines, including an improved MMI-based rationale schema, on two multi-aspect datasets. Our results show that CDA produces rationales that better capture the signal of interest.
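
The core claim of the abstract, that counterfactual augmentation can lower the mutual information between a spurious signal and the label, can be illustrated with a toy calculation. The sketch below is not the authors' implementation: the paper generates counterfactuals with class-dependent generative models, whereas here a "document" is just a hypothetical triple `(causal_token, spurious_token, label)` and the counterfactual is built by hand by flipping the causal token and the label while leaving the spurious token untouched. All function and variable names are assumptions made for illustration.

```python
from collections import Counter
from math import log2


def mutual_information(pairs):
    """Empirical mutual information, in bits, between the two coordinates of `pairs`."""
    n = len(pairs)
    joint = Counter(pairs)
    left = Counter(a for a, _ in pairs)
    right = Counter(b for _, b in pairs)
    # Sum p(a, b) * log2( p(a, b) / (p(a) * p(b)) ) over observed pairs.
    return sum(
        (count / n) * log2(count * n / (left[a] * right[b]))
        for (a, b), count in joint.items()
    )


# Hypothetical toy dataset: (causal_token, spurious_token, label).
# The causal token determines the label; the spurious token merely
# agrees with the label in 90% of the documents.
original = (
    [(1, 1, 1)] * 45 + [(1, 0, 1)] * 5
    + [(0, 0, 0)] * 45 + [(0, 1, 0)] * 5
)


def counterfactual(doc):
    """Hand-built stand-in for a generated counterfactual (an assumption):
    flip the causal token and the label, keep the spurious token."""
    causal, spurious, label = doc
    return (1 - causal, spurious, 1 - label)


augmented = original + [counterfactual(doc) for doc in original]


def mi_with_label(docs, index):
    """Mutual information between the token at `index` and the label."""
    return mutual_information([(doc[index], doc[2]) for doc in docs])


mi_spurious_before = mi_with_label(original, 1)
mi_spurious_after = mi_with_label(augmented, 1)
mi_causal_after = mi_with_label(augmented, 0)

print(f"I(spurious; label) before augmentation: {mi_spurious_before:.3f} bits")
print(f"I(spurious; label) after augmentation:  {mi_spurious_after:.3f} bits")
print(f"I(causal; label) after augmentation:    {mi_causal_after:.3f} bits")
```

In this idealized setting every document receives a perfect counterfactual, so the spurious token becomes independent of the label while the causal token stays fully informative. Real generated counterfactuals are imperfect, which is why the abstract speaks of dataset properties under which the approach would succeed.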

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Journal reference:	Advances in Neural Information Processing Systems 2021
Cite as:	arXiv:2201.05177 [cs.CL]
	(or arXiv:2201.05177v1 [cs.CL] for this version)

Submission history

From: Mitchell Plyler
[v1] Thu, 13 Jan 2022 19:05:02 GMT (1419kb)
