AutoCAD: Automatically Generating Counterfactuals for Mitigating Shortcut Learning

Wen, Jiaxin; Zhu, Yeshuang; Zhang, Jinchao; Zhou, Jie; Huang, Minlie

Full-text links:

Download:

Current browse context:

cs.AI

< prev | next >

new | recent | 2211

Computer Science > Artificial Intelligence

Title: AutoCAD: Automatically Generating Counterfactuals for Mitigating Shortcut Learning

Authors: Jiaxin Wen, Yeshuang Zhu, Jinchao Zhang, Jie Zhou, Minlie Huang

(Submitted on 29 Nov 2022)

Abstract: Recent studies have shown the impressive efficacy of counterfactually augmented data (CAD) for reducing NLU models' reliance on spurious features and improving their generalizability. However, current methods still heavily rely on human efforts or task-specific designs to generate counterfactuals, thereby impeding CAD's applicability to a broad range of NLU tasks. In this paper, we present AutoCAD, a fully automatic and task-agnostic CAD generation framework. AutoCAD first leverages a classifier to unsupervisedly identify rationales as spans to be intervened, which disentangles spurious and causal features. Then, AutoCAD performs controllable generation enhanced by unlikelihood training to produce diverse counterfactuals. Extensive evaluations on multiple out-of-domain and challenge benchmarks demonstrate that AutoCAD consistently and significantly boosts the out-of-distribution performance of powerful pre-trained models across different NLU tasks, which is comparable or even better than previous state-of-the-art human-in-the-loop or task-specific CAD methods. The code is publicly available at this https URL

Comments:	Accepted by EMNLP 2022 findings
Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2211.16202 [cs.AI]
	(or arXiv:2211.16202v1 [cs.AI] for this version)

Submission history

From: Jiaxin Wen [view email]
[v1] Tue, 29 Nov 2022 13:39:53 GMT (1835kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2211.16202

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Artificial Intelligence

Title: AutoCAD: Automatically Generating Counterfactuals for Mitigating Shortcut Learning

Submission history