Prompt Consistency for Zero-Shot Task Generalization

Zhou, Chunting; He, Junxian; Ma, Xuezhe; Berg-Kirkpatrick, Taylor; Neubig, Graham

(Submitted on 29 Apr 2022 (v1), last revised 27 Dec 2022 (this version, v2))

Abstract: One of the most impressive results of recent NLP history is the ability of pre-trained language models to solve new tasks in a zero-shot setting. To achieve this, NLP tasks are framed as natural language prompts, generating a response indicating the predicted output. Nonetheless, the performance in such settings often lags far behind its supervised counterpart, suggesting a large space for potential improvement. In this paper, we explore methods to utilize unlabeled data to improve zero-shot performance. Specifically, we take advantage of the fact that multiple prompts can be used to specify a single task, and propose to regularize prompt consistency, encouraging consistent predictions over this diverse set of prompts. Our method makes it possible to fine-tune the model either with extra unlabeled training data, or directly on test input at inference time in an unsupervised manner. In experiments, our approach outperforms the state-of-the-art zero-shot learner, T0 (Sanh et al., 2022), on 9 out of 11 datasets across 4 NLP tasks by up to 10.6 absolute points in terms of accuracy. The gains are often attained with a small number of unlabeled examples.
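The idea of regularizing prompt consistency can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not state the loss, so the sketch assumes, purely for illustration, that consistency is measured as the average pairwise KL divergence between the predictive distributions one model produces for the same unlabeled input under different prompts. The names `softmax`, `kl_divergence`, and `prompt_consistency_loss` are hypothetical, and the logits are made-up numbers rather than model outputs.

```python
import math
from itertools import permutations


def softmax(logits):
    """Convert a list of raw scores into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions over the same labels."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def prompt_consistency_loss(logits_per_prompt):
    """Average KL divergence over all ordered pairs of prompts.

    logits_per_prompt holds one list of label scores per prompt, all for
    the same unlabeled input. The pairwise-KL form is an assumption made
    for this sketch; the abstract does not specify the loss.
    """
    probs = [softmax(scores) for scores in logits_per_prompt]
    pairs = list(permutations(range(len(probs)), 2))
    if not pairs:
        return 0.0  # a single prompt cannot disagree with itself
    return sum(kl_divergence(probs[i], probs[j]) for i, j in pairs) / len(pairs)


# Illustrative (made-up) label scores for one input under three prompts.
agree = [[2.0, 0.1], [2.0, 0.1], [2.0, 0.1]]
disagree = [[2.0, 0.1], [0.1, 2.0], [1.0, 1.0]]

print(prompt_consistency_loss(agree))     # identical predictions: no penalty
print(prompt_consistency_loss(disagree))  # conflicting predictions: positive penalty
```

No labels appear anywhere in this quantity, which is why a penalty of this kind could in principle be minimized on unlabeled training data or directly on test inputs, as the abstract describes.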

Comments:	EMNLP 2022 Findings. Code is available at this https URL
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2205.00049 [cs.CL]
	(or arXiv:2205.00049v2 [cs.CL] for this version)

Submission history

From: Junxian He
[v1] Fri, 29 Apr 2022 19:18:37 GMT (1076kb,D)
[v2] Tue, 27 Dec 2022 03:59:51 GMT (1078kb,D)
