ZeroGen: Efficient Zero-shot Learning via Dataset Generation

Ye, Jiacheng; Gao, Jiahui; Li, Qintong; Xu, Hang; Feng, Jiangtao; Wu, Zhiyong; Yu, Tao; Kong, Lingpeng

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2202

Computer Science > Computation and Language

Title: ZeroGen: Efficient Zero-shot Learning via Dataset Generation

Authors: Jiacheng Ye, Jiahui Gao, Qintong Li, Hang Xu, Jiangtao Feng, Zhiyong Wu, Tao Yu, Lingpeng Kong

(Submitted on 16 Feb 2022 (v1), last revised 22 Oct 2022 (this version, v2))

Abstract: There is a growing interest in dataset generation recently due to the superior generative capacity of large pre-trained language models (PLMs). In this paper, we study a flexible and efficient zero-short learning method, \textsc{ZeroGen}. Given a zero-shot task, we first generate a dataset from scratch using PLMs in an unsupervised manner. Then, we train a tiny task model (e.g., LSTM) under the supervision of the synthesized dataset. This approach allows highly efficient inference as the final task model only has orders of magnitude fewer parameters comparing to PLMs (e.g., GPT2-XL). Apart from being annotation-free and efficient, we argue that \textsc{ZeroGen} can also provide useful insights from the perspective of data-free model-agnostic knowledge distillation, and unreferenced text generation evaluation. Experiments and analysis on different NLP tasks, namely, text classification, question answering, and natural language inference, show the effectiveness of \textsc{ZeroGen}.

Comments:	Accepted by EMNLP 2022 (Main Conference)
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2202.07922 [cs.CL]
	(or arXiv:2202.07922v2 [cs.CL] for this version)

Submission history

From: Jiacheng Ye [view email]
[v1] Wed, 16 Feb 2022 08:18:02 GMT (1142kb,D)
[v2] Sat, 22 Oct 2022 01:32:03 GMT (1229kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2202.07922

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: ZeroGen: Efficient Zero-shot Learning via Dataset Generation

Submission history