MetaPrompting: Learning to Learn Better Prompts

Hou, Yutai; Dong, Hongyuan; Wang, Xinghao; Li, Bohan; Che, Wanxiang

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2209

Change to browse by:

Computer Science > Computation and Language

Title: MetaPrompting: Learning to Learn Better Prompts

Authors: Yutai Hou, Hongyuan Dong, Xinghao Wang, Bohan Li, Wanxiang Che

(Submitted on 23 Sep 2022 (v1), last revised 3 Feb 2023 (this version, v4))

Abstract: Prompting method is regarded as one of the crucial progress for few-shot nature language processing. Recent research on prompting moves from discrete tokens based ``hard prompts'' to continuous ``soft prompts'', which employ learnable vectors as pseudo prompt tokens and achieve better performance. Though showing promising prospects, these soft-prompting methods are observed to rely heavily on good initialization to take effect. Unfortunately, obtaining a perfect initialization for soft prompts requires understanding of inner language models working and elaborate design, which is no easy task and has to restart from scratch for each new task. To remedy this, we propose a generalized soft prompting method called MetaPrompting, which adopts the well-recognized model-agnostic meta-learning algorithm to automatically find better prompt initialization that facilitates fast adaptation to new prompting tasks.Extensive experiments show MetaPrompting tackles soft prompt initialization problem and brings significant improvement on four different datasets (over 6 points improvement in accuracy for 1-shot setting), achieving new state-of-the-art performance.

Comments:	Accepted as COLING 2022 long paper
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2209.11486 [cs.CL]
	(or arXiv:2209.11486v4 [cs.CL] for this version)

Submission history

From: Hongyuan Dong [view email]
[v1] Fri, 23 Sep 2022 09:01:05 GMT (1301kb,D)
[v2] Tue, 27 Sep 2022 02:30:05 GMT (1302kb,D)
[v3] Thu, 13 Oct 2022 06:36:48 GMT (1301kb,D)
[v4] Fri, 3 Feb 2023 12:47:29 GMT (1302kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2209.11486

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: MetaPrompting: Learning to Learn Better Prompts

Submission history