Meta-augmented Prompt Tuning for Better Few-shot Learning

Pan, Kaihang; Li, Juncheng; Song, Hongye; Lin, Jun; Liu, Xiaozhong; Tang, Siliang

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2303

Computer Science > Computation and Language

Title: Meta-augmented Prompt Tuning for Better Few-shot Learning

Authors: Kaihang Pan, Juncheng Li, Hongye Song, Jun Lin, Xiaozhong Liu, Siliang Tang

(Submitted on 22 Mar 2023 (v1), revised 28 Mar 2023 (this version, v2), latest version 23 Oct 2023 (v4))

Abstract: Prompt tuning is a parameter-efficient method, which freezes all PLM parameters and only prepends some additional tunable tokens called soft prompts to the input text. However, soft prompts heavily rely on a better initialization and may easily result in overfitting under few-shot settings, which causes prompt-tuning performing much worse than fine-tuning. To address the above issues, this paper proposes a novel Self-sUpervised Meta-prompt learning framework with MEtagradient Regularization for few shot generalization (SUMMER). We leverage self-supervised meta-learning to better initialize soft prompts and curriculum-based task augmentation is further proposed to enrich the meta-task distribution. Besides, a novel meta-gradient regularization method is integrated into the meta-prompt learning framework, which meta-learns to transform the raw gradient during few-shot learning into a domain-generalizable direction, thus alleviating the problem of overfitting. Extensive experiments show that SUMMER achieves better performance for different few-shot downstream tasks, and also exhibits a stronger domain generalization ability.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2303.12314 [cs.CL]
	(or arXiv:2303.12314v2 [cs.CL] for this version)

Submission history

From: Kaihang Pan [view email]
[v1] Wed, 22 Mar 2023 05:04:21 GMT (7022kb,D)
[v2] Tue, 28 Mar 2023 13:56:07 GMT (7021kb,D)
[v3] Sun, 21 May 2023 07:18:54 GMT (7021kb,D)
[v4] Mon, 23 Oct 2023 12:43:35 GMT (268kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2303.12314v2

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Meta-augmented Prompt Tuning for Better Few-shot Learning

Submission history