Match-Prompt: Improving Multi-task Generalization Ability for Neural Text Matching via Prompt Learning

Xu, Shicheng; Pang, Liang; Shen, Huawei; Cheng, Xueqi

doi:10.1145/3511808.3557388

(Submitted on 6 Apr 2022 (v1), last revised 20 Aug 2022 (this version, v2))

Abstract: Text matching is a fundamental technique in both information retrieval and natural language processing. Text matching tasks share the same paradigm: determining the relationship between two given texts. The relationship varies from task to task, e.g., relevance in document retrieval, semantic alignment in paraphrase identification, and answerability judgment in question answering. The essential signals for text matching, however, remain within a finite scope, i.e., exact matching, semantic matching, and inference matching. Ideally, a good text matching model can learn to capture and aggregate these signals for different matching tasks and achieve competitive performance on each. In practice, recent state-of-the-art text matching models, e.g., Pre-trained Language Models (PLMs), generalize poorly. This is because end-to-end supervised learning on a task-specific dataset makes the model overemphasize data sample bias and task-specific signals instead of the essential matching signals. To overcome this problem, we adopt a specialization-generalization training strategy and refer to it as Match-Prompt. In the specialization stage, descriptions of different matching tasks are mapped to a few prompt tokens. In the generalization stage, the matching model explores the essential matching signals by being trained on diverse matching tasks. Highly diverse matching tasks keep the model from fitting the data bias of any specific task, so that it can focus on learning the essential matching signals. Meanwhile, the prompt tokens obtained in the specialization stage help the model distinguish different task-specific matching signals. Experimental results on public datasets show that Match-Prompt improves the multi-task generalization capability of PLMs in text matching and yields better in-domain multi-task, out-of-domain multi-task, and new-task adaptation performance than multi-task and task-specific models trained with the previous fine-tuning paradigm.
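
The data flow described in the abstract (task-specific prompt tokens combined with text pairs, then training on a mixture of tasks) can be illustrated with a minimal sketch. This is not the authors' implementation: the prompt token strings, the `[SEP]` input layout, and the helper names `build_input` and `mixed_task_batches` are all assumptions made for illustration, and the learning of the prompt tokens in the specialization stage is not shown.

```python
import random
from typing import Dict, List, Tuple

# Hypothetical placeholders for the prompt tokens of each task.
# In the abstract these come from the specialization stage; here they
# are fixed strings chosen only for illustration.
TASK_PROMPTS: Dict[str, List[str]] = {
    "document_retrieval": ["[DR_1]", "[DR_2]"],
    "paraphrase_identification": ["[PI_1]", "[PI_2]"],
    "question_answering": ["[QA_1]", "[QA_2]"],
}

Example = Tuple[str, str, int]  # (text_a, text_b, label)


def build_input(task: str, text_a: str, text_b: str) -> str:
    """Prepend the task's prompt tokens to a text pair (assumed layout)."""
    prompt = " ".join(TASK_PROMPTS[task])
    return f"{prompt} [SEP] {text_a} [SEP] {text_b}"


def mixed_task_batches(
    datasets: Dict[str, List[Example]], batch_size: int, seed: int = 0
) -> List[List[Tuple[str, int]]]:
    """Pool examples from all tasks, shuffle them, and split into batches,
    so that each batch can contain a mixture of matching tasks."""
    rng = random.Random(seed)
    pool = [
        (build_input(task, text_a, text_b), label)
        for task, examples in datasets.items()
        for text_a, text_b, label in examples
    ]
    rng.shuffle(pool)
    return [pool[i:i + batch_size] for i in range(0, len(pool), batch_size)]
```

Each batch produced this way would be fed to a single shared matching model; the prompt tokens at the front of every input are what tell that model which task the pair belongs to.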

Comments:	Accepted by CIKM 2022
Subjects:	Information Retrieval (cs.IR); Computation and Language (cs.CL)
DOI:	10.1145/3511808.3557388
Cite as:	arXiv:2204.02725 [cs.IR]
	(or arXiv:2204.02725v2 [cs.IR] for this version)

Submission history

From: Shicheng Xu
[v1] Wed, 6 Apr 2022 11:01:08 GMT (3452kb,D)
[v2] Sat, 20 Aug 2022 02:08:02 GMT (2345kb,D)
