KPT: Keyword-guided Pre-training for Grounded Dialog Generation

Zhu, Qi; Mi, Fei; Zhang, Zheng; Wang, Yasheng; Li, Yitong; Jiang, Xin; Liu, Qun; Zhu, Xiaoyan; Huang, Minlie

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2212

Change to browse by:

Computer Science > Computation and Language

Title: KPT: Keyword-guided Pre-training for Grounded Dialog Generation

Authors: Qi Zhu, Fei Mi, Zheng Zhang, Yasheng Wang, Yitong Li, Xin Jiang, Qun Liu, Xiaoyan Zhu, Minlie Huang

(Submitted on 4 Dec 2022)

Abstract: Incorporating external knowledge into the response generation process is essential to building more helpful and reliable dialog agents. However, collecting knowledge-grounded conversations is often costly, calling for a better pre-trained model for grounded dialog generation that generalizes well w.r.t. different types of knowledge. In this work, we propose KPT (Keyword-guided Pre-Training), a novel self-supervised pre-training method for grounded dialog generation without relying on extra knowledge annotation. Specifically, we use a pre-trained language model to extract the most uncertain tokens in the dialog as keywords. With these keywords, we construct two kinds of knowledge and pre-train a knowledge-grounded response generation model, aiming at handling two different scenarios: (1) the knowledge should be faithfully grounded; (2) it can be selectively used. For the former, the grounding knowledge consists of keywords extracted from the response. For the latter, the grounding knowledge is additionally augmented with keywords extracted from other utterances in the same dialog. Since the knowledge is extracted from the dialog itself, KPT can be easily performed on a large volume and variety of dialogue data. We considered three data sources (open-domain, task-oriented, conversational QA) with a total of 2.5M dialogues. We conduct extensive experiments on various few-shot knowledge-grounded generation tasks, including grounding on dialog acts, knowledge graphs, persona descriptions, and Wikipedia passages. Our comprehensive experiments and analyses demonstrate that KPT consistently outperforms state-of-the-art methods on these tasks with diverse grounding knowledge.

Comments:	Accepted by AAAI 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2212.01739 [cs.CL]
	(or arXiv:2212.01739v1 [cs.CL] for this version)

Submission history

From: Qi Zhu [view email]
[v1] Sun, 4 Dec 2022 04:05:01 GMT (21510kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2212.01739

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: KPT: Keyword-guided Pre-training for Grounded Dialog Generation

Submission history