KgPLM: Knowledge-guided Language Model Pre-training via Generative and Discriminative Learning

He, Bin; Jiang, Xin; Xiao, Jinghui; Liu, Qun

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2012

Computer Science > Computation and Language

Title: KgPLM: Knowledge-guided Language Model Pre-training via Generative and Discriminative Learning

Authors: Bin He, Xin Jiang, Jinghui Xiao, Qun Liu

(Submitted on 7 Dec 2020)

Abstract: Recent studies on pre-trained language models have demonstrated their ability to capture factual knowledge and applications in knowledge-aware downstream tasks. In this work, we present a language model pre-training framework guided by factual knowledge completion and verification, and use the generative and discriminative approaches cooperatively to learn the model. Particularly, we investigate two learning schemes, named two-tower scheme and pipeline scheme, in training the generator and discriminator with shared parameter. Experimental results on LAMA, a set of zero-shot cloze-style question answering tasks, show that our model contains richer factual knowledge than the conventional pre-trained language models. Furthermore, when fine-tuned and evaluated on the MRQA shared tasks which consists of several machine reading comprehension datasets, our model achieves the state-of-the-art performance, and gains large improvements on NewsQA (+1.26 F1) and TriviaQA (+1.56 F1) over RoBERTa.

Comments:	10 pages, 3 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2012.03551 [cs.CL]
	(or arXiv:2012.03551v1 [cs.CL] for this version)

Submission history

From: Bin He [view email]
[v1] Mon, 7 Dec 2020 09:39:25 GMT (1135kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2012.03551

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: KgPLM: Knowledge-guided Language Model Pre-training via Generative and Discriminative Learning

Submission history