A Generative Approach for Script Event Prediction via Contrastive Fine-tuning

Zhu, Fangqi; Gao, Jun; Yu, Changlong; Wang, Wei; Xu, Chen; Mu, Xin; Yang, Min; Xu, Ruifeng

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2212

Change to browse by:

Computer Science > Computation and Language

Title: A Generative Approach for Script Event Prediction via Contrastive Fine-tuning

Authors: Fangqi Zhu, Jun Gao, Changlong Yu, Wei Wang, Chen Xu, Xin Mu, Min Yang, Ruifeng Xu

(Submitted on 7 Dec 2022 (v1), last revised 9 Dec 2022 (this version, v3))

Abstract: Script event prediction aims to predict the subsequent event given the context. This requires the capability to infer the correlations between events. Recent works have attempted to improve event correlation reasoning by using pretrained language models and incorporating external knowledge~(e.g., discourse relations). Though promising results have been achieved, some challenges still remain. First, the pretrained language models adopted by current works ignore event-level knowledge, resulting in an inability to capture the correlations between events well. Second, modeling correlations between events with discourse relations is limited because it can only capture explicit correlations between events with discourse markers, and cannot capture many implicit correlations. To this end, we propose a novel generative approach for this task, in which a pretrained language model is fine-tuned with an event-centric pretraining objective and predicts the next event within a generative paradigm. Specifically, we first introduce a novel event-level blank infilling strategy as the learning objective to inject event-level knowledge into the pretrained language model, and then design a likelihood-based contrastive loss for fine-tuning the generative model. Instead of using an additional prediction layer, we perform prediction by using sequence likelihoods generated by the generative model. Our approach models correlations between events in a soft way without any external knowledge. The likelihood-based prediction eliminates the need to use additional networks to make predictions and is somewhat interpretable since it scores each word in the event. Experimental results on the multi-choice narrative cloze~(MCNC) task demonstrate that our approach achieves better results than other state-of-the-art baselines. Our code will be available at this https URL

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2212.03496 [cs.CL]
	(or arXiv:2212.03496v3 [cs.CL] for this version)

Submission history

From: Fangqi Zhu [view email]
[v1] Wed, 7 Dec 2022 07:32:47 GMT (542kb,D)
[v2] Thu, 8 Dec 2022 02:42:03 GMT (542kb,D)
[v3] Fri, 9 Dec 2022 06:34:26 GMT (542kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2212.03496

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: A Generative Approach for Script Event Prediction via Contrastive Fine-tuning

Submission history