Joint Generator-Ranker Learning for Natural Language Generation

Shen, Weizhou; Gong, Yeyun; Shen, Yelong; Wang, Song; Quan, Xiaojun; Duan, Nan; Chen, Weizhu

(Submitted on 28 Jun 2022 (this version), latest version 28 May 2023 (v3))

Abstract: Due to exposure bias, most existing natural language generation (NLG) models trained by maximizing the likelihood objective predict poor text results during the inference stage. In this paper, to tackle this problem, we revisit the generate-then-rank framework and propose a joint generator-ranker (JGR) training algorithm for text generation tasks. In JGR, the generator model is trained by maximizing two objectives: the likelihood of the training corpus and the expected reward given by the ranker model. Meanwhile, the ranker model takes input samples from the generator model and learns to distinguish good samples from the generation pool. The generator and ranker models are alternately optimized till convergence. In the empirical study, the proposed JGR model achieves new state-of-the-art performance on five public benchmarks covering three popular generation tasks: summarization, question generation, and response generation. We will make code, data, and models available at this https URL
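
The alternating scheme described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a single input whose possible outputs are a fixed set of candidates, replaces both neural models with one learnable number per candidate, and computes the expected-reward gradient exactly instead of estimating it from samples. The names `train_jgr_toy`, `metric`, `pool_size`, and `lr` are hypothetical and chosen only for illustration.

```python
import math
import random


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def train_jgr_toy(metric, reference, rounds=200, lr=0.5, pool_size=3, seed=0):
    """Alternate ranker and generator updates on one toy input.

    metric[k]  -- assumed quality of candidate k against the reference
    reference  -- index of the ground-truth candidate
    Returns (generator probabilities, ranker scores).
    """
    rng = random.Random(seed)
    n = len(metric)
    gen_logits = [0.0] * n   # toy generator: categorical distribution over candidates
    rank_scores = [0.0] * n  # toy ranker: one learnable score per candidate

    for _ in range(rounds):
        # Ranker step: sample a pool from the generator, then take one
        # cross-entropy step that raises the best-by-metric sample in the pool.
        probs = softmax(gen_logits)
        pool = sorted(set(rng.choices(range(n), weights=probs, k=pool_size)))
        best = max(pool, key=lambda k: metric[k])
        pool_probs = softmax([rank_scores[k] for k in pool])
        for k, q in zip(pool, pool_probs):
            target = 1.0 if k == best else 0.0
            rank_scores[k] -= lr * (q - target)

        # Generator step: gradient ascent on
        #   log p(reference) + E_{k ~ p}[reward(k)],
        # with the ranker's normalised scores used as a fixed reward.
        probs = softmax(gen_logits)
        reward = softmax(rank_scores)
        expected = sum(p * r for p, r in zip(probs, reward))
        for k in range(n):
            mle_grad = (1.0 if k == reference else 0.0) - probs[k]
            reward_grad = probs[k] * (reward[k] - expected)
            gen_logits[k] += lr * (mle_grad + reward_grad)

    return softmax(gen_logits), rank_scores


if __name__ == "__main__":
    probs, scores = train_jgr_toy(metric=[0.2, 1.0, 0.6, 0.1], reference=1)
    print("generator probabilities:", probs)
    print("ranker scores:", scores)
```

In this sketch the two gradient terms of the generator step correspond to the two objectives named in the abstract (likelihood and expected ranker reward), and the ranker only ever sees candidates drawn from the current generator. How the paper weights the two objectives, builds the ranker's training signal, or estimates the reward gradient is not stated in the abstract, so those choices here are assumptions.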

Comments: In progress
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2206.13974 [cs.CL] (or arXiv:2206.13974v1 [cs.CL] for this version)

Submission history

From: Weizhou Shen
[v1] Tue, 28 Jun 2022 12:58:30 GMT (524kb,D)
[v2] Wed, 19 Oct 2022 05:39:11 GMT (533kb,D)
[v3] Sun, 28 May 2023 13:51:09 GMT (8336kb,D)