We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Facts2Story: Controlling Text Generation by Key Facts

Authors: Eyal Orbach (Bar Ilan University), Yoav Goldberg (Bar Ilan University and Allen Institute for Artificial Intelligence)
Abstract: Recent advancements in self-attention neural network architectures have raised the bar for open-ended text generation. Yet, while current methods are capable of producing a coherent text which is several hundred words long, attaining control over the content that is being generated -- as well as evaluating it -- are still open questions. We propose a controlled generation task which is based on expanding a sequence of facts, expressed in natural language, into a longer narrative. We introduce human-based evaluation metrics for this task, as well as a method for deriving a large training dataset. We evaluate three methods on this task, based on fine-tuning pre-trained models. We show that while auto-regressive, unidirectional Language Models such as GPT2 produce better fluency, they struggle to adhere to the requested facts. We propose a plan-and-cloze model (using fine-tuned XLNet) which produces competitive fluency while adhering to the requested content.
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2012.04332 [cs.CL]
  (or arXiv:2012.04332v1 [cs.CL] for this version)

Submission history

From: Eyal Orbach [view email]
[v1] Tue, 8 Dec 2020 10:14:29 GMT (35kb)

Link back to: arXiv, form interface, contact.