Title: MReD: A Meta-Review Dataset for Controllable Text Generation

Abstract: When existing text generation datasets are used directly for controllable generation, the lack of domain knowledge limits the aspects that can be controlled. A typical example is the CNN/Daily Mail dataset: when it is used for controllable text summarization, there is no guiding information on the emphasis of summary sentences. A more useful text generator should leverage both the input text and control variables to guide generation, and such a generator can only be built with a deep understanding of the domain. Motivated by this vision, our paper introduces a new text generation dataset named MReD. The dataset consists of 7,089 meta-reviews, and all 45k of their sentences are manually annotated with one of 9 carefully defined categories, including abstract, strength, decision, etc. We present experimental results on state-of-the-art summarization models and propose methods for controlled generation on both extractive and abstractive models using our annotated data. By exploring various settings and analysing model behavior with respect to the control inputs, we demonstrate the challenges and value of our dataset. MReD gives us a better understanding of meta-review corpora and broadens the research space for controllable text generation.
Comments: 15 pages, 8 figures
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2110.07474 [cs.CL]
  (or arXiv:2110.07474v1 [cs.CL] for this version)

Submission history

From: Liying Cheng
[v1] Thu, 14 Oct 2021 15:48:03 GMT (2223kb,D)
[v2] Thu, 24 Mar 2022 11:36:37 GMT (4569kb,D)
[v3] Mon, 28 Mar 2022 07:02:45 GMT (4568kb,D)
[v4] Mon, 4 Apr 2022 09:47:08 GMT (4568kb,D)
[v5] Mon, 11 Apr 2022 04:07:34 GMT (4568kb,D)
[v6] Tue, 5 Jul 2022 07:43:44 GMT (3137kb,D)
