Title: NarraSum: A Large-Scale Dataset for Abstractive Narrative Summarization

Abstract: Narrative summarization aims to produce a distilled version of a narrative to describe its most salient events and characters. Summarizing a narrative is challenging as it requires an understanding of event causality and character behaviors. To encourage research in this direction, we propose NarraSum, a large-scale narrative summarization dataset. It contains 122K narrative documents, which are collected from plot descriptions of movies and TV episodes with diverse genres, and their corresponding abstractive summaries. Experiments show that there is a large performance gap between humans and the state-of-the-art summarization models on NarraSum. We hope that this dataset will promote future research in summarization, as well as broader studies of natural language understanding and generation. The dataset is available at this https URL
Comments: EMNLP Findings 2022
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2212.01476 [cs.CL]
  (or arXiv:2212.01476v2 [cs.CL] for this version)

Submission history

From: Chao Zhao
[v1] Fri, 2 Dec 2022 22:51:51 GMT (469kb,D)
[v2] Wed, 28 Jun 2023 04:08:20 GMT (469kb,D)