Title: UniSumm: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning

Abstract: The diverse demands of different summarization tasks and their high annotation costs are driving a need for few-shot summarization. However, despite the emergence of many summarization tasks and datasets, the current training paradigm for few-shot summarization systems ignores potentially shareable knowledge in heterogeneous datasets. To this end, we propose UniSumm, a unified few-shot summarization model that is pre-trained on multiple summarization tasks and can be prefix-tuned to excel at any few-shot summarization dataset. Meanwhile, to better evaluate few-shot summarization systems, we assemble and make public a new benchmark, SummZoo, built on the principles of diversity and robustness. It consists of 8 diverse summarization tasks with multiple sets of few-shot samples for each task, covering both monologue and dialogue domains. Experimental results and ablation studies show that UniSumm outperforms strong baseline systems by a large margin across all tasks in SummZoo under both automatic and human evaluations. We release our code and benchmark at this https URL.
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2211.09783 [cs.CL]
  (or arXiv:2211.09783v5 [cs.CL] for this version)

Submission history

From: Yulong Chen
[v1] Thu, 17 Nov 2022 18:54:47 GMT (1993kb,D)
[v2] Mon, 21 Nov 2022 15:16:40 GMT (2176kb,D)
[v3] Tue, 6 Dec 2022 08:54:22 GMT (2177kb,D)
[v4] Tue, 13 Dec 2022 14:57:14 GMT (2177kb,D)
[v5] Mon, 19 Dec 2022 05:15:58 GMT (2177kb,D)
