We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computation and Language

Title: Measuring Attribution in Natural Language Generation Models

Abstract: With recent improvements in natural language generation (NLG) models for various applications, it has become imperative to have the means to identify and evaluate whether NLG output is only sharing verifiable information about the external world. In this work, we present a new evaluation framework entitled Attributable to Identified Sources (AIS) for assessing the output of natural language generation models, when such output pertains to the external world. We first define AIS and introduce a two-stage annotation pipeline for allowing annotators to appropriately evaluate model output according to AIS guidelines. We empirically validate this approach on generation datasets spanning three tasks (two conversational QA datasets, a summarization dataset, and a table-to-text dataset) via human evaluation studies that suggest that AIS could serve as a common framework for measuring whether model-generated statements are supported by underlying sources. We release guidelines for the human evaluation studies.
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2112.12870 [cs.CL]
  (or arXiv:2112.12870v2 [cs.CL] for this version)

Submission history

From: Hannah Rashkin [view email]
[v1] Thu, 23 Dec 2021 22:33:20 GMT (762kb,D)
[v2] Tue, 2 Aug 2022 20:40:20 GMT (6358kb,D)

Link back to: arXiv, form interface, contact.