Title: Revisiting text decomposition methods for NLI-based factuality scoring of summaries

Abstract: Scoring the factuality of a generated summary involves measuring the degree to which a target text contains factual information using the input document as support. Given the similarities in the problem formulation, previous work has shown that Natural Language Inference models can be effectively repurposed to perform this task. As these models are trained to score entailment at a sentence level, several recent studies have shown that decomposing either the input document or the summary into sentences helps with factuality scoring. But is fine-grained decomposition always a winning strategy? In this paper we systematically compare different granularities of decomposition -- from document to sub-sentence level, and we show that the answer is no. Our results show that incorporating additional context can yield improvement, but that this does not necessarily apply to all datasets. We also show that small changes to previously proposed entailment-based scoring methods can result in better performance, highlighting the need for caution in model and methodology selection for downstream tasks.
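The decomposition idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: `toy_entailment` is a hypothetical token-overlap stand-in for a real NLI model, the sentence splitter is deliberately naive, and the max-then-mean aggregation is only one assumed choice among several possible ones. The `decompose_document` flag switches between sentence-level and document-level premises.

```python
import re
from statistics import mean
from typing import Callable, List


def split_sentences(text: str) -> List[str]:
    """Naive sentence splitter (assumption): break after ., ! or ? plus whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def _tokens(text: str) -> set:
    """Lowercased word tokens as a set."""
    return set(re.findall(r"\w+", text.lower()))


def toy_entailment(premise: str, hypothesis: str) -> float:
    """Hypothetical stand-in for an NLI model.

    Returns the fraction of hypothesis tokens that also occur in the premise.
    A real system would return an entailment probability from a trained model.
    """
    hyp = _tokens(hypothesis)
    if not hyp:
        return 0.0
    return len(hyp & _tokens(premise)) / len(hyp)


def factuality_score(
    document: str,
    summary: str,
    entail: Callable[[str, str], float] = toy_entailment,
    decompose_document: bool = True,
) -> float:
    """Score a summary against a document at a chosen premise granularity.

    Each summary sentence takes its best entailment score over the premises
    (max), and the summary score is the mean over summary sentences.
    This aggregation is an assumption made for illustration.
    """
    premises = split_sentences(document) if decompose_document else [document]
    hypotheses = split_sentences(summary)
    if not premises or not hypotheses:
        return 0.0
    return mean(max(entail(p, h) for p in premises) for h in hypotheses)


if __name__ == "__main__":
    doc = "The cat sat on the mat. The dog barked loudly."
    summ = "The cat barked."
    print(factuality_score(doc, summ, decompose_document=True))
    print(factuality_score(doc, summ, decompose_document=False))
```

With the toy scorer, the same summary receives different scores at the two granularities, which is the kind of sensitivity the abstract refers to; swapping in an actual NLI model only requires passing a different `entail` callable.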
Comments: Generation, Evaluation & Metrics (GEM) Workshop 2022
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2211.16853 [cs.CL]
  (or arXiv:2211.16853v1 [cs.CL] for this version)

Submission history

From: John Glover
[v1] Wed, 30 Nov 2022 09:54:37 GMT (234kb,D)
