Improved Beam Search for Hallucination Mitigation in Abstractive Summarization

Sridhar, Arvind Krishna; Visser, Erik

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2212

Computer Science > Computation and Language

Title: Improved Beam Search for Hallucination Mitigation in Abstractive Summarization

Authors: Arvind Krishna Sridhar, Erik Visser

(Submitted on 6 Dec 2022 (v1), last revised 14 Nov 2023 (this version, v2))

Abstract: Advancement in large pretrained language models has significantly improved their performance for conditional language generation tasks including summarization albeit with hallucinations. To reduce hallucinations, conventional methods proposed improving beam search or using a fact checker as a postprocessing step. In this paper, we investigate the use of the Natural Language Inference (NLI) entailment metric to detect and prevent hallucinations in summary generation. We propose an NLI-assisted beam re-ranking mechanism by computing entailment probability scores between the input context and summarization model-generated beams during saliency-enhanced greedy decoding. Moreover, a diversity metric is introduced to compare its effectiveness against vanilla beam search. Our proposed algorithm significantly outperforms vanilla beam decoding on XSum and CNN/DM datasets.

Comments:	8 pages, 2 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2212.02712 [cs.CL]
	(or arXiv:2212.02712v2 [cs.CL] for this version)

Submission history

From: Arvind Krishna Sridhar [view email]
[v1] Tue, 6 Dec 2022 02:33:47 GMT (163kb,D)
[v2] Tue, 14 Nov 2023 17:12:36 GMT (221kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2212.02712

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Improved Beam Search for Hallucination Mitigation in Abstractive Summarization

Submission history