Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding

Zhao, Zheng; Monti, Emilio; Lehmann, Jens; Assem, Haytham

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2405

Computer Science > Computation and Language

Title: Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding

Authors: Zheng Zhao, Emilio Monti, Jens Lehmann, Haytham Assem

(Submitted on 4 May 2024)

Abstract: Large language models (LLMs) tend to inadequately integrate input context during text generation, relying excessively on encoded prior knowledge in model parameters, potentially resulting in generated text with factual inconsistencies or contextually unfaithful content. LLMs utilize two primary knowledge sources: 1) prior (parametric) knowledge from pretraining, and 2) contextual (non-parametric) knowledge from input prompts. The study addresses the open question of how LLMs effectively balance these knowledge sources during the generation process, specifically in the context of open-domain question answering. To address this issue, we introduce a novel approach integrating contrastive decoding with adversarial irrelevant passages as negative samples to enhance robust context grounding during generation. Notably, our method operates at inference time without requiring further training. We conduct comprehensive experiments to demonstrate its applicability and effectiveness, providing empirical evidence showcasing its superiority over existing methodologies. Our code is publicly available at: this https URL

Comments:	Accepted to NAACL 2024
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2405.02750 [cs.CL]
	(or arXiv:2405.02750v1 [cs.CL] for this version)

Submission history

From: Zheng Zhao [view email]
[v1] Sat, 4 May 2024 20:38:41 GMT (8359kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2405.02750

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Computer Science > Computation and Language

Title: Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding

Submission history