Complementary Explanations for Effective In-Context Learning

Ye, Xi; Iyer, Srinivasan; Celikyilmaz, Asli; Stoyanov, Ves; Durrett, Greg; Pasunuru, Ramakanth

(Submitted on 25 Nov 2022 (v1), last revised 12 Jun 2023 (this version, v2))

Abstract: Large language models (LLMs) have exhibited remarkable capabilities in learning from explanations in prompts, but there has been limited understanding of exactly how these explanations function or why they are effective. This work aims to better understand the mechanisms by which explanations are used for in-context learning. We first study the impact of two different factors on the performance of prompts with explanations: the computation trace (the way the solution is decomposed) and the natural language used to express the prompt. By perturbing explanations on three controlled tasks, we show that both factors contribute to the effectiveness of explanations. We further study how to form maximally effective sets of explanations for solving a given test query. We find that LLMs can benefit from the complementarity of the explanation set: diverse reasoning skills shown by different exemplars can lead to better performance. Therefore, we propose a maximal marginal relevance-based exemplar selection approach for constructing exemplar sets that are both relevant and complementary, which successfully improves in-context learning performance across three real-world tasks on multiple LLMs.
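
The exemplar selection idea in the abstract can be illustrated with a minimal sketch of generic maximal marginal relevance (MMR). The abstract does not give the scoring function, so this uses the standard MMR trade-off with cosine similarity, which may differ from the paper's actual setup. The names `mmr_select`, `cosine`, and `lam`, and the use of plain embedding vectors for the query and exemplars, are assumptions made for illustration only.

```python
import math
from typing import List, Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def mmr_select(
    query: Sequence[float],
    candidates: Sequence[Sequence[float]],
    k: int,
    lam: float = 0.5,
) -> List[int]:
    """Greedily pick up to k candidate indices by standard MMR.

    Each step picks the candidate maximising
        lam * sim(query, c) - (1 - lam) * max_{s in selected} sim(c, s),
    so lam = 1.0 is pure relevance ranking and smaller lam values
    penalise exemplars that resemble ones already chosen.
    """
    selected: List[int] = []
    remaining = list(range(len(candidates)))
    while remaining and len(selected) < k:

        def score(i: int) -> float:
            relevance = cosine(query, candidates[i])
            # Redundancy is 0 while nothing has been selected yet.
            redundancy = max(
                (cosine(candidates[i], candidates[j]) for j in selected),
                default=0.0,
            )
            return lam * relevance - (1 - lam) * redundancy

        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Hypothetical 2-d "embeddings": exemplars 0 and 1 are near-duplicates,
# exemplar 2 points in a different direction.
query = [1.0, 0.2]
exemplars = [[1.0, 0.0], [0.99, 0.1], [0.6, 0.8]]
balanced = mmr_select(query, exemplars, k=2, lam=0.5)
relevance_only = mmr_select(query, exemplars, k=2, lam=1.0)
```

With `lam=1.0` the selection reduces to a nearest-neighbour ranking, while a lower `lam` trades some relevance for an exemplar set whose members differ from one another, which is the complementarity property the abstract describes.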

Comments:	ACL Findings 2023 Camera-Ready
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2211.13892 [cs.CL]
	(or arXiv:2211.13892v2 [cs.CL] for this version)

Submission history

From: Xi Ye
[v1] Fri, 25 Nov 2022 04:40:47 GMT (290kb,D)
[v2] Mon, 12 Jun 2023 19:50:21 GMT (232kb,D)
