On the Relation between Sensitivity and Accuracy in In-context Learning

Chen, Yanda; Zhao, Chen; Yu, Zhou; McKeown, Kathleen; He, He

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2209

Computer Science > Computation and Language

Title: On the Relation between Sensitivity and Accuracy in In-context Learning

Authors: Yanda Chen, Chen Zhao, Zhou Yu, Kathleen McKeown, He He

(Submitted on 16 Sep 2022 (v1), last revised 27 Jan 2024 (this version, v3))

Abstract: In-context learning (ICL) suffers from oversensitivity to the prompt, making it unreliable in real-world scenarios. We study the sensitivity of ICL with respect to multiple perturbation types. First, we find that label bias obscures the true sensitivity, and therefore prior work may have significantly underestimated ICL sensitivity. Second, we observe a strong negative correlation between ICL sensitivity and accuracy: predictions sensitive to perturbations are less likely to be correct. Motivated by these findings, we propose \textsc{SenSel}, a few-shot selective prediction method that abstains from sensitive predictions. Experiments on ten classification datasets show that \textsc{SenSel} consistently outperforms two commonly used confidence-based and entropy-based baselines on abstention decisions.

Comments:	EMNLP 2023 camera-ready
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2209.07661 [cs.CL]
	(or arXiv:2209.07661v3 [cs.CL] for this version)

Submission history

From: Yanda Chen [view email]
[v1] Fri, 16 Sep 2022 00:52:34 GMT (6791kb,D)
[v2] Fri, 17 Feb 2023 23:45:47 GMT (335kb,D)
[v3] Sat, 27 Jan 2024 08:07:34 GMT (1568kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2209.07661

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: On the Relation between Sensitivity and Accuracy in In-context Learning

Submission history