Improving Few-Shot Performance of Language Models via Nearest Neighbor Calibration

Nie, Feng; Chen, Meixi; Zhang, Zhirui; Cheng, Xu

(Submitted on 5 Dec 2022)

Abstract: Pre-trained language models (PLMs) have exhibited remarkable few-shot learning capabilities when given a few examples in a natural language prompt as demonstrations for test instances, i.e., in-context learning. However, the performance of in-context learning is sensitive to the choice of prompt format, the training examples, and the ordering of those examples. In this paper, we propose a novel nearest-neighbor calibration framework for in-context learning to ease this issue. It is inspired by the observation that the in-context learning paradigm produces incorrect labels when inferring on training instances, which provides a useful supervised signal for calibrating predictions. Our method therefore directly augments the predictions with a $k$-nearest-neighbor ($k$NN) classifier over a datastore of cached few-shot instance representations obtained from PLMs, together with their corresponding labels. Adaptive neighbor selection and feature regularization modules are then introduced to make full use of the few support instances and reduce $k$NN retrieval noise. Experiments on various few-shot text classification tasks demonstrate that our method significantly improves in-context learning, and even achieves performance comparable to state-of-the-art tuning-based approaches on some sentiment analysis tasks.
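
The core idea in the abstract, augmenting a model's label probabilities with a kNN classifier over cached (representation, label) pairs, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the combination rule, so the linear interpolation weight `lam`, the squared Euclidean distance, the softmax `temperature`, and the function names `knn_distribution` and `calibrate` are all assumptions made for illustration. The adaptive neighbor selection and feature regularization modules are not modeled, and the toy two-dimensional vectors stand in for PLM representations.

```python
import math

def knn_distribution(query, datastore, num_labels, k=2, temperature=1.0):
    """Label distribution from the k nearest cached (vector, label) pairs.

    Hypothetical helper: distance metric and weighting are assumptions.
    """
    if not datastore:
        raise ValueError("datastore must contain at least one entry")
    # Squared Euclidean distance from the query to every cached vector,
    # sorted ascending and truncated to the k closest entries.
    scored = sorted(
        (sum((q - v) ** 2 for q, v in zip(query, vec)), label)
        for vec, label in datastore
    )[:k]
    # Softmax over negative distances; subtracting the smallest distance
    # keeps the exponentials numerically stable.
    nearest = scored[0][0]
    weights = [math.exp(-(dist - nearest) / temperature) for dist, _ in scored]
    total = sum(weights)
    probs = [0.0] * num_labels
    for weight, (_, label) in zip(weights, scored):
        probs[label] += weight / total
    return probs

def calibrate(lm_probs, knn_probs, lam=0.5):
    """Linearly interpolate model and kNN distributions (assumed rule)."""
    return [lam * kp + (1.0 - lam) * lp for lp, kp in zip(lm_probs, knn_probs)]

# Toy datastore: two cached support instances per label.
datastore = [([0.0, 0.0], 0), ([0.1, 0.0], 0), ([1.0, 1.0], 1), ([0.9, 1.0], 1)]
query = [0.95, 1.0]        # test instance lying close to the label-1 entries
lm_probs = [0.6, 0.4]      # in-context prediction that leans toward label 0

knn_probs = knn_distribution(query, datastore, num_labels=2, k=2)
calibrated = calibrate(lm_probs, knn_probs, lam=0.5)
print(calibrated)
```

In this toy setting both nearest neighbors carry label 1, so the interpolated distribution favors label 1 even though the uncalibrated prediction leaned toward label 0. In practice `lam` and `k` would need to be chosen on the few support instances.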

Comments:	Work in progress
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2212.02216 [cs.CL]
	(or arXiv:2212.02216v1 [cs.CL] for this version)

Submission history

From: Zhirui Zhang
[v1] Mon, 5 Dec 2022 12:49:41 GMT (7797kb,D)
