TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models

Davody, Ali; Adelani, David Ifeoluwa; Kleinbauer, Thomas; Klakow, Dietrich

Full-text links:

Download:

Computer Science > Computation and Language

Title: TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models

Authors: Ali Davody, David Ifeoluwa Adelani, Thomas Kleinbauer, Dietrich Klakow

(Submitted on 15 Jun 2022)

Abstract: Transferring knowledge from one domain to another is of practical importance for many tasks in natural language processing, especially when the amount of available data in the target domain is limited. In this work, we propose a novel few-shot approach to domain adaptation in the context of Named Entity Recognition (NER). We propose a two-step approach consisting of a variable base module and a template module that leverages the knowledge captured in pre-trained language models with the help of simple descriptive patterns. Our approach is simple yet versatile and can be applied in few-shot and zero-shot settings. Evaluating our lightweight approach across a number of different datasets shows that it can boost the performance of state-of-the-art baselines by 2-5% F1-score.

Comments:	Accepted to 25th International Conference on Text, Speech and Dialogue (TSD 2022)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2206.07841 [cs.CL]
	(or arXiv:2206.07841v1 [cs.CL] for this version)

Submission history

From: David Adelani [view email]
[v1] Wed, 15 Jun 2022 22:49:14 GMT (374kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2206.07841

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models

Submission history