We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computation and Language

Title: TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models

Abstract: Transferring knowledge from one domain to another is of practical importance for many tasks in natural language processing, especially when the amount of available data in the target domain is limited. In this work, we propose a novel few-shot approach to domain adaptation in the context of Named Entity Recognition (NER). We propose a two-step approach consisting of a variable base module and a template module that leverages the knowledge captured in pre-trained language models with the help of simple descriptive patterns. Our approach is simple yet versatile and can be applied in few-shot and zero-shot settings. Evaluating our lightweight approach across a number of different datasets shows that it can boost the performance of state-of-the-art baselines by 2-5% F1-score.
Comments: Accepted to 25th International Conference on Text, Speech and Dialogue (TSD 2022)
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2206.07841 [cs.CL]
  (or arXiv:2206.07841v1 [cs.CL] for this version)

Submission history

From: David Adelani [view email]
[v1] Wed, 15 Jun 2022 22:49:14 GMT (374kb,D)

Link back to: arXiv, form interface, contact.