Title: LightNER: A Lightweight Generative Framework with Prompt-guided Attention for Low-resource NER

Abstract: Named entity recognition (NER) in low-resource languages or domains suffers from inadequate training data. Existing transfer learning approaches for low-resource NER usually face the challenge that the target domain has a different label set from the resource-rich source domain, which gives rise to class transfer and domain transfer problems. In this paper, we propose LightNER, a lightweight generative framework with prompt-guided attention for low-resource NER, to address these issues. Concretely, instead of training label-specific discriminative classifiers, we reformulate sequence labeling as generating the entity pointer index sequence and entity categories, which addresses the class transfer issue. We further propose prompt-guided attention, which incorporates continuous prompts into the self-attention layer to re-modulate the attention and adapt the pre-trained weights. Note that we tune only these continuous prompts and keep all parameters of the pre-trained language model fixed, which makes our approach lightweight and flexible for low-resource scenarios and allows it to better transfer knowledge across domains. Experimental results show that by tuning only 0.16% of the parameters, LightNER obtains comparable performance in the standard setting and outperforms standard sequence labeling and prototype-based methods in low-resource settings.
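Illustrative sketch (not taken from the paper): the two ideas in the abstract can be pictured roughly as follows. The code assumes a target layout of (start, end, category) triples with category ids offset past the token positions, and assumes that the continuous prompts enter self-attention as extra key/value vectors prepended to the frozen ones, in the style of prefix tuning. The function names `linearize_entities` and `prompt_guided_attention` and both layouts are hypothetical; the actual LightNER formulation may differ.

```python
import math


def linearize_entities(entities, label_to_id, num_tokens):
    """Flatten (start, end, label) spans into one target sequence.

    Assumed layout (hypothetical): pointer indices use 0..num_tokens-1 and
    category ids are shifted by num_tokens, so a single output vocabulary
    covers both token pointers and entity categories.
    """
    target = []
    for start, end, label in sorted(entities):
        target.extend([start, end, num_tokens + label_to_id[label]])
    return target


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def prompt_guided_attention(query, keys, values, prompt_keys, prompt_values):
    """Single-query scaled dot-product attention with prepended prompts.

    Assumption: only prompt_keys and prompt_values would be trainable;
    keys and values come from the frozen pre-trained model.
    """
    all_keys = prompt_keys + keys
    all_values = prompt_values + values
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in all_keys]
    weights = softmax(scores)
    dim = len(all_values[0])
    output = [sum(w * v[d] for w, v in zip(weights, all_values)) for d in range(dim)]
    return output, weights


# Usage: "Barack Obama visited Paris" with a person span and a location span.
tokens = ["Barack", "Obama", "visited", "Paris"]
spans = [(3, 3, "LOC"), (0, 1, "PER")]
print(linearize_entities(spans, {"PER": 0, "LOC": 1}, len(tokens)))
# -> [0, 1, 4, 3, 3, 5]
```

Because categories are emitted as ordinary output symbols, a new label set changes only the label vocabulary rather than requiring a new classifier head, which is the intuition behind the class transfer claim. The prompt vectors shift the attention weights without modifying any frozen weight.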
Comments: Work in progress. arXiv admin note: text overlap with 2106.01760 by other authors
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as: arXiv:2109.00720 [cs.CL]
  (or arXiv:2109.00720v1 [cs.CL] for this version)

Submission history

From: Ningyu Zhang
[v1] Tue, 31 Aug 2021 15:01:49 GMT (5204kb,D)
[v2] Thu, 9 Sep 2021 15:59:20 GMT (2064kb,D)
[v3] Tue, 23 Aug 2022 16:24:26 GMT (490kb,D)
[v4] Wed, 31 Aug 2022 15:23:21 GMT (490kb,D)
[v5] Wed, 14 Sep 2022 15:47:37 GMT (491kb,D)
