We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: The Importance of Context in Very Low Resource Language Modeling

Abstract: This paper investigates very low resource language model pretraining, when less than 100 thousand sentences are available. We find that, in very low resource scenarios, statistical n-gram language models outperform state-of-the-art neural models. Our experiments show that this is mainly due to the focus of the former on a local context. As such, we introduce three methods to improve a neural model's performance in the low-resource setting, finding that limiting the model's self-attention is the most effective one, improving on downstream tasks such as NLI and POS tagging by up to 5% for the languages we test on: English, Hindi, and Turkish.
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2205.04810 [cs.CL]
  (or arXiv:2205.04810v1 [cs.CL] for this version)

Submission history

From: Lukas Edman [view email]
[v1] Tue, 10 May 2022 11:19:56 GMT (7106kb)

Link back to: arXiv, form interface, contact.