Title: A Comparison of Architectures and Pretraining Methods for Contextualized Multilingual Word Embeddings

Abstract: The lack of annotated data in many languages is a well-known challenge within the field of multilingual natural language processing (NLP). Therefore, many recent studies focus on zero-shot transfer learning and joint training across languages to overcome data scarcity for low-resource languages. In this work we (i) perform a comprehensive comparison of state-of-the-art multilingual word and sentence encoders on the tasks of named entity recognition (NER) and part-of-speech (POS) tagging; and (ii) propose a new method for creating multilingual contextualized word embeddings, compare it to multiple baselines, and show that it performs at or above state-of-the-art level in zero-shot transfer settings. Finally, we show that our method allows for better knowledge sharing across languages in a joint training setting.
Comments: 7 pages, 6 figures
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:1912.10169 [cs.CL]
  (or arXiv:1912.10169v1 [cs.CL] for this version)

Submission history

From: Niels Van Der Heijden
[v1] Sun, 15 Dec 2019 11:42:32 GMT (154kb,D)