We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: A Survey Of Cross-lingual Word Embedding Models

Abstract: Cross-lingual representations of words enable us to reason about word meaning in multilingual contexts and are a key facilitator of cross-lingual transfer when developing natural language processing models for low-resource languages. In this survey, we provide a comprehensive typology of cross-lingual word embedding models. We compare their data requirements and objective functions. The recurring theme of the survey is that many of the models presented in the literature optimize for the same objectives, and that seemingly different models are often equivalent modulo optimization strategies, hyper-parameters, and such. We also discuss the different ways cross-lingual word embeddings are evaluated, as well as future challenges and research horizons.
Comments: Published in Journal of Artificial Intelligence Research
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
Journal reference: JAIR 65 (2019) 569-631
DOI: 10.1613/jair.1.11640
Cite as: arXiv:1706.04902 [cs.CL]
  (or arXiv:1706.04902v4 [cs.CL] for this version)

Submission history

From: Sebastian Ruder [view email]
[v1] Thu, 15 Jun 2017 14:46:56 GMT (1827kb,D)
[v2] Wed, 18 Oct 2017 10:44:06 GMT (2430kb,D)
[v3] Thu, 30 May 2019 08:59:16 GMT (2623kb,D)
[v4] Sun, 6 Oct 2019 10:01:48 GMT (2666kb,D)

Link back to: arXiv, form interface, contact.