We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Multi-Adversarial Learning for Cross-Lingual Word Embeddings

Abstract: Generative adversarial networks (GANs) have succeeded in inducing cross-lingual word embeddings -- maps of matching words across languages -- without supervision. Despite these successes, GANs' performance for the difficult case of distant languages is still not satisfactory. These limitations have been explained by GANs' incorrect assumption that source and target embedding spaces are related by a single linear mapping and are approximately isomorphic. We assume instead that, especially across distant languages, the mapping is only piece-wise linear, and propose a multi-adversarial learning method. This novel method induces the seed cross-lingual dictionary through multiple mappings, each induced to fit the mapping for one subspace. Our experiments on unsupervised bilingual lexicon induction show that this method improves performance over previous single-mapping methods, especially for distant languages.
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2010.08432 [cs.CL]
  (or arXiv:2010.08432v2 [cs.CL] for this version)

Submission history

From: Haozhou Wang [view email]
[v1] Fri, 16 Oct 2020 14:54:28 GMT (239kb,D)
[v2] Wed, 25 Aug 2021 22:11:48 GMT (869kb,D)

Link back to: arXiv, form interface, contact.