We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism

Abstract: We propose multi-way, multilingual neural machine translation. The proposed approach enables a single neural translation model to translate between multiple languages, with a number of parameters that grows only linearly with the number of languages. This is made possible by having a single attention mechanism that is shared across all language pairs. We train the proposed multi-way, multilingual model on ten language pairs from WMT'15 simultaneously and observe clear performance improvements over models trained on only one language pair. In particular, we observe that the proposed model significantly improves the translation quality of low-resource language pairs.
Subjects: Computation and Language (cs.CL); Machine Learning (stat.ML)
Cite as: arXiv:1601.01073 [cs.CL]
  (or arXiv:1601.01073v1 [cs.CL] for this version)

Submission history

From: Orhan Firat [view email]
[v1] Wed, 6 Jan 2016 04:00:50 GMT (77kb,D)

Link back to: arXiv, form interface, contact.