We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Multilingual Machine Translation: Closing the Gap between Shared and Language-specific Encoder-Decoders

Abstract: State-of-the-art multilingual machine translation relies on a universal encoder-decoder, which requires retraining the entire system to add new languages. In this paper, we propose an alternative approach that is based on language-specific encoder-decoders, and can thus be more easily extended to new languages by learning their corresponding modules. So as to encourage a common interlingua representation, we simultaneously train the N initial languages. Our experiments show that the proposed approach outperforms the universal encoder-decoder by 3.28 BLEU points on average, and when adding new languages, without the need to retrain the rest of the modules. All in all, our work closes the gap between shared and language-specific encoder-decoders, advancing toward modular multilingual machine translation systems that can be flexibly extended in lifelong learning settings.
Subjects: Computation and Language (cs.CL)
ACM classes: I.2.7
Cite as: arXiv:2004.06575 [cs.CL]
  (or arXiv:2004.06575v1 [cs.CL] for this version)

Submission history

From: Marta R. Costa-jussà [view email]
[v1] Tue, 14 Apr 2020 15:02:24 GMT (24kb)

Link back to: arXiv, form interface, contact.