Title: Addressing word-order Divergence in Multilingual Neural Machine Translation for extremely Low Resource Languages

Abstract: Transfer learning approaches for Neural Machine Translation (NMT) train an NMT model on the assisting-target language pair (parent model), which is later fine-tuned for the source-target language pair of interest (child model), with the target language being the same. In many cases, the assisting language has a different word order from the source language. We show that divergent word order limits the benefits from transfer learning when little to no parallel corpus between the source and target language is available. To bridge this divergence, we propose to pre-order the assisting language sentences to match the word order of the source language and then train the parent model. Our experiments on many language pairs show that bridging the word order gap leads to significant improvement in translation quality.
Comments: Accepted as Short Paper at NAACL 2019
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:1811.00383 [cs.CL]
  (or arXiv:1811.00383v2 [cs.CL] for this version)

Submission history

From: Rudra Murthy V
[v1] Thu, 1 Nov 2018 13:53:27 GMT (561kb,D)
[v2] Wed, 10 Apr 2019 05:15:55 GMT (70kb,D)