Non-Autoregressive Machine Translation with Disentangled Context Transformer

Kasai, Jungo; Cross, James; Ghazvininejad, Marjan; Gu, Jiatao

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2001

Change to browse by:

Computer Science > Computation and Language

Title: Non-Autoregressive Machine Translation with Disentangled Context Transformer

Authors: Jungo Kasai, James Cross, Marjan Ghazvininejad, Jiatao Gu

(Submitted on 15 Jan 2020 (v1), last revised 30 Jun 2020 (this version, v2))

Abstract: State-of-the-art neural machine translation models generate a translation from left to right and every step is conditioned on the previously generated tokens. The sequential nature of this generation process causes fundamental latency in inference since we cannot generate multiple tokens in each sentence in parallel. We propose an attention-masking based model, called Disentangled Context (DisCo) transformer, that simultaneously generates all tokens given different contexts. The DisCo transformer is trained to predict every output token given an arbitrary subset of the other reference tokens. We also develop the parallel easy-first inference algorithm, which iteratively refines every token in parallel and reduces the number of required iterations. Our extensive experiments on 7 translation directions with varying data sizes demonstrate that our model achieves competitive, if not better, performance compared to the state of the art in non-autoregressive machine translation while significantly reducing decoding time on average. Our code is available at this https URL

Comments:	ICML 2020
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2001.05136 [cs.CL]
	(or arXiv:2001.05136v2 [cs.CL] for this version)

Submission history

From: Jungo Kasai [view email]
[v1] Wed, 15 Jan 2020 05:32:18 GMT (483kb,D)
[v2] Tue, 30 Jun 2020 07:31:11 GMT (430kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2001.05136

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Non-Autoregressive Machine Translation with Disentangled Context Transformer

Submission history