Generating Diverse Translation by Manipulating Multi-Head Attention

Sun, Zewei; Huang, Shujian; Wei, Hao-Ran; Dai, Xin-yu; Chen, Jiajun

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 1911

Change to browse by:

Computer Science > Computation and Language

Title: Generating Diverse Translation by Manipulating Multi-Head Attention

Authors: Zewei Sun, Shujian Huang, Hao-Ran Wei, Xin-yu Dai, Jiajun Chen

(Submitted on 21 Nov 2019)

Abstract: Transformer model has been widely used on machine translation tasks and obtained state-of-the-art results. In this paper, we report an interesting phenomenon in its encoder-decoder multi-head attention: different attention heads of the final decoder layer align to different word translation candidates. We empirically verify this discovery and propose a method to generate diverse translations by manipulating heads. Furthermore, we make use of these diverse translations with the back-translation technique for better data augmentation. Experiment results show that our method generates diverse translations without severe drop in translation quality. Experiments also show that back-translation with these diverse translations could bring significant improvement on performance on translation tasks. An auxiliary experiment of conversation response generation task proves the effect of diversity as well.

Comments:	Accepted by AAAI 2020
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1911.09333 [cs.CL]
	(or arXiv:1911.09333v1 [cs.CL] for this version)

Submission history

From: Zewei Sun [view email]
[v1] Thu, 21 Nov 2019 08:22:07 GMT (2066kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1911.09333

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Generating Diverse Translation by Manipulating Multi-Head Attention

Submission history