Towards Opening the Black Box of Neural Machine Translation: Source and Target Interpretations of the Transformer

Ferrando, Javier; Gállego, Gerard I.; Alastruey, Belen; Escolano, Carlos; Costa-jussà, Marta R.




(Submitted on 23 May 2022 (v1), last revised 4 Nov 2022 (this version, v2))

Abstract: In Neural Machine Translation (NMT), each token prediction is conditioned on the source sentence and the target prefix (what has been previously translated at a decoding step). However, previous work on interpretability in NMT has focused mainly on the attributions of source sentence tokens. Therefore, we lack a full understanding of the influence of every input token (source sentence and target prefix) on the model predictions. In this work, we propose an interpretability method that tracks input tokens' attributions for both contexts. Our method, which can be extended to any encoder-decoder Transformer-based model, allows us to better comprehend the inner workings of current NMT models. We apply the proposed method to both bilingual and multilingual Transformers and present insights into their behaviour.

Comments:	EMNLP 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2205.11631 [cs.CL]
	(or arXiv:2205.11631v2 [cs.CL] for this version)

Submission history

From: Javier Ferrando
[v1] Mon, 23 May 2022 20:59:14 GMT (7058kb,D)
[v2] Fri, 4 Nov 2022 21:40:53 GMT (8233kb,D)
