Gender Bias in Multilingual Neural Machine Translation: The Architecture Matters

Costa-jussà, Marta R.; Escolano, Carlos; Basta, Christine; Ferrando, Javier; Batlle, Roser; Kharitonova, Ksenia

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2012

Change to browse by:

Computer Science > Computation and Language

Title: Gender Bias in Multilingual Neural Machine Translation: The Architecture Matters

Authors: Marta R. Costa-jussà, Carlos Escolano, Christine Basta, Javier Ferrando, Roser Batlle, Ksenia Kharitonova

(Submitted on 24 Dec 2020)

Abstract: Multilingual Neural Machine Translation architectures mainly differ in the amount of sharing modules and parameters among languages. In this paper, and from an algorithmic perspective, we explore if the chosen architecture, when trained with the same data, influences the gender bias accuracy. Experiments in four language pairs show that Language-Specific encoders-decoders exhibit less bias than the Shared encoder-decoder architecture. Further interpretability analysis of source embeddings and the attention shows that, in the Language-Specific case, the embeddings encode more gender information, and its attention is more diverted. Both behaviors help in mitigating gender bias.

Comments:	12 pages, 5 figures, 3 tables
Subjects:	Computation and Language (cs.CL)
ACM classes:	I.2.7
Cite as:	arXiv:2012.13176 [cs.CL]
	(or arXiv:2012.13176v1 [cs.CL] for this version)

Submission history

From: Marta R. Costa-jussà [view email]
[v1] Thu, 24 Dec 2020 09:27:52 GMT (1143kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2012.13176

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Gender Bias in Multilingual Neural Machine Translation: The Architecture Matters

Submission history