We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Gender Bias in Multilingual Neural Machine Translation: The Architecture Matters

Abstract: Multilingual Neural Machine Translation architectures mainly differ in the amount of sharing modules and parameters among languages. In this paper, and from an algorithmic perspective, we explore if the chosen architecture, when trained with the same data, influences the gender bias accuracy. Experiments in four language pairs show that Language-Specific encoders-decoders exhibit less bias than the Shared encoder-decoder architecture. Further interpretability analysis of source embeddings and the attention shows that, in the Language-Specific case, the embeddings encode more gender information, and its attention is more diverted. Both behaviors help in mitigating gender bias.
Comments: 12 pages, 5 figures, 3 tables
Subjects: Computation and Language (cs.CL)
ACM classes: I.2.7
Cite as: arXiv:2012.13176 [cs.CL]
  (or arXiv:2012.13176v1 [cs.CL] for this version)

Submission history

From: Marta R. Costa-jussà [view email]
[v1] Thu, 24 Dec 2020 09:27:52 GMT (1143kb,D)

Link back to: arXiv, form interface, contact.