Don't Forget About Pronouns: Removing Gender Bias in Language Models Without Losing Factual Gender Information

Limisiewicz, Tomasz; Mareček, David

Full-text links:

Download:

Computer Science > Computation and Language

Title: Don't Forget About Pronouns: Removing Gender Bias in Language Models Without Losing Factual Gender Information

Authors: Tomasz Limisiewicz, David Mareček

(Submitted on 21 Jun 2022)

Abstract: The representations in large language models contain multiple types of gender information. We focus on two types of such signals in English texts: factual gender information, which is a grammatical or semantic property, and gender bias, which is the correlation between a word and specific gender. We can disentangle the model's embeddings and identify components encoding both types of information with probing. We aim to diminish the stereotypical bias in the representations while preserving the factual gender signal. Our filtering method shows that it is possible to decrease the bias of gender-neutral profession names without significant deterioration of language modeling capabilities. The findings can be applied to language generation to mitigate reliance on stereotypes while preserving gender agreement in coreferences.

Comments:	Presented at GeBNLP 2022
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2206.10744 [cs.CL]
	(or arXiv:2206.10744v1 [cs.CL] for this version)

Submission history

From: Tomasz Limisiewicz [view email]
[v1] Tue, 21 Jun 2022 21:38:25 GMT (6379kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2206.10744

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Don't Forget About Pronouns: Removing Gender Bias in Language Models Without Losing Factual Gender Information

Submission history