Detecting Emergent Intersectional Biases: Contextualized Word Embeddings Contain a Distribution of Human-like Biases

Guo, Wei; Caliskan, Aylin

doi:10.1145/3461702.3462536

(Submitted on 6 Jun 2020 (v1), last revised 19 May 2021 (this version, v5))

Abstract: With the starting point that implicit human biases are reflected in the statistical regularities of language, it is possible to measure biases in English static word embeddings. State-of-the-art neural language models generate dynamic word embeddings dependent on the context in which the word appears. Current methods measure pre-defined social and intersectional biases that appear in particular contexts defined by sentence templates. Dispensing with templates, we introduce the Contextualized Embedding Association Test (CEAT), which can summarize the magnitude of overall bias in neural language models by incorporating a random-effects model. Experiments on social and intersectional biases show that CEAT finds evidence of all tested biases and provides comprehensive information on the variance of effect magnitudes of the same bias in different contexts. All the models trained on English corpora that we study contain biased representations.
Furthermore, we develop two methods, Intersectional Bias Detection (IBD) and Emergent Intersectional Bias Detection (EIBD), to automatically identify the intersectional biases and emergent intersectional biases from static word embeddings in addition to measuring them in contextualized word embeddings. We present the first algorithmic bias detection findings on how intersectional group members are strongly associated with unique emergent biases that do not overlap with the biases of their constituent minority identities. IBD and EIBD achieve high accuracy when detecting the intersectional and emergent biases of African American females and Mexican American females. Our results indicate that biases at the intersection of race and gender associated with members of multiple minority groups, such as African American females and Mexican American females, have the highest magnitude across all neural language models.
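The abstract says CEAT summarizes overall bias by incorporating a random-effects model, but gives no formulas. As a rough illustration only, the sketch below pools per-context effect sizes with a DerSimonian-Laird random-effects estimator. The choice of that estimator, the function name `combine_random_effects`, and the sample numbers are assumptions made for this example and are not taken from the abstract.

```python
import math


def combine_random_effects(effects, variances):
    """Pool per-context effect sizes with a DerSimonian-Laird estimator.

    Hypothetical helper for illustration; not the authors' code.
    effects:   effect size measured in each sampled context
    variances: within-context variance of each effect size (all > 0)
    Returns (combined effect, its standard error, between-context variance).
    """
    k = len(effects)
    if k != len(variances) or k < 2:
        raise ValueError("need at least two effects with matching variances")

    # Fixed-effect (inverse-variance) weights and mean.
    w = [1.0 / v for v in variances]
    sum_w = sum(w)
    fixed_mean = sum(wi * e for wi, e in zip(w, effects)) / sum_w

    # Cochran's Q measures heterogeneity across contexts.
    q = sum(wi * (e - fixed_mean) ** 2 for wi, e in zip(w, effects))
    c = sum_w - sum(wi ** 2 for wi in w) / sum_w

    # Between-context variance, truncated at zero.
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0

    # Random-effects weights add tau2 to every within-context variance.
    w_star = [1.0 / (v + tau2) for v in variances]
    combined = sum(wi * e for wi, e in zip(w_star, effects)) / sum(w_star)
    std_err = math.sqrt(1.0 / sum(w_star))
    return combined, std_err, tau2


if __name__ == "__main__":
    # Made-up effect sizes from three hypothetical contexts.
    ces, se, tau2 = combine_random_effects([0.9, 0.4, 1.3], [0.05, 0.08, 0.06])
    print(f"combined={ces:.3f} se={se:.3f} tau2={tau2:.3f}")
```

A nonzero between-context variance in this sketch corresponds to the abstract's point that the same bias can vary in magnitude from one context to another, so a single template-based measurement may not be representative.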

Comments:	19 pages, 2 figures, 4 tables
Subjects:	Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Journal reference:	AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society 2021
DOI:	10.1145/3461702.3462536
Cite as:	arXiv:2006.03955 [cs.CY]
	(or arXiv:2006.03955v5 [cs.CY] for this version)

Submission history

From: Aylin Caliskan
[v1] Sat, 6 Jun 2020 19:49:50 GMT (358kb,D)
[v2] Mon, 22 Jun 2020 20:08:41 GMT (69kb,D)
[v3] Mon, 6 Jul 2020 18:43:34 GMT (71kb,D)
[v4] Fri, 16 Apr 2021 01:45:35 GMT (2503kb,D)
[v5] Wed, 19 May 2021 15:06:28 GMT (2504kb,D)
