We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CY

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computers and Society

Title: Detecting Emergent Intersectional Biases: Contextualized Word Embeddings Contain a Distribution of Human-like Biases

Abstract: With the starting point that implicit human biases are reflected in the statistical regularities of language, it is possible to measure biases in English static word embeddings. State-of-the-art neural language models generate dynamic word embeddings dependent on the context in which the word appears. Current methods measure pre-defined social and intersectional biases that appear in particular contexts defined by sentence templates. Dispensing with templates, we introduce the Contextualized Embedding Association Test (CEAT), that can summarize the magnitude of overall bias in neural language models by incorporating a random-effects model. Experiments on social and intersectional biases show that CEAT finds evidence of all tested biases and provides comprehensive information on the variance of effect magnitudes of the same bias in different contexts. All the models trained on English corpora that we study contain biased representations.
Furthermore, we develop two methods, Intersectional Bias Detection (IBD) and Emergent Intersectional Bias Detection (EIBD), to automatically identify the intersectional biases and emergent intersectional biases from static word embeddings in addition to measuring them in contextualized word embeddings. We present the first algorithmic bias detection findings on how intersectional group members are strongly associated with unique emergent biases that do not overlap with the biases of their constituent minority identities. IBD and EIBD achieve high accuracy when detecting the intersectional and emergent biases of African American females and Mexican American females. Our results indicate that biases at the intersection of race and gender associated with members of multiple minority groups, such as African American females and Mexican American females, have the highest magnitude across all neural language models.
Comments: 19 pages, 2 figures, 4 tables
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Journal reference: AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society 2021
DOI: 10.1145/3461702.3462536
Cite as: arXiv:2006.03955 [cs.CY]
  (or arXiv:2006.03955v5 [cs.CY] for this version)

Submission history

From: Aylin Caliskan [view email]
[v1] Sat, 6 Jun 2020 19:49:50 GMT (358kb,D)
[v2] Mon, 22 Jun 2020 20:08:41 GMT (69kb,D)
[v3] Mon, 6 Jul 2020 18:43:34 GMT (71kb,D)
[v4] Fri, 16 Apr 2021 01:45:35 GMT (2503kb,D)
[v5] Wed, 19 May 2021 15:06:28 GMT (2504kb,D)

Link back to: arXiv, form interface, contact.