Title: Towards WinoQueer: Developing a Benchmark for Anti-Queer Bias in Large Language Models

Abstract: This paper presents exploratory work on whether and to what extent biases against queer and trans people are encoded in large language models (LLMs) such as BERT. We also propose a method for reducing these biases in downstream tasks: finetuning the models on data written by and/or about queer people. To measure anti-queer bias, we introduce a new benchmark dataset, WinoQueer, modeled after other bias-detection benchmarks but addressing homophobic and transphobic biases. We find that BERT shows significant homophobic bias, but that this bias can be mostly mitigated by finetuning BERT on a natural language corpus written by members of the LGBTQ+ community.
Comments: Accepted to Queer in AI Workshop @ NAACL 2022. Updated 07/07 with minor typographical fixes
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
ACM classes: I.2.7
Cite as: arXiv:2206.11484 [cs.CL]
  (or arXiv:2206.11484v2 [cs.CL] for this version)

Submission history

From: Virginia K. Felkner
[v1] Thu, 23 Jun 2022 05:30:47 GMT (27kb,D)
[v2] Fri, 8 Jul 2022 02:09:28 GMT (27kb,D)
