Title: Controlled Analyses of Social Biases in Wikipedia Bios

Abstract: Social biases on Wikipedia, a widely read global platform, could greatly influence public opinion. While prior research has examined man/woman gender bias in biography articles, possible influences of other demographic attributes limit conclusions. In this work, we present a methodology for analyzing Wikipedia pages about people that isolates dimensions of interest (e.g., gender) from other attributes (e.g., occupation). Given a target corpus for analysis (e.g., biographies about women), we present a method for constructing a comparison corpus that matches the target corpus in as many attributes as possible, except the target one. We develop evaluation metrics to measure how well the comparison corpus aligns with the target corpus and then examine how articles about gender and racial minorities (cisgender women, non-binary people, transgender women, and transgender men; African American, Asian American, and Hispanic/Latinx American people) differ from other articles. In addition to identifying suspect social biases, our results show that failing to control for covariates can result in different conclusions and veil biases. Our contributions include methodology that facilitates further analyses of bias in Wikipedia articles, findings that can aid Wikipedia editors in reducing biases, and a framework and evaluation metrics to guide future work in this area.
Comments: Accepted to the Web Conference 2022 (WWW '22)
Subjects: Computation and Language (cs.CL)
DOI: 10.1145/3485447.3512134
Cite as: arXiv:2101.00078 [cs.CL]
  (or arXiv:2101.00078v4 [cs.CL] for this version)

Submission history

From: Anjalie Field
[v1] Thu, 31 Dec 2020 21:27:12 GMT (1990kb,D)
[v2] Fri, 22 Oct 2021 21:12:04 GMT (17504kb,D)
[v3] Sat, 5 Feb 2022 01:25:00 GMT (8744kb,D)
[v4] Wed, 9 Feb 2022 06:34:38 GMT (8744kb,D)
