Socially Aware Bias Measurements for Hindi Language Representations

Malik, Vijit; Dev, Sunipa; Nishi, Akihiro; Peng, Nanyun; Chang, Kai-Wei

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2110

Change to browse by:

Computer Science > Computation and Language

Title: Socially Aware Bias Measurements for Hindi Language Representations

Authors: Vijit Malik, Sunipa Dev, Akihiro Nishi, Nanyun Peng, Kai-Wei Chang

(Submitted on 15 Oct 2021 (this version), latest version 9 May 2022 (v2))

Abstract: Language representations are an efficient tool used across NLP, but they are strife with encoded societal biases. These biases are studied extensively, but with a primary focus on English language representations and biases common in the context of Western society. In this work, we investigate the biases present in Hindi language representations such as caste and religion associated biases. We demonstrate how biases are unique to specific language representations based on the history and culture of the region they are widely spoken in, and also how the same societal bias (such as binary gender associated biases) when investigated across languages is encoded by different words and text spans. With this work, we emphasize on the necessity of social-awareness along with linguistic and grammatical artefacts when modeling language representations, in order to understand the biases encoded.

Comments:	11 Pages (5 Pages main content+ 1 pages for references + 5 Pages Appendix)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2110.07871 [cs.CL]
	(or arXiv:2110.07871v1 [cs.CL] for this version)

Submission history

From: Vijit Malik [view email]
[v1] Fri, 15 Oct 2021 05:49:15 GMT (5244kb)
[v2] Mon, 9 May 2022 06:18:07 GMT (6307kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2110.07871v1

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Socially Aware Bias Measurements for Hindi Language Representations

Submission history