Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models

Röttger, Paul; Seelawi, Haitham; Nozza, Debora; Talat, Zeerak; Vidgen, Bertie

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2206

Change to browse by:

Computer Science > Computation and Language

Title: Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models

Authors: Paul Röttger, Haitham Seelawi, Debora Nozza, Zeerak Talat, Bertie Vidgen

(Submitted on 20 Jun 2022)

Abstract: Hate speech detection models are typically evaluated on held-out test sets. However, this risks painting an incomplete and potentially misleading picture of model performance because of increasingly well-documented systematic gaps and biases in hate speech datasets. To enable more targeted diagnostic insights, recent research has thus introduced functional tests for hate speech detection models. However, these tests currently only exist for English-language content, which means that they cannot support the development of more effective models in other languages spoken by billions across the world. To help address this issue, we introduce Multilingual HateCheck (MHC), a suite of functional tests for multilingual hate speech detection models. MHC covers 34 functionalities across ten languages, which is more languages than any other hate speech dataset. To illustrate MHC's utility, we train and test a high-performing multilingual hate speech detection model, and reveal critical model weaknesses for monolingual and cross-lingual applications.

Comments:	Accepted at WOAH (NAACL 2022)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2206.09917 [cs.CL]
	(or arXiv:2206.09917v1 [cs.CL] for this version)

Submission history

From: Paul Röttger [view email]
[v1] Mon, 20 Jun 2022 17:54:39 GMT (133kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2206.09917

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models

Submission history