Towards countering hate speech and personal attack in social media

Charitidis, Polychronis; Doropoulos, Stavros; Vologiannidis, Stavros; Papastergiou, Ioannis; Karakeva, Sophia

(Submitted on 5 Dec 2019 (this version), latest version 30 Apr 2020 (v2))

Abstract: The damaging effects of hate speech in social media have become evident over the last few years, and several organizations, researchers and the social media platforms themselves have tried to curb them without great success. Recently, following the advent of deep learning, several novel approaches have appeared in the field of hate speech detection. However, it is apparent that such approaches depend on large-scale datasets in order to exhibit competitive performance. In this paper, we present a novel, publicly available collection of datasets in five different languages that consists of tweets referring to journalism-related accounts and includes high-quality human annotations for hate speech and personal attack. To build the datasets, we follow a concise annotation strategy and employ an active learning approach. Additionally, we present a number of state-of-the-art deep learning architectures for hate speech detection and use these datasets to train and evaluate them. Finally, we propose an ensemble model that outperforms all individual models.

Subjects:	Information Retrieval (cs.IR); Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
Cite as:	arXiv:1912.04106 [cs.IR]
	(or arXiv:1912.04106v1 [cs.IR] for this version)

Submission history

From: Stavros Vologiannidis
[v1] Thu, 5 Dec 2019 07:51:23 GMT (224kb,D)
[v2] Thu, 30 Apr 2020 19:55:34 GMT (617kb,D)
