Misogynistic Tweet Detection: Modelling CNN with Small Datasets

Bashar, Md Abul; Nayak, Richi; Suzor, Nicolas; Weir, Bridget

doi:10.1007/978-981-13-6661-1_1

(Submitted on 28 Aug 2020)

Abstract: Online abuse directed towards women on the social media platform Twitter has attracted considerable attention in recent years. An automated method to effectively identify misogynistic abuse could improve our understanding of the patterns, driving factors, and effectiveness of responses associated with abusive tweets over a sustained time period. However, training a neural network (NN) model with a small set of labelled data to detect misogynistic tweets is difficult. This is partly due to the complex nature of tweets that contain misogynistic content, and the vast number of parameters that must be learned in an NN model. We have conducted a series of experiments to investigate how to train an NN model to detect misogynistic tweets effectively. In particular, we have customised and regularised a Convolutional Neural Network (CNN) architecture and shown that word vectors pre-trained on a task-specific domain can be used to train a CNN model effectively when only a small set of labelled data is available. A CNN model trained in this way yields improved accuracy over the state-of-the-art models.
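
To make the ingredients named in the abstract concrete, the following is a minimal sketch of a generic text-CNN forward pass. It shows frozen pre-trained word vectors, convolution filters over word windows, max-over-time pooling, and dropout as a regulariser. It is not the authors' customised architecture: the toy vocabulary, the vector dimension, the filter widths, the random untrained weights, and every function name are assumptions made purely for illustration.

```python
import math
import random

# Hypothetical toy "pre-trained" word vectors (dimension 3). In practice these
# would come from an embedding model trained on a large unlabelled corpus.
PRETRAINED = {
    "<pad>": [0.0, 0.0, 0.0],
    "<unk>": [0.1, 0.1, 0.1],
    "you": [0.2, -0.1, 0.4],
    "are": [0.0, 0.3, -0.2],
    "great": [0.5, 0.1, 0.0],
}


def embed(tokens, min_len):
    """Look up frozen vectors, padding so at least one filter window fits."""
    padded = tokens + ["<pad>"] * max(0, min_len - len(tokens))
    return [PRETRAINED.get(t, PRETRAINED["<unk>"]) for t in padded]


def conv_max_pool(vectors, filt, bias):
    """Slide one filter over word windows, apply ReLU, max-pool over time."""
    width = len(filt)
    best = 0.0  # ReLU output is never below zero
    for start in range(len(vectors) - width + 1):
        total = bias
        for offset in range(width):
            window_vec = vectors[start + offset]
            total += sum(w * x for w, x in zip(filt[offset], window_vec))
        best = max(best, max(0.0, total))
    return best


def dropout(features, rate, rng, training):
    """Inverted dropout: zero features at random, during training only."""
    if not training or rate == 0.0:
        return list(features)
    keep = 1.0 - rate
    return [f / keep if rng.random() < keep else 0.0 for f in features]


def predict(tokens, filters, biases, out_weights, out_bias,
            rate=0.5, rng=None, training=False):
    """Return a score in (0, 1) from one sigmoid output unit."""
    width = max(len(f) for f in filters)
    vectors = embed(tokens, width)
    pooled = [conv_max_pool(vectors, f, b) for f, b in zip(filters, biases)]
    pooled = dropout(pooled, rate, rng or random.Random(0), training)
    logit = out_bias + sum(w * p for w, p in zip(out_weights, pooled))
    return 1.0 / (1.0 + math.exp(-logit))


# Randomly initialised (untrained) filters of widths 2 and 3, for illustration.
_rng = random.Random(42)
filters = [[[_rng.uniform(-0.5, 0.5) for _ in range(3)] for _ in range(width)]
           for width in (2, 3)]
biases = [0.0, 0.0]
out_weights = [_rng.uniform(-0.5, 0.5) for _ in filters]

score = predict(["you", "are", "great"], filters, biases, out_weights, 0.0)
print(round(score, 3))
```

Because the embedding table is only read and never updated, the trainable parameters in this sketch are limited to the filters and the output layer, which is one common way to reduce what has to be learned from a small labelled set. Whether the paper keeps its word vectors frozen or fine-tunes them is not stated in the abstract.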

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
Journal reference:	Australasian Conference on Data Mining. Springer. pp. 3–16, 2018
DOI:	10.1007/978-981-13-6661-1_1
Cite as:	arXiv:2008.12452 [cs.CL]
	(or arXiv:2008.12452v1 [cs.CL] for this version)

Submission history

From: Md Abul Bashar [view email]
[v1] Fri, 28 Aug 2020 02:59:22 GMT (119kb)
