Divide and Conquer: An Ensemble Approach for Hostile Post Detection in Hindi

Bhatnagar, Varad; Kumar, Prince; Moghili, Sairam; Bhattacharyya, Pushpak

Full-text links:

Download:

PDF only

Current browse context:

cs.CL

< prev | next >

new | recent | 2101

Change to browse by:

Computer Science > Computation and Language

Title: Divide and Conquer: An Ensemble Approach for Hostile Post Detection in Hindi

Authors: Varad Bhatnagar, Prince Kumar, Sairam Moghili, Pushpak Bhattacharyya

(Submitted on 20 Jan 2021)

Abstract: Recently the NLP community has started showing interest towards the challenging task of Hostile Post Detection. This paper present our system for Shared Task at Constraint2021 on "Hostile Post Detection in Hindi". The data for this shared task is provided in Hindi Devanagari script which was collected from Twitter and Facebook. It is a multi-label multi-class classification problem where each data instance is annotated into one or more of the five classes: fake, hate, offensive, defamation, and non-hostile. We propose a two level architecture which is made up of BERT based classifiers and statistical classifiers to solve this problem. Our team 'Albatross', scored 0.9709 Coarse grained hostility F1 score measure on Hostile Post Detection in Hindi subtask and secured 2nd rank out of 45 teams for the task. Our submission is ranked 2nd and 3rd out of a total of 156 submissions with Coarse grained hostility F1 score of 0.9709 and 0.9703 respectively. Our fine grained scores are also very encouraging and can be improved with further finetuning. The code is publicly available.

Subjects:	Computation and Language (cs.CL)
Journal reference:	CONSTRAINT @AAAI 2021 Combating Online Hostile Posts in Regional Languages during Emergency Situation pp244-255
Cite as:	arXiv:2101.07973 [cs.CL]
	(or arXiv:2101.07973v1 [cs.CL] for this version)

Submission history

From: Varad Bhatnagar [view email]
[v1] Wed, 20 Jan 2021 05:38:07 GMT (519kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2101.07973

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Divide and Conquer: An Ensemble Approach for Hostile Post Detection in Hindi

Submission history