LT@Helsinki at SemEval-2020 Task 12: Multilingual or language-specific BERT?

Pàmies, Marc; Öhman, Emily; Kajava, Kaisla; Tiedemann, Jörg

Full-text links:

Download:

Computer Science > Computation and Language

Title: LT@Helsinki at SemEval-2020 Task 12: Multilingual or language-specific BERT?

Authors: Marc Pàmies, Emily Öhman, Kaisla Kajava, Jörg Tiedemann

(Submitted on 3 Aug 2020)

Abstract: This paper presents the different models submitted by the LT@Helsinki team for the SemEval 2020 Shared Task 12. Our team participated in sub-tasks A and C; titled offensive language identification and offense target identification, respectively. In both cases we used the so-called Bidirectional Encoder Representation from Transformer (BERT), a model pre-trained by Google and fine-tuned by us on the OLID and SOLID datasets. The results show that offensive tweet classification is one of several language-based tasks where BERT can achieve state-of-the-art results.

Comments:	Accepted at SemEval-2020 Task 12. Identical to camera-ready version except where adjustments to fit arXiv requirements were necessary
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2008.00805 [cs.CL]
	(or arXiv:2008.00805v1 [cs.CL] for this version)

Submission history

From: Emily Ohman [view email]
[v1] Mon, 3 Aug 2020 12:03:17 GMT (214kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2008.00805

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: LT@Helsinki at SemEval-2020 Task 12: Multilingual or language-specific BERT?

Submission history