We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: LT@Helsinki at SemEval-2020 Task 12: Multilingual or language-specific BERT?

Abstract: This paper presents the different models submitted by the LT@Helsinki team for the SemEval 2020 Shared Task 12. Our team participated in sub-tasks A and C; titled offensive language identification and offense target identification, respectively. In both cases we used the so-called Bidirectional Encoder Representation from Transformer (BERT), a model pre-trained by Google and fine-tuned by us on the OLID and SOLID datasets. The results show that offensive tweet classification is one of several language-based tasks where BERT can achieve state-of-the-art results.
Comments: Accepted at SemEval-2020 Task 12. Identical to camera-ready version except where adjustments to fit arXiv requirements were necessary
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2008.00805 [cs.CL]
  (or arXiv:2008.00805v1 [cs.CL] for this version)

Submission history

From: Emily Ohman [view email]
[v1] Mon, 3 Aug 2020 12:03:17 GMT (214kb,D)

Link back to: arXiv, form interface, contact.