iSarcasm: A Dataset of Intended Sarcasm

Oprea, Silviu; Magdy, Walid

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 1911

Change to browse by:

Computer Science > Computation and Language

Title: iSarcasm: A Dataset of Intended Sarcasm

Authors: Silviu Oprea, Walid Magdy

(Submitted on 8 Nov 2019 (v1), last revised 1 May 2020 (this version, v2))

Abstract: We consider the distinction between intended and perceived sarcasm in the context of textual sarcasm detection. The former occurs when an utterance is sarcastic from the perspective of its author, while the latter occurs when the utterance is interpreted as sarcastic by the audience. We show the limitations of previous labelling methods in capturing intended sarcasm and introduce the iSarcasm dataset of tweets labeled for sarcasm directly by their authors. Examining the state-of-the-art sarcasm detection models on our dataset showed low performance compared to previously studied datasets, which indicates that these datasets might be biased or obvious and sarcasm could be a phenomenon under-studied computationally thus far. By providing the iSarcasm dataset, we aim to encourage future NLP research to develop methods for detecting sarcasm in text as intended by the authors of the text, not as labeled under assumptions that we demonstrate to be sub-optimal.

Comments:	9 pages
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1911.03123 [cs.CL]
	(or arXiv:1911.03123v2 [cs.CL] for this version)

Submission history

From: Silviu Oprea [view email]
[v1] Fri, 8 Nov 2019 08:40:22 GMT (95kb)
[v2] Fri, 1 May 2020 19:40:12 GMT (249kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1911.03123

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: iSarcasm: A Dataset of Intended Sarcasm

Submission history