Title: A Speech Act Classifier for Persian Texts and its Application in Identifying Rumors

Abstract: Speech acts (SAs) are an important area of pragmatics: they convey the intended function of an utterance and give a better understanding of the speaker's state of mind. Knowing the SA of a text can help in analyzing that text in natural language processing applications. This study presents a dictionary-based statistical technique for Persian SA recognition. The proposed technique classifies a text into one of seven SA classes using four kinds of features: lexical, syntactic, semantic, and surface. WordNet is used to extract synonyms and enrich the feature dictionary. To evaluate the proposed technique, we used four classification methods: Random Forest (RF), Support Vector Machine (SVM), Naive Bayes (NB), and K-Nearest Neighbors (KNN). The experimental results show that the proposed method, with RF and SVM as the best classifiers, achieved state-of-the-art performance with an accuracy of 0.95 for the classification of Persian SAs. The original aim of this work is to introduce an application of SA recognition to social media content, in particular to identify the SAs common in rumors. The proposed system was therefore used to determine the common SAs in rumors. The results showed that Persian rumors are most often expressed in three SA classes (narrative, question, and threat) and in some cases in the request SA.
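
As a rough illustration of what a dictionary-based feature extractor might look like, the sketch below counts cue-word matches per SA class and adds two surface features. It is not the authors' implementation: the function name `extract_features`, the English cue words, and the feature names are all hypothetical, and the abstract does not list the actual Persian dictionaries or all seven classes. Only lexical and surface features are shown; the syntactic and semantic features and the WordNet enrichment step are omitted.

```python
import re
from collections import Counter

# Hypothetical toy cue-word dictionaries (English, for illustration only).
# The class names come from the abstract; the words themselves are assumptions.
CUE_WORDS = {
    "question": {"what", "why", "how", "who"},
    "request": {"please", "kindly", "share"},
    "threat": {"warning", "beware", "danger"},
}


def extract_features(text):
    """Map a text to a small dictionary of lexical and surface features."""
    tokens = re.findall(r"\w+", text.lower())
    counts = Counter(tokens)

    # Lexical features: number of tokens that hit each class dictionary.
    features = {
        f"lex_{label}": sum(counts[word] for word in words)
        for label, words in CUE_WORDS.items()
    }

    # Surface features: presence of a question mark and the text length.
    features["surface_question_mark"] = int("?" in text)
    features["surface_token_count"] = len(tokens)
    return features


if __name__ == "__main__":
    print(extract_features("Why share this? Please beware"))
```

Feature dictionaries of this kind could then be vectorized and passed to any of the four classifiers named in the abstract (RF, SVM, NB, KNN).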
Comments: Published Link: this http URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
Journal reference: Journal of Soft Computing and Information Technology, 9, 1, 1399 (2020), 18-27
Cite as: arXiv:1901.03904 [cs.CL]
  (or arXiv:1901.03904v4 [cs.CL] for this version)

Submission history

From: Mohammad Reza Feizi Derakhshi
[v1] Sat, 12 Jan 2019 21:54:23 GMT (1108kb)
[v2] Fri, 1 Feb 2019 15:08:44 GMT (1135kb)
[v3] Thu, 9 Jul 2020 08:19:03 GMT (596kb)
[v4] Sun, 12 Jul 2020 10:42:12 GMT (596kb)
