We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computation and Language

Title: Empirical evaluation of shallow and deep learning classifiers for Arabic sentiment analysis

Abstract: This work presents a detailed comparison of the performance of deep learning models such as convolutional neural networks (CNN), long short-term memory (LSTM), gated recurrent units (GRU), their hybrids, and a selection of shallow learning classifiers for sentiment analysis of Arabic reviews. Additionally, the comparison includes state-of-the-art models such as the transformer architecture and the araBERT pre-trained model. The datasets used in this study are multi-dialect Arabic hotel and book review datasets, which are some of the largest publicly available datasets for Arabic reviews. Results showed deep learning outperforming shallow learning for binary and multi-label classification, in contrast with the results of similar work reported in the literature. This discrepancy in outcome was caused by dataset size as we found it to be proportional to the performance of deep learning models. The performance of deep and shallow learning techniques was analyzed in terms of accuracy and F1 score. The best performing shallow learning technique was Random Forest followed by Decision Tree, and AdaBoost. The deep learning models performed similarly using a default embedding layer, while the transformer model performed best when augmented with araBERT.
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
ACM classes: I.2.7; I.5
Journal reference: ACM Trans. Asian Low-Resour. Lang. Inf. Process. 21, 1, Article 14 (November 2021), 25 pages (2021)
DOI: 10.1145/3466171
Cite as: arXiv:2112.00534 [cs.CL]
  (or arXiv:2112.00534v1 [cs.CL] for this version)

Submission history

From: Abdollah Darya [view email]
[v1] Wed, 1 Dec 2021 14:45:43 GMT (586kb,D)

Link back to: arXiv, form interface, contact.