Correlated Feature Selection for Tweet Spam Classification using Artificial Neural Networks

Mishra, Prakamya


Authors: Prakamya Mishra

(Submitted on 6 Nov 2019 (this version), latest version 25 Oct 2020 (v4))

Abstract: Identifying spam messages is challenging for social networks because of their large size and complex nature. This paper analyzes spamming on Twitter. To classify spam efficiently, it is necessary first to understand the features of spam tweets and to identify the attributes of spammers. We extract both tweet-based and user-based features and observe the correlation between them. This step matters because combining highly correlated features can reduce training time. We then train artificial neural networks to classify tweets as spam or non-spam. The Correlational Artificial Neural Network achieves the highest accuracy, 97.57%, when compared with four other classifiers: SVM, Kernel SVM, K Nearest Neighbours, and Artificial Neural Network.
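
The abstract's idea of combining highly correlated features can be illustrated with a minimal sketch. It is not the paper's implementation: the function names `correlated_groups` and `combine_features`, the 0.9 threshold, and the choice to merge a group by averaging its standardized columns are all assumptions made for illustration. Only positively correlated columns are grouped, so that averaging does not cancel them out.

```python
import numpy as np

def correlated_groups(X, threshold=0.9):
    """Group column indices whose Pearson correlation is >= threshold.

    Hypothetical helper: the threshold value is an assumption, not a
    figure taken from the paper.
    """
    corr = np.corrcoef(X, rowvar=False)  # columns are features
    n = corr.shape[0]
    parent = list(range(n))  # union-find forest over feature indices

    def find(i):
        # Follow parent links to the group representative (path halving).
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            if corr[i, j] >= threshold:
                parent[find(j)] = find(i)  # merge the two groups

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())

def combine_features(X, groups):
    """Replace each group with the mean of its standardized columns.

    Assumes no column is constant (its standard deviation is nonzero).
    """
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    return np.column_stack([Z[:, g].mean(axis=1) for g in groups])

# Toy data: column 1 is a linear function of column 0; column 2 is unrelated.
a = np.arange(10.0)
X = np.column_stack([a, 2 * a + 1, np.tile([1.0, -1.0], 5)])
groups = correlated_groups(X)
X_reduced = combine_features(X, groups)
```

On the toy matrix, the first two columns fall into one group and the third stays alone, so a classifier would be trained on two inputs instead of three. A smaller input layer is one plausible way such merging could shorten training.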

Subjects:	Social and Information Networks (cs.SI); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1911.05495 [cs.SI]
	(or arXiv:1911.05495v1 [cs.SI] for this version)

Submission history

From: Prakamya Mishra
[v1] Wed, 6 Nov 2019 15:16:35 GMT (1265kb,D)
[v2] Wed, 27 Nov 2019 22:22:01 GMT (0kb,I)
[v3] Tue, 26 May 2020 22:44:27 GMT (1265kb,D)
[v4] Sun, 25 Oct 2020 20:23:53 GMT (1257kb,D)
