We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.SI

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Social and Information Networks

Title: Detection of Novel Social Bots by Ensembles of Specialized Classifiers

Abstract: Malicious actors create inauthentic social media accounts controlled in part by algorithms, known as social bots, to disseminate misinformation and agitate online discussion. While researchers have developed sophisticated methods to detect abuse, novel bots with diverse behaviors evade detection. We show that different types of bots are characterized by different behavioral features. As a result, supervised learning techniques suffer severe performance deterioration when attempting to detect behaviors not observed in the training data. Moreover, tuning these models to recognize novel bots requires retraining with a significant amount of new annotations, which are expensive to obtain. To address these issues, we propose a new supervised learning method that trains classifiers specialized for each class of bots and combines their decisions through the maximum rule. The ensemble of specialized classifiers (ESC) can better generalize, leading to an average improvement of 56\% in F1 score for unseen accounts across datasets. Furthermore, novel bot behaviors are learned with fewer labeled examples during retraining. We deployed ESC in the newest version of Botometer, a popular tool to detect social bots in the wild, with a cross-validation AUC of 0.99.
Comments: 8 pages, 10 figures, Accepted to CIKM'20
Subjects: Social and Information Networks (cs.SI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Journal reference: Proc. 29th ACM International Conference on Information and Knowledge Management (CIKM), pages 2725-2732, 2020
DOI: 10.1145/3340531.3412698
Cite as: arXiv:2006.06867 [cs.SI]
  (or arXiv:2006.06867v2 [cs.SI] for this version)

Submission history

From: Onur Varol [view email]
[v1] Thu, 11 Jun 2020 22:59:59 GMT (1646kb,D)
[v2] Fri, 14 Aug 2020 20:04:21 GMT (1183kb,D)

Link back to: arXiv, form interface, contact.