We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Filtering Tweets for Social Unrest

Abstract: Since the events of the Arab Spring, there has been increased interest in using social media to anticipate social unrest. While efforts have been made toward automated unrest prediction, we focus on filtering the vast volume of tweets to identify tweets relevant to unrest, which can be provided to downstream users for further analysis. We train a supervised classifier that is able to label Arabic language tweets as relevant to unrest with high reliability. We examine the relationship between training data size and performance and investigate ways to optimize the model building process while minimizing cost. We also explore how confidence thresholds can be set to achieve desired levels of performance.
Comments: 7 pages, 8 figures, 3 tables; published in Proceedings of the 2017 IEEE 11th International Conference on Semantic Computing (ICSC), San Diego, CA, USA, pages 17-23, January 2017
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG); Machine Learning (stat.ML)
ACM classes: H.3.3; I.2.6; I.2.7; I.5.4
Journal reference: In Proceedings of the 2017 IEEE 11th International Conference on Semantic Computing (ICSC), pages 17-23, San Diego, CA, USA, January 2017. IEEE
DOI: 10.1109/ICSC.2017.75
Cite as: arXiv:1702.06216 [cs.CL]
  (or arXiv:1702.06216v2 [cs.CL] for this version)

Submission history

From: Michael Bloodgood [view email]
[v1] Mon, 20 Feb 2017 23:48:39 GMT (2131kb,D)
[v2] Sat, 1 Apr 2017 22:37:35 GMT (2131kb,D)

Link back to: arXiv, form interface, contact.