Title: A general method for estimating the prevalence of Influenza-Like-Symptoms with Wikipedia data

Abstract: Influenza is an acute, seasonal respiratory disease that affects millions of people worldwide and causes thousands of deaths in Europe alone. Estimating the impact of an illness on a given country quickly and reliably is essential for planning and organizing effective countermeasures, and this is now possible by leveraging unconventional data sources such as web searches and page visits. In this study, we show that Wikipedia page views for a selected group of articles, combined with machine learning models, can yield accurate estimates of influenza-like illness incidence in four European countries: Italy, Germany, Belgium, and the Netherlands. We propose a novel language-agnostic method, based on two algorithms, Personalized PageRank and CycleRank, that automatically selects the most relevant Wikipedia pages to monitor without the need for expert supervision. We then show that our model reaches state-of-the-art results by comparing it with previous solutions.
Subjects: Computers and Society (cs.CY); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
Journal reference: PLOS ONE. 16 (2021) 0-20
DOI: 10.1371/journal.pone.0256858
Cite as: arXiv:2010.14903 [cs.CY]
  (or arXiv:2010.14903v1 [cs.CY] for this version)

Submission history

From: Giovanni De Toni
[v1] Wed, 28 Oct 2020 11:44:44 GMT (2995kb,D)
