
Title: Inference of Media Bias and Content Quality Using Natural-Language Processing

Abstract: Media bias can significantly impact the formation and development of opinions and sentiments in a population. It is thus important to study the emergence and development of partisan media and political polarization. However, it is challenging to quantitatively infer the ideological positions of media outlets. In this paper, we present a quantitative framework to infer both the political bias and the content quality of media outlets from text, and we illustrate this framework with experiments on real-world data. We apply a bidirectional long short-term memory (LSTM) neural network to a data set of more than 1 million tweets to generate a two-dimensional ideological-bias and content-quality measurement for each tweet. We then infer a "media-bias chart" of (bias, quality) coordinates for the media outlets by integrating the (bias, quality) measurements of each outlet's tweets. We also apply a variety of baseline machine-learning methods, such as a naive-Bayes method and a support-vector machine (SVM), to infer the bias and quality values for each tweet. All of these baseline methods use a bag-of-words representation. We find that the LSTM-network approach has the best performance of the examined methods. Our results illustrate the importance of incorporating word order into machine-learning methods for text analysis.
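
As a rough illustration of two ideas in the abstract, the sketch below shows (1) why a bag-of-words representation cannot distinguish texts that differ only in word order and (2) one possible way to combine per-tweet (bias, quality) values into a single coordinate pair per outlet. The function names, the outlet labels, and the scores are hypothetical, and the use of a plain arithmetic mean is an assumption: the abstract says only that the per-tweet measurements are "integrated", without specifying how.

```python
from collections import Counter, defaultdict
from statistics import mean


def bag_of_words(text):
    """Return word counts for a text; the order of the words is discarded."""
    return Counter(text.lower().split())


def outlet_coordinates(tweet_scores):
    """Combine per-tweet (bias, quality) pairs into one pair per outlet.

    tweet_scores: iterable of (outlet, bias, quality) tuples.
    ASSUMPTION: a plain mean is used here for illustration only; the
    abstract does not state how the measurements are integrated.
    """
    grouped = defaultdict(list)
    for outlet, bias, quality in tweet_scores:
        grouped[outlet].append((bias, quality))
    return {
        outlet: (mean(b for b, _ in pairs), mean(q for _, q in pairs))
        for outlet, pairs in grouped.items()
    }


# Two texts with different meanings but identical bag-of-words features.
same_features = bag_of_words("dog bites man") == bag_of_words("man bites dog")

# Hypothetical per-tweet scores (not data from the paper).
scores = [
    ("outlet_a", -1.0, 3.0),
    ("outlet_a", -3.0, 5.0),
    ("outlet_b", 2.0, 1.0),
]
chart = outlet_coordinates(scores)
```

Here `same_features` is `True`, which is the limitation that a sequence model such as an LSTM avoids by reading the words in order, and `chart` maps each hypothetical outlet to its averaged (bias, quality) pair.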
Comments: 21 pages, 7 figures, 4 tables
Subjects: Physics and Society (physics.soc-ph); Computation and Language (cs.CL); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
Cite as: arXiv:2212.00237 [physics.soc-ph]
  (or arXiv:2212.00237v1 [physics.soc-ph] for this version)

Submission history

From: Mason A. Porter
[v1] Thu, 1 Dec 2022 03:04:55 GMT (1649kb,D)
