Title: Signal Processing on Higher-Order Networks: Livin' on the Edge ... and Beyond
Comments: 38 pages; 7 figures
Subjects: Social and Information Networks (cs.SI); Machine Learning (cs.LG); Physics and Society (physics.soc-ph); Machine Learning (stat.ML)

This tutorial paper presents a didactic treatment of the emerging topic of signal processing on higher-order networks. Drawing analogies from discrete and graph signal processing, we introduce the building blocks for processing data on simplicial complexes and hypergraphs, two common abstractions of higher-order networks that can incorporate polyadic relationships. We provide basic introductions to simplicial complexes and hypergraphs, with special emphasis on the concepts needed for processing signals on them. Leveraging these concepts, we discuss Fourier analysis, signal denoising, signal interpolation, node embeddings, and non-linear processing through neural networks in these two representations of polyadic relational structures. In the context of simplicial complexes, we specifically focus on signal processing using the Hodge Laplacian matrix, a multi-relational operator that leverages the special structure of simplicial complexes and generalizes desirable properties of the Laplacian matrix in graph signal processing. For hypergraphs, we present both matrix and tensor representations, and discuss the trade-offs in adopting one or the other. We also highlight limitations and potential research avenues, both to inform practitioners and to motivate the contribution of new researchers to the area.
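
To make the Hodge Laplacian mentioned above more concrete, here is a minimal sketch, assuming the standard definition L1 = B1^T B1 + B2 B2^T built from the node-to-edge and edge-to-triangle boundary matrices. The abstract itself gives no formula; the function names, the orientation convention (vertices listed in increasing order), and the toy complex are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def boundary_matrices(n_nodes, edges, triangles):
    """Build B1 (nodes x edges) and B2 (edges x triangles).

    Assumes each edge (i, j) has i < j and each triangle (i, j, k) has
    i < j < k, with all three triangle faces present in `edges`.
    """
    edge_index = {e: col for col, e in enumerate(edges)}
    B1 = np.zeros((n_nodes, len(edges)))
    for col, (i, j) in enumerate(edges):
        B1[i, col] = -1.0  # edge leaves its tail node
        B1[j, col] = 1.0   # edge enters its head node
    B2 = np.zeros((len(edges), len(triangles)))
    for col, (i, j, k) in enumerate(triangles):
        # Boundary of (i, j, k) is (j, k) - (i, k) + (i, j).
        B2[edge_index[(j, k)], col] = 1.0
        B2[edge_index[(i, k)], col] = -1.0
        B2[edge_index[(i, j)], col] = 1.0
    return B1, B2

def hodge_laplacian_1(B1, B2):
    """Hodge 1-Laplacian acting on edge signals."""
    return B1.T @ B1 + B2 @ B2.T

def num_holes(L1):
    """Dimension of the kernel of L1 (number of 1-dimensional holes)."""
    return L1.shape[0] - np.linalg.matrix_rank(L1)

# Toy complex (hypothetical): a triangle on nodes 0, 1, 2 plus a tail edge (2, 3).
edges = [(0, 1), (0, 2), (1, 2), (2, 3)]
B1, B2 = boundary_matrices(4, edges, [(0, 1, 2)])
L1_filled = hodge_laplacian_1(B1, B2)
B1_open, B2_open = boundary_matrices(4, edges, [])
L1_open = hodge_laplacian_1(B1_open, B2_open)
```

In this toy example, filling the triangle removes the only cycle from the kernel of L1, which is one way to see how the operator reflects the structure of the complex beyond pairwise edges.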

Title: Towards Understanding and Evaluating Structural Node Embeddings
Comments: A shorter version of this paper was presented in the Mining and Learning with Graphs workshop at KDD 2020
Subjects: Social and Information Networks (cs.SI)

While most network embedding techniques model the proximity between nodes in a network, recently there has been significant interest in structural embeddings that are based on node equivalences, a notion rooted in sociology: equivalences or positions are collections of nodes that have similar roles--i.e., similar functions, ties or interactions with nodes in other positions--irrespective of their distance or reachability in the network. Unlike the proximity-based methods that are rigorously evaluated in the literature, the evaluation of structural embeddings is less mature. It relies on small synthetic or real networks with labels that are not perfectly defined, and its connection to sociological equivalences has hitherto been vague and tenuous. With new node embedding methods being developed at a breakneck pace, proper evaluation and systematic characterization of existing approaches will be essential to progress. To fill this gap, we set out to understand what types of equivalences structural embeddings capture. We are the first to contribute rigorous intrinsic and extrinsic evaluation methodology for structural embeddings, along with carefully designed, diverse datasets of varying sizes. We observe a number of different evaluation variables that can lead to different results (e.g., choice of similarity measure, classifier, label definitions). We find that degree distributions within nodes' local neighborhoods can lead to simple yet effective baselines in their own right and guide the future development of structural embeddings. We hope that our findings can influence the design of further node embedding methods and also pave the way for more comprehensive and fair evaluation of structural embedding methods.
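
The degree-distribution baseline mentioned above could look roughly like the following sketch. The abstract does not specify the exact construction, so this is only one plausible instance: each node is described by a histogram of the degrees of its 1-hop neighbors. The function name, the adjacency-dict input format, and the `max_degree` clipping parameter are assumptions made for illustration.

```python
from collections import Counter

def neighbor_degree_histogram(adj, max_degree=None):
    """Map each node to a histogram of its neighbors' degrees.

    `adj` is a dict {node: set_of_neighbors} for an undirected graph.
    Degrees above `max_degree` are clipped into the last bin
    (an illustrative choice, not taken from the paper).
    """
    degree = {v: len(nbrs) for v, nbrs in adj.items()}
    if max_degree is None:
        max_degree = max(degree.values(), default=0)
    features = {}
    for v, nbrs in adj.items():
        counts = Counter(min(degree[u], max_degree) for u in nbrs)
        features[v] = [counts.get(d, 0) for d in range(max_degree + 1)]
    return features

# Toy graph (hypothetical): two disconnected 3-node stars.
adj = {0: {1, 2}, 1: {0}, 2: {0}, 3: {4, 5}, 4: {3}, 5: {3}}
features = neighbor_degree_histogram(adj)
```

Here the two star centers receive identical feature vectors even though they lie in different components, which matches the idea of structural similarity irrespective of distance or reachability.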

Title: Eating Garlic Prevents COVID-19 Infection: Detecting Misinformation on the Arabic Content of Twitter
Comments: 18 pages, 4 figures
Subjects: Information Retrieval (cs.IR); Computers and Society (cs.CY); Machine Learning (cs.LG); Social and Information Networks (cs.SI)

The rapid growth of social media content during the current pandemic provides useful tools for disseminating information, but it has also become a source of misinformation. Therefore, there is an urgent need for fact-checking and effective techniques for detecting misinformation in social media. In this work, we study misinformation in the Arabic content of Twitter. We construct a large Arabic dataset related to COVID-19 misinformation and gold-annotate the tweets into two categories: misinformation or not. Then, we apply eight different traditional and deep machine learning models, with different features including word embeddings and word frequency. The word embedding models (FastText and word2vec) exploit more than two million Arabic tweets related to COVID-19. Experiments show that optimizing the area under the curve (AUC) improves the models' performance and that Extreme Gradient Boosting (XGBoost) achieves the highest accuracy in detecting COVID-19 misinformation online.

Title: Quantitative View of the Structure of Institutional Scientific Collaborations Using the Examples of Halle, Jena and Leipzig
Comments: 18 pages, 5 figures, 5 tables
Subjects: Digital Libraries (cs.DL); Social and Information Networks (cs.SI)

Examining the effectiveness of institutional scientific coalitions can inform future policies. This is a study on the structure of scientific collaborations in three cities in central Germany. Since 1995, the three universities of this region have formed and maintained a coalition which led to the establishment of an interdisciplinary center in 2012, the German Center for Integrative Biodiversity Research (iDiv). We investigate whether the impact of the former coalition is evident in the region's structure of scientific collaborations and the scientific output of the new center. Using publication data from 1996-2018, we build co-authorship networks and identify the most cohesive communities in terms of collaboration, and compare them with communities identified based on publications presented as the scientific outcome of the coalition and new center on their website. Our results show that despite the highly cohesive structure of collaborations presented on the coalition website, there is still much potential to be realized. The newly established center has bridged the member institutions, but not to a particularly strong degree. We see that geographical proximity, collaboration policies, funding, and organizational structure alone do not ensure prosperous scientific collaboration structures. When the new center's scientific output is compared with its regional context, observed trends become less conspicuous. Nevertheless, the level of success the coalition achieved could inform policy makers regarding other regions' scientific development plans.
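
As a rough illustration of the co-authorship networks mentioned above, the sketch below builds a weighted edge list in which two authors are linked once for every publication they share. This is a common construction, not necessarily the one used in the paper; the function name, the input format (one author list per publication), and the toy records are hypothetical.

```python
from collections import Counter
from itertools import combinations

def coauthorship_edges(publications):
    """Count shared publications for every pair of co-authors.

    `publications` is an iterable of author-name lists, one per paper.
    Returns a Counter mapping sorted (author_a, author_b) pairs to the
    number of papers they co-authored.
    """
    weights = Counter()
    for authors in publications:
        # Deduplicate and sort so each pair has one canonical key.
        for a, b in combinations(sorted(set(authors)), 2):
            weights[(a, b)] += 1
    return weights

# Toy records (hypothetical author labels, not real data).
publications = [["A", "B"], ["B", "A", "C"], ["C"]]
edges = coauthorship_edges(publications)
```

The resulting weighted pairs can be passed to any community detection routine to look for cohesive groups; single-author papers contribute no edges.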

Title: Publishing patterns reflect political polarization in news media
Subjects: Social and Information Networks (cs.SI); Computers and Society (cs.CY)

Title: Capturing social media expressions during the COVID-19 pandemic in Argentina and forecasting mental health and emotions
Comments: 12 pages, 2 figures, 3 tables
Subjects: Computers and Society (cs.CY); Social and Information Networks (cs.SI)
