Title: Comparing Alternatives to the Fixed Degree Sequence Model for Extracting the Backbone of Bipartite Projections

Abstract: Projections of bipartite or two-mode networks capture co-occurrences, and are used in diverse fields (e.g., ecology, economics, bibliometrics, politics) to represent unipartite networks. A key challenge in analyzing such networks is determining whether an observed number of co-occurrences between two nodes is significant, and therefore whether an edge exists between them. One approach, the fixed degree sequence model (FDSM), evaluates the significance of an edge's weight by comparison to a null model in which the degree sequences of the original bipartite network are fixed. Although the FDSM is an intuitive null model, it is computationally expensive because it requires Monte Carlo simulation to estimate each edge's $p$-value, and is therefore impractical for large projections. In this paper, we explore four potential alternatives to the FDSM: the fixed fill model (FFM), fixed row model (FRM), fixed column model (FCM), and stochastic degree sequence model (SDSM). We compare these models to the FDSM in terms of accuracy, speed, statistical power, similarity, and ability to recover known communities. We find that the computationally fast SDSM offers a statistically conservative but close approximation of the computationally impractical FDSM under a wide range of conditions, and that it correctly recovers a known community structure even when the signal is weak. Therefore, although each backbone model may have particular applications, we recommend the SDSM for extracting the backbone of bipartite projections when the FDSM is impractical.
Subjects: Social and Information Networks (cs.SI); Applications (stat.AP)
Journal reference: Scientific Reports, 11(1), 1-13 (2021)
DOI: 10.1038/s41598-021-03238-3
Cite as: arXiv:2105.13396 [cs.SI]
  (or arXiv:2105.13396v5 [cs.SI] for this version)
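
The abstract notes that the FDSM needs Monte Carlo simulation to estimate each edge's $p$-value. The sketch below illustrates that idea under stated assumptions and is not the authors' implementation. It assumes the bipartite network is a small 0/1 matrix whose rows are the nodes being projected, and it randomizes the matrix with simple "checkerboard" swaps, which keep every row and column sum fixed but only approximate uniform sampling. The function names (`project`, `checkerboard_swaps`, `fdsm_upper_pvalues`) and all parameter defaults are hypothetical choices made for illustration.

```python
import random


def project(B):
    """Count co-occurrences between every pair of rows of a 0/1 matrix."""
    n = len(B)
    return [[sum(a & b for a, b in zip(B[i], B[j])) for j in range(n)]
            for i in range(n)]


def checkerboard_swaps(B, n_swaps, rng):
    """Randomize B in place, keeping all row and column sums fixed.

    Assumption: this simple swap chain stands in for whatever sampler
    a real FDSM implementation would use.
    """
    rows, cols = len(B), len(B[0])
    for _ in range(n_swaps):
        r1, r2 = rng.sample(range(rows), 2)
        c1, c2 = rng.sample(range(cols), 2)
        # Only a 2x2 "checkerboard" (10/01 or 01/10) can be flipped
        # without changing any row or column sum.
        if (B[r1][c1] == B[r2][c2] and B[r1][c2] == B[r2][c1]
                and B[r1][c1] != B[r1][c2]):
            B[r1][c1], B[r1][c2] = B[r1][c2], B[r1][c1]
            B[r2][c1], B[r2][c2] = B[r2][c2], B[r2][c1]
    return B


def fdsm_upper_pvalues(B, n_samples=1000, swaps_per_sample=None, seed=0):
    """Estimate P(null weight >= observed weight) for each pair of rows."""
    rng = random.Random(seed)
    n = len(B)
    observed = project(B)
    if swaps_per_sample is None:
        # Hypothetical default: ten proposed swaps per 1 in the matrix.
        swaps_per_sample = 10 * sum(map(sum, B))
    current = [row[:] for row in B]  # leave the caller's matrix untouched
    counts = [[0] * n for _ in range(n)]
    for _ in range(n_samples):
        checkerboard_swaps(current, swaps_per_sample, rng)
        null = project(current)
        for i in range(n):
            for j in range(n):
                if null[i][j] >= observed[i][j]:
                    counts[i][j] += 1
    return [[counts[i][j] / n_samples for j in range(n)] for i in range(n)]
```

Each Monte Carlo sample re-projects the whole randomized matrix, so the cost grows with both the number of samples and the size of the projection, which is consistent with the abstract's point that this approach becomes impractical for large projections. A backbone would then keep only the edges whose estimated $p$-value falls below a chosen significance level.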

Submission history

From: Zachary Neal
[v1] Thu, 27 May 2021 18:56:04 GMT (1064kb,D)
[v2] Mon, 31 May 2021 12:24:53 GMT (1064kb,D)
[v3] Fri, 18 Jun 2021 15:02:14 GMT (1060kb,D)
[v4] Thu, 7 Oct 2021 17:55:14 GMT (1923kb,D)
[v5] Thu, 28 Oct 2021 18:06:03 GMT (955kb,D)