We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: Unbiased sampling of network ensembles

Abstract: Sampling random graphs with given properties is a key step in the analysis of networks, as random ensembles represent basic null models required to identify patterns such as communities and motifs. An important requirement is that the sampling process is unbiased and efficient. The main approaches are microcanonical, i.e. they sample graphs that match the enforced constraints exactly. Unfortunately, when applied to strongly heterogeneous networks (like most real-world examples), the majority of these approaches become biased and/or time-consuming. Moreover, the algorithms defined in the simplest cases, such as binary graphs with given degrees, are not easily generalizable to more complicated ensembles. Here we propose a solution to the problem via the introduction of a "Maximize and Sample" ("Max & Sam" for short) method to correctly sample ensembles of networks where the constraints are `soft', i.e. realized as ensemble averages. Our method is based on exact maximum-entropy distributions and is therefore unbiased by construction, even for strongly heterogeneous networks. It is also more computationally efficient than most microcanonical alternatives. Finally, it works for both binary and weighted networks with a variety of constraints, including combined degree-strength sequences and full reciprocity structure, for which no alternative method exists. Our canonical approach can in principle be turned into an unbiased microcanonical one, via a restriction to the relevant subset. Importantly, the analysis of the fluctuations of the constraints suggests that the microcanonical and canonical versions of all the ensembles considered here are not equivalent. We show various real-world applications and provide a code implementing all our algorithms.
Comments: MatLab code available at this http URL
Subjects: Methodology (stat.ME); Social and Information Networks (cs.SI); Physics and Society (physics.soc-ph)
Journal reference: New J. Phys. 17, 023052 (2015)
DOI: 10.1088/1367-2630/17/2/023052
Cite as: arXiv:1406.1197 [stat.ME]
  (or arXiv:1406.1197v3 [stat.ME] for this version)

Submission history

From: Diego Garlaschelli [view email]
[v1] Wed, 4 Jun 2014 20:05:22 GMT (1965kb,D)
[v2] Wed, 11 Jun 2014 19:54:59 GMT (1965kb,D)
[v3] Mon, 5 Jan 2015 14:47:04 GMT (1548kb,D)

Link back to: arXiv, form interface, contact.