Title: Community recovery in non-binary and temporal stochastic block models

Abstract: This article studies the estimation of community memberships from non-binary pair interactions represented by an $N$-by-$N$ tensor whose values are elements of $\mathcal S$, where $N$ is the number of nodes and $\mathcal S$ is the space of pairwise interactions between the nodes. As an information-theoretic benchmark, we study data sets generated by a non-binary stochastic block model, and derive fundamental information criteria for the recovery of the community memberships as $N \to \infty$. Examples of applications include weighted networks ($\mathcal S = \mathbb R$), link-labeled networks ($\mathcal S = \{0, 1, \dots, L\}$), multiplex networks ($\mathcal S = \{0,1\}^M$) and temporal networks ($\mathcal S = \{0,1\}^T$).
For temporal interactions, we show that (i) even a small increase in $T$ may have a big impact on the recovery of community memberships, and (ii) consistent recovery is possible even for very sparse data (e.g., bounded average degree) when $T$ is large enough. We also present several estimation algorithms, both offline and online, which fully utilise the temporal nature of the observed data. We analyse the accuracy of the proposed estimation algorithms under various assumptions on data sparsity and identifiability. Numerical experiments show that, even when started from a poor initial estimate of the community assignment (e.g., a blind random guess), the online algorithm reaches high accuracy after a small number of iterations, and remarkably so even in very sparse regimes.
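
To make the temporal setting ($\mathcal S = \{0,1\}^T$) concrete, the following is a minimal sketch of how such data could be simulated. It is not the authors' code. It assumes the simplest case, which the abstract does not specify: snapshots are independent over time, and each pair of nodes interacts with probability `p_in` if both nodes are in the same community and `p_out` otherwise. The function name `sample_temporal_sbm` and all of its parameters are hypothetical.

```python
import numpy as np


def sample_temporal_sbm(n_nodes, n_snapshots, p_in, p_out, n_communities=2, seed=0):
    """Sample community labels and an N x N x T binary interaction tensor.

    Assumption (not from the abstract): snapshots are i.i.d. over time, with
    interaction probability p_in within a community and p_out across communities.
    """
    rng = np.random.default_rng(seed)

    # Uniformly random community label for each node.
    labels = rng.integers(0, n_communities, size=n_nodes)

    # Pairwise interaction probability: p_in if same community, else p_out.
    same_community = labels[:, None] == labels[None, :]
    probs = np.where(same_community, p_in, p_out)

    # Independent Bernoulli draws for every pair and every snapshot.
    draws = rng.random((n_nodes, n_nodes, n_snapshots)) < probs[:, :, None]

    # Keep the strict upper triangle, then mirror it, so that the tensor is
    # symmetric in the node indices and has no self-interactions.
    upper = np.triu(np.ones((n_nodes, n_nodes), dtype=bool), k=1)
    draws = draws & upper[:, :, None]
    interactions = (draws | draws.transpose(1, 0, 2)).astype(np.uint8)

    return labels, interactions


if __name__ == "__main__":
    labels, x = sample_temporal_sbm(n_nodes=200, n_snapshots=5, p_in=0.05, p_out=0.01)
    # x[i, j, :] is the interaction pattern of the pair (i, j), an element of {0,1}^T.
    print(x.shape, x[0, 1, :])
```

In this sketch, the vector `x[i, j, :]` plays the role of the $\mathcal S$-valued entry for the pair $(i, j)$, and increasing `n_snapshots` corresponds to increasing $T$.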
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Probability (math.PR)
Cite as: arXiv:2008.04790 [math.ST]
  (or arXiv:2008.04790v3 [math.ST] for this version)

Submission history

From: Maximilien Dreveton
[v1] Tue, 11 Aug 2020 15:33:59 GMT (555kb)
[v2] Wed, 12 Aug 2020 15:31:33 GMT (773kb)
[v3] Fri, 7 Jan 2022 09:57:53 GMT (1853kb)
[v4] Mon, 28 Mar 2022 09:58:16 GMT (558kb)
[v5] Tue, 30 Aug 2022 09:15:32 GMT (409kb)