We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Statistics Theory

Title: Community recovery in non-binary and temporal stochastic block models

Abstract: This article studies the estimation of latent community memberships from pairwise interactions in a network of $N$ nodes, where the observed interactions can be of arbitrary type, including binary, categorical, and vector-valued, and not excluding even more general objects such as time series or spatial point patterns. As a generative model for such data, we introduce a stochastic block model with a general measurable interaction space $\mathcal S$, for which we derive information-theoretic bounds for the minimum achievable error rate. These bounds yield sharp criteria for the existence of consistent and strongly consistent estimators in terms of data sparsity, statistical similarity between intra- and inter-block interaction distributions, and the shape and size of the interaction space. The general framework makes it possible to study temporal and multiplex networks with $\mathcal S = \{0,1\}^T$, in settings where both $N \to \infty$ and $T \to \infty$, and the temporal interaction patterns are correlated over time. For temporal Markov interactions, we derive sharp consistency thresholds. We also present fast online estimation algorithms which fully utilise the non-binary nature of the observed data. Numerical experiments on synthetic and real data show that these algorithms rapidly produce accurate estimates even for very sparse data arrays.
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Probability (math.PR)
MSC classes: 62H30, 60J10, 90B15, 91D30
Cite as: arXiv:2008.04790 [math.ST]
  (or arXiv:2008.04790v5 [math.ST] for this version)

Submission history

From: Maximilien Dreveton [view email]
[v1] Tue, 11 Aug 2020 15:33:59 GMT (555kb)
[v2] Wed, 12 Aug 2020 15:31:33 GMT (773kb)
[v3] Fri, 7 Jan 2022 09:57:53 GMT (1853kb)
[v4] Mon, 28 Mar 2022 09:58:16 GMT (558kb)
[v5] Tue, 30 Aug 2022 09:15:32 GMT (409kb)

Link back to: arXiv, form interface, contact.