Community recovery in non-binary and temporal stochastic block models

11 August 2020

Abstract

This article studies the estimation of community memberships from non-binary pair interactions represented by an $N$ -by- $N$ tensor whose values are elements of $\mathcal S$ , where $N$ is the number of nodes and $\mathcal S$ is the space of the pairwise interactions between the nodes. As an information-theoretic benchmark, we study data sets generated by a non-binary stochastic block model, and derive fundamental information criteria for the recovery of the community memberships as $N \to \infty$ . Examples of applications include weighted networks ( $\mathcal S = \mathbb R$ ), link-labeled networks $(\mathcal S = \{0, 1, \dots, L\}$ ), multiplex networks $(\mathcal S = \{0,1\}^M$ ) and temporal networks ( $\mathcal S = \{0,1\}^T$ ). For temporal interactions, we show that (i) even a small increase in $T$ may have a big impact on the recovery of community memberships, (ii) consistent recovery is possible even for very sparse data (e.g.\ bounded average degree) when $T$ is large enough. We also present several estimation algorithms, both offline and online, which fully utilise the temporal nature of the observed data. We analyse the accuracy of the proposed estimation algorithms under various assumptions on data sparsity and identifiability. Numerical experiments show that even a poor initial estimate (e.g., blind random guess) of the community assignment leads to high accuracy obtained by the online algorithm after a small number of iterations, and remarkably so also in very sparse regimes.

View on arXiv

Comments on this paper