Detection in the stochastic block model with multiple clusters: proof of the achievability conjectures, acyclic BP, and the information-computation gap

In a paper that initiated the modern study of the stochastic block model, Decelle et al., backed by Mossel et al., made the following conjecture: Denote by the number of balanced communities, the probability of connecting inside communities and across, and set ; for any , it is possible to detect communities efficiently whenever (the KS threshold), whereas for , it is possible to detect communities information-theoretically for some . Massouli\é, Mossel et al.\ and Bordenave et al.\ succeeded in proving that the KS threshold is efficiently achievable for , while Mossel et al.\ proved that it cannot be crossed information-theoretically for . The above conjecture remained open for . This paper proves this conjecture, further extending the efficient detection to non-symmetrical SBMs with a generalized notion of detection and KS threshold. For the efficient part, a linearized acyclic belief propagation (ABP) algorithm is developed and proved to detect communities for any down to the KS threshold in time . Achieving this requires showing optimality of ABP in the presence of cycles, a challenge for message passing algorithms. The paper further connects ABP to a power iteration method with a nonbacktracking operator of generalized order, formalizing the interplay between message passing and spectral methods. For the information-theoretic (IT) part, a non-efficient algorithm sampling a typical clustering is shown to break down the KS threshold at . The emerging gap is shown to be large in some cases; if , the KS threshold reads whereas the IT bound reads , making the SBM a good study-case for information-computation gaps.
View on arXiv