We consider the problem of identifying underlying community-like structures in graphs. Towards this end we study the Stochastic Block Model (SBM) on -clusters: a random model on vertices, partitioned in equal sized clusters, with edges sampled independently across clusters with probability and within clusters with probability , . The goal is to recover the initial "hidden" partition of . We study semidefinite programming (SDP) based algorithms in this context. In the regime and we show that a certain natural SDP based algorithm solves the problem of {\em exact recovery} in the -community SBM, with high probability, whenever , as long as . This threshold is known to be the information theoretically optimal. We also study the case when . In this case however we achieve recovery guarantees that no longer match the optimal condition , thus leaving achieving optimality for this range an open question.
View on arXiv