6
27

Exact Recovery in the Hypergraph Stochastic Block Model: a Spectral Algorithm

Abstract

We consider the exact recovery problem in the hypergraph stochastic block model (HSBM) with kk blocks of equal size. More precisely, we consider a random dd-uniform hypergraph HH with nn vertices partitioned into kk clusters of size s=n/ks = n / k. Hyperedges ee are added independently with probability pp if ee is contained within a single cluster and qq otherwise, where 0q<p10 \leq q < p \leq 1. We present a spectral algorithm which recovers the clusters exactly with high probability, given mild conditions on n,k,p,qn, k, p, q, and dd. Our algorithm is based on the adjacency matrix of HH, which is a symmetric n×nn \times n matrix whose (u,v)(u, v)-th entry is the number of hyperedges containing both uu and vv. To the best of our knowledge, our algorithm is the first to guarantee exact recovery when the number of clusters k=Θ(n)k=\Theta(\sqrt{n}).

View on arXiv
Comments on this paper