Community detection for binary graphical models in high dimension

Let components be partitioned into two communities, denoted and , possibly of different sizes. Assume that they are connected via a directed and weighted Erd\"os-R\ényi random graph (DWER) with unknown parameter The weights assigned to the existing connections are of mean-field type, scaling as . At each time unit, we observe the state of each component: either it sends some signal to its successors (in the directed graph) or remains silent otherwise. In this paper, we show that it is possible to find the communities and based only on the activity of the components observed over time units. More specifically, we propose a simple algorithm for which the probability of {\it exact recovery} converges to as long as , as and diverge. Interestingly, this simple algorithm does not require any prior knowledge on the other model parameters (e.g. the edge probability ). The key step in our analysis is to derive an asymptotic approximation of the one unit time-lagged covariance matrix associated to the states of the components, as diverges. This asymptotic approximation relies on the study of the behavior of the solutions of a matrix equation of Stein type satisfied by the simultaneous (0-lagged) covariance matrix associated to the states of the components. This study is challenging, specially because the simultaneous covariance matrix is random since it depends on the underlying DWER random graph.
View on arXiv