Square Hellinger Subadditivity for Bayesian Networks and its Applications to Identity Testing

We show that the square Hellinger distance between two Bayesian networks on the same directed graph, , is subadditive with respect to the neighborhoods of . Namely, if and are the probability distributions defined by two Bayesian networks on the same DAG, our inequality states that the square Hellinger distance, , between and is upper bounded by the sum, , of the square Hellinger distances between the marginals of and on every node and its parents in the DAG. Importantly, our bound does not involve the conditionals but the marginals of and . We derive a similar inequality for more general Markov Random Fields. As an application of our inequality, we show that distinguishing whether two Bayesian networks and on the same (but potentially unknown) DAG satisfy vs can be performed from samples, where is the maximum in-degree of the DAG and the domain of each variable of the Bayesian networks. If and are defined on potentially different and potentially unknown trees, the sample complexity becomes , whose dependence on is optimal up to logarithmic factors. Lastly, if and are product distributions over and is known, the sample complexity becomes , which is optimal up to constant factors.
View on arXiv