18
0

Computational Complexity of Normalizing Constants for the Product of Determinantal Point Processes

Abstract

We consider the product of determinantal point processes (DPPs), a point process whose probability mass is proportional to the product of principal minors of multiple matrices, as a natural, promising generalization of DPPs. We study the computational complexity of computing its normalizing constant, which is among the most essential probabilistic inference tasks. Our complexity-theoretic results (almost) rule out the existence of efficient algorithms for this task unless the input matrices are forced to have favorable structures. In particular, we prove the following: (1) Computing Sdet(AS,S)p\sum_S\det({\bf A}_{S,S})^p exactly for every (fixed) positive even integer pp is UP-hard and Mod3_3P-hard, which gives a negative answer to an open question posed by Kulesza and Taskar. (2) Sdet(AS,S)det(BS,S)det(CS,S)\sum_S\det({\bf A}_{S,S})\det({\bf B}_{S,S})\det({\bf C}_{S,S}) is NP-hard to approximate within a factor of 2O(I1ϵ)2^{O(|I|^{1-\epsilon})} or 2O(n1/ϵ)2^{O(n^{1/\epsilon})} for any ϵ>0\epsilon>0, where I|I| is the input size and nn is the order of the input matrix. This result is stronger than the #P-hardness for the case of two matrices derived by Gillenwater. (3) There exists a kO(k)nO(1)k^{O(k)}n^{O(1)}-time algorithm for computing Sdet(AS,S)det(BS,S)\sum_S\det({\bf A}_{S,S})\det({\bf B}_{S,S}), where kk is the maximum rank of A\bf A and B\bf B or the treewidth of the graph formed by nonzero entries of A\bf A and B\bf B. Such parameterized algorithms are said to be fixed-parameter tractable. These results can be extended to the fixed-size case. Further, we present two applications of fixed-parameter tractable algorithms given a matrix A\bf A of treewidth ww: (4) We can compute a 2n2p12^{\frac{n}{2p-1}}-approximation to Sdet(AS,S)p\sum_S\det({\bf A}_{S,S})^p for any fractional number p>1p>1 in wO(wp)nO(1)w^{O(wp)}n^{O(1)} time. (5) We can find a 2n2^{\sqrt n}-approximation to unconstrained MAP inference in wO(wn)nO(1)w^{O(w\sqrt n)}n^{O(1)} time.

View on arXiv
Comments on this paper