Density Estimation in Infinite Dimensional Exponential Families

In this paper, we consider an infinite dimensional exponential family, of probability densities, which are parametrized by functions in a reproducing kernel Hilbert space, and show it to be quite rich in the sense that a broad class of densities on can be approximated arbitrarily well in Kullback-Leibler (KL) divergence by elements in . The main goal of the paper is to estimate an unknown density, through an element in . Standard techniques like maximum likelihood estimation (MLE) or pseudo MLE (based on the method of sieves), which are based on minimizing the KL divergence between and , do not yield practically useful estimators because of their inability to efficiently handle the log-partition function. Instead, we propose an estimator, based on minimizing the \emph{Fisher divergence}, between and , which involves solving a simple finite-dimensional linear system. When , we show that the proposed estimator is consistent, and provide a convergence rate of in Fisher divergence under the smoothness assumption that for some , where is a certain Hilbert-Schmidt operator on and denotes the image of . We also investigate the misspecified case of and show that as , and provide a rate for this convergence under a similar smoothness condition as above. Through numerical simulations we demonstrate that the proposed estimator outperforms the non-parametric kernel density estimator, and that the advantage with the proposed estimator grows as increases.
View on arXiv