42
125

Density Estimation in Infinite Dimensional Exponential Families

Abstract

In this paper, we consider an infinite dimensional exponential family, P\mathcal{P} of probability densities, which are parametrized by functions in a reproducing kernel Hilbert space, HH and show it to be quite rich in the sense that a broad class of densities on Rd\mathbb{R}^d can be approximated arbitrarily well in Kullback-Leibler (KL) divergence by elements in P\mathcal{P}. The main goal of the paper is to estimate an unknown density, p0p_0 through an element in P\mathcal{P}. Standard techniques like maximum likelihood estimation (MLE) or pseudo MLE (based on the method of sieves), which are based on minimizing the KL divergence between p0p_0 and P\mathcal{P}, do not yield practically useful estimators because of their inability to efficiently handle the log-partition function. Instead, we propose an estimator, p^n\hat{p}_n based on minimizing the \emph{Fisher divergence}, J(p0p)J(p_0\Vert p) between p0p_0 and pPp\in \mathcal{P}, which involves solving a simple finite-dimensional linear system. When p0Pp_0\in\mathcal{P}, we show that the proposed estimator is consistent, and provide a convergence rate of nmin{23,2β+12β+2}n^{-\min\left\{\frac{2}{3},\frac{2\beta+1}{2\beta+2}\right\}} in Fisher divergence under the smoothness assumption that logp0R(Cβ)\log p_0\in\mathcal{R}(C^\beta) for some β0\beta\ge 0, where CC is a certain Hilbert-Schmidt operator on HH and R(Cβ)\mathcal{R}(C^\beta) denotes the image of CβC^\beta. We also investigate the misspecified case of p0Pp_0\notin\mathcal{P} and show that J(p0p^n)infpPJ(p0p)J(p_0\Vert\hat{p}_n)\rightarrow \inf_{p\in\mathcal{P}}J(p_0\Vert p) as nn\rightarrow\infty, and provide a rate for this convergence under a similar smoothness condition as above. Through numerical simulations we demonstrate that the proposed estimator outperforms the non-parametric kernel density estimator, and that the advantage with the proposed estimator grows as dd increases.

View on arXiv
Comments on this paper