Asymptotic Properties of Bayesian Predictive Densities When the Distributions of Data and Target Variables are Different

Abstract

Bayesian predictive densities are investigated, using the framework of information geometry, for the case in which the observed data $x$ and the target variable $y$ to be predicted have different distributions. The performance of predictive densities is evaluated by the Kullback–Leibler divergence. The parametric models are formulated as Riemannian manifolds. In the conventional setting, in which $x$ and $y$ have the same distribution, the Fisher–Rao metric and the Jeffreys prior play essential roles. In the present setting, in which $x$ and $y$ have different distributions, the corresponding roles are played by a new metric, which we call the predictive metric, constructed from the Fisher information matrices of $x$ and $y$, and by the volume element based on the predictive metric. It is shown that Bayesian predictive densities based on priors constructed from non-constant positive superharmonic functions with respect to the predictive metric asymptotically dominate those based on the volume element prior of the predictive metric.
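
For orientation, the objects named in the abstract can be written out in standard notation. This is a sketch under assumed symbols, none of which are fixed by the abstract itself: $p(x \mid \theta)$ and $q(y \mid \theta)$ denote the densities of the data and the target sharing a parameter $\theta$, $\pi$ is a prior, and $\tilde{g}$ stands for the predictive metric, whose explicit form is not given here.

```latex
% Bayesian predictive density for y given x, under prior \pi (assumed notation)
q_\pi(y \mid x) = \int q(y \mid \theta)\, \pi(\theta \mid x)\, d\theta,
\qquad
\pi(\theta \mid x) = \frac{p(x \mid \theta)\, \pi(\theta)}{\int p(x \mid \theta')\, \pi(\theta')\, d\theta'} .

% Kullback-Leibler risk of a predictive density \hat{q}(y \mid x) at \theta
R(\theta, \hat{q}) = \int p(x \mid \theta) \int q(y \mid \theta)\,
  \log \frac{q(y \mid \theta)}{\hat{q}(y \mid x)} \, dy \, dx .

% A function \varphi is superharmonic with respect to a Riemannian metric \tilde{g}
% when its Laplace-Beltrami operator is non-positive:
\Delta_{\tilde{g}} \varphi
  = \frac{1}{\sqrt{|\tilde{g}|}}\, \partial_i \!\left( \sqrt{|\tilde{g}|}\; \tilde{g}^{ij}\, \partial_j \varphi \right) \le 0 .
```

In this notation, the conventional setting is the case $p = q$, and "dominate" is understood in terms of the risk $R$ as the sample size grows.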
