18
3

The square root rule for adaptive importance sampling

Abstract

In adaptive importance sampling, and other contexts, we have K>1K>1 unbiased and uncorrelated estimates μ^k\hat\mu_k of a common quantity μ\mu. The optimal unbiased linear combination weights them inversely to their variances but those weights are unknown and hard to estimate. A simple deterministic square root rule based on a working model that Var(μ^k)k1/2\mathrm{Var}(\hat\mu_k)\propto k^{-1/2} gives an unbisaed estimate of μ\mu that is nearly optimal under a wide range of alternative variance patterns. We show that if Var(μ^k)ky\mathrm{Var}(\hat\mu_k)\propto k^{-y} for an unknown rate parameter y[0,1]y\in [0,1] then the square root rule yields the optimal variance rate with a constant that is too large by at most 9/89/8 for any 0y10\le y\le 1 and any number KK of estimates. Numerical work shows that rule is similarly robust to some other patterns with mildly decreasing variance as kk increases.

View on arXiv
Comments on this paper