Deterministic Finite-Memory Bias Estimation

In this paper we consider the problem of estimating a Bernoulli parameter using finite memory. Let be a sequence of independent identically distributed Bernoulli random variables with expectation , where . Consider a finite-memory deterministic machine with states, that updates its state at each time according to the rule , where is a deterministic time-invariant function. Assume that the machine outputs an estimate at each time point according to some fixed mapping from the state space to the unit interval. The quality of the estimation procedure is measured by the asymptotic risk, which is the long-term average of the instantaneous quadratic risk. The main contribution of this paper is an upper bound on the smallest worst-case asymptotic risk any such machine can attain. This bound coincides with a lower bound derived by Leighton and Rivest, to imply that is the minimax asymptotic risk for deterministic -state machines. In particular, our result disproves a longstanding conjecture for this quantity, also posed by Leighton and Rivest.
View on arXiv