Optimistic Agents are Asymptotically Optimal

29 September 2012

Marcus Hutter

Abstract

We use optimism to introduce generic asymptotically optimal reinforcement learning agents. For an arbitrary finite or compact class of environments, these agents achieve asymptotically optimal behavior. Furthermore, in the finite deterministic case we provide finite error bounds.
