Optimal Activation of Halting Multi-Armed Bandit Models

Abstract
We study new types of dynamic allocation problems, the Halting Bandit models. As an application, we obtain new proofs of the classic Gittins index decomposition result and of recent results of the authors in "Multi-armed bandits under general depreciation and commitment."