13
58

Nearly Instance Optimal Sample Complexity Bounds for Top-k Arm Selection

Abstract

In the Best-kk-Arm problem, we are given nn stochastic bandit arms, each associated with an unknown reward distribution. We are required to identify the kk arms with the largest means by taking as few samples as possible. In this paper, we make progress towards a complete characterization of the instance-wise sample complexity bounds for the Best-kk-Arm problem. On the lower bound side, we obtain a novel complexity term to measure the sample complexity that every Best-kk-Arm instance requires. This is derived by an interesting and nontrivial reduction from the Best-11-Arm problem. We also provide an elimination-based algorithm that matches the instance-wise lower bound within doubly-logarithmic factors. The sample complexity of our algorithm strictly dominates the state-of-the-art for Best-kk-Arm (module constant factors).

View on arXiv
Comments on this paper