On the Complexity of Best Arm Identification in Multi-Armed Bandit Models

16 July 2014

Papers citing "On the Complexity of Best Arm Identification in Multi-Armed Bandit Models"

50 / 147 papers shown

Title
Near Optimal Best Arm Identification for Clustered Bandits Yash Nikhil Karamchandani Avishek Ghosh 23 0 0 15 May 2025
Sample Complexity of Identifying the Nonredundancy of Nontransitive Games in Dueling Bandits Shang Lu Shuji Kijima 40 0 0 08 May 2025
On Stopping Times of Power-one Sequential Tests: Tight Lower and Upper Bounds Shubhada Agrawal Aaditya Ramdas 29 0 0 28 Apr 2025
On the Problem of Best Arm Retention Houshuang Chen Yuchen He Chihao Zhang 39 0 0 16 Apr 2025
Cost-Aware Optimal Pairwise Pure Exploration Di Wu Chengshuai Shi Ruida Zhou Cong Shen 41 0 0 10 Mar 2025
Online Clustering with Bandit Information G Dhinesh Chandran Srinivas Reddy Kota Srikrishna Bhashyam 69 0 0 20 Jan 2025
Enhancing Preference-based Linear Bandits via Human Response Time Shen Li Yuyang Zhang Zhaolin Ren Claire Liang Na Li J. Shah 42 0 0 03 Jan 2025
AExGym: Benchmarks and Environments for Adaptive Experimentation Jimmy Wang Ethan Che Daniel R. Jiang Hongseok Namkoong 42 0 0 08 Aug 2024
On Speeding Up Language Model Evaluation Jin Peng Zhou Christian K. Belardi Ruihan Wu Travis Zhang Carla P. Gomes Wen Sun Kilian Q. Weinberger 58 1 0 08 Jul 2024
Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis Qining Zhang Honghao Wei Lei Ying OffRL 67 1 0 11 Jun 2024
Adaptive Online Experimental Design for Causal Discovery Muhammad Qasim Elahi Lai Wei Murat Kocaoglu Mahsa Ghasemi CML 41 1 0 19 May 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits Nicolas Nguyen Imad Aouali András Gyorgy Claire Vernade 42 2 0 08 Feb 2024
HoloBeam: Learning Optimal Beamforming in Far-Field Holographic Metasurface Transceivers D. Ghosh M. Hanawal Nikola Zlatanov 15 0 0 30 Dec 2023
Best Arm Identification in Batched Multi-armed Bandit Problems Sheng Cao Simai He Ruoqing Jiang Jin Xu Hongsong Yuan 15 1 0 21 Dec 2023
Locally Optimal Fixed-Budget Best Arm Identification in Two-Armed Gaussian Bandits with Unknown Variances Masahiro Kato 38 3 0 20 Dec 2023
Bandit Pareto Set Identification: the Fixed Budget Setting Cyrille Kone Emilie Kaufmann Laura Richert 43 3 0 07 Nov 2023
An Anytime Algorithm for Good Arm Identification Marc Jourdan Clémence Réda 30 2 0 16 Oct 2023
CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption Shubhada Agrawal Timothée Mathieu D. Basu Odalric-Ambrym Maillard 30 2 0 28 Sep 2023
On the Complexity of Differentially Private Best-Arm Identification with Fixed Confidence Achraf Azize Marc Jourdan Aymen Al Marjani D. Basu 44 3 0 05 Sep 2023
Thompson Sampling for Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit Shintaro Nakamura Masashi Sugiyama 16 4 0 20 Aug 2023
More PAC-Bayes bounds: From bounded losses, to losses with general tail behaviors, to anytime validity Borja Rodríguez Gálvez Ragnar Thobaben Mikael Skoglund 30 9 0 21 Jun 2023
Sequential Best-Arm Identification with Application to Brain-Computer Interface Xiaoping Zhou Botao Hao Jian Kang Tor Lattimore Lexin Li 35 2 0 17 May 2023
Bayesian Synthetic Likelihood David T. Frazier Christopher C. Drovandi David J. Nott 39 217 0 09 May 2023
Best Arm Identification with Fairness Constraints on Subpopulations Yuhang Wu Zeyu Zheng Tingyu Zhu 19 8 0 08 Apr 2023
Instance-dependent Sample Complexity Bounds for Zero-sum Matrix Games Arnab Maiti Kevin G. Jamieson Lillian J. Ratliff 36 6 0 19 Mar 2023
Open Problem: Optimal Best Arm Identification with Fixed Budget Chao Qin 27 18 0 02 Mar 2023
Genetic multi-armed bandits: a reinforcement learning approach for discrete optimization via simulation D. Preil M. Krapp AI4CE 14 1 0 15 Feb 2023
Evaluating COVID-19 vaccine allocation policies using Bayesian $m$ -top exploration Alexandra Cimpean T. Verstraeten L. Willem N. Hens Ann Nowé Pieter J. K. Libin 21 2 0 30 Jan 2023
UB3: Best Beam Identification in Millimeter Wave Systems via Pure Exploration Unimodal Bandits D. Ghosh Haseen Rahman M. Hanawal Nikola Zlatanov 13 1 0 26 Dec 2022
Constrained Pure Exploration Multi-Armed Bandits with a Fixed Budget Fathima Zarin Faizal Jayakrishnan Nair 16 7 0 27 Nov 2022
Bayesian Fixed-Budget Best-Arm Identification Alexia Atsidakou S. Katariya Sujay Sanghavi B. Kveton 33 11 0 15 Nov 2022
Adaptive Data Depth via Multi-Armed Bandits Tavor Z. Baharav T. Lai 23 1 0 08 Nov 2022
Adaptive Experimental Design and Counterfactual Inference Tanner Fiez Sergio Gamez Arick Chen Houssam Nassif Lalit P. Jain 25 7 0 25 Oct 2022
Anytime-valid off-policy inference for contextual bandits Ian Waudby-Smith Lili Wu Aaditya Ramdas Nikos Karampatziakis Paul Mineiro OffRL 43 25 0 19 Oct 2022
Federated Best Arm Identification with Heterogeneous Clients Zhirui Chen P. Karthik Vincent Y. F. Tan Yeow Meng Chee FedML 39 8 0 14 Oct 2022
Thompson Sampling with Virtual Helping Agents Kartikey Pant Amod Hegde K. V. Srinivas 17 0 0 16 Sep 2022
Best Arm Identification with Contextual Information under a Small Gap Masahiro Kato Masaaki Imaizumi Takuya Ishihara T. Kitagawa 27 2 0 15 Sep 2022
Almost Cost-Free Communication in Federated Best Arm Identification Kota Srinivas Reddy P. Karthik Vincent Y. F. Tan FedML 36 11 0 19 Aug 2022
Best Policy Identification in Linear MDPs Jerome Taupin Yassir Jedra Alexandre Proutière 44 4 0 11 Aug 2022
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference Debangshu Banerjee Avishek Ghosh Sayak Ray Chowdhury Aditya Gopalan 35 9 0 23 Jul 2022
Unsupervised Crowdsourcing with Accuracy and Cost Guarantees Yash Didwania Jayakrishnan Nair N. Hemachandra 21 1 0 05 Jul 2022
Active Learning with Safety Constraints Romain Camilleri Andrew Wagenmaker Jamie Morgenstern Lalit P. Jain Kevin G. Jamieson 28 12 0 22 Jun 2022
On the Finite-Time Performance of the Knowledge Gradient Algorithm Yanwen Li Siyang Gao 32 4 0 14 Jun 2022
Best Arm Identification in Restless Markov Multi-Armed Bandits P. Karthik Kota Srinivas Reddy Vincent Y. F. Tan 30 4 0 29 Mar 2022
Approximate Function Evaluation via Multi-Armed Bandits Tavor Z. Baharav Gary Cheng Mert Pilanci David Tse 19 6 0 18 Mar 2022
Meta-Learning for Simple Regret Minimization Javad Azizi B. Kveton Mohammad Ghavamzadeh S. Katariya 22 10 0 25 Feb 2022
Exploiting Correlation to Achieve Faster Learning Rates in Low-Rank Preference Bandits Suprovat Ghoshal Aadirupa Saha 23 11 0 23 Feb 2022
Adaptive Experimentation in the Presence of Exogenous Nonstationary Variation Chao Qin Daniel Russo 58 6 0 18 Feb 2022
Best Arm Identification with Safety Constraints Zhenlin Wang Andrew Wagenmaker Kevin G. Jamieson 27 21 0 23 Nov 2021
Sequential Community Mode Estimation S. Jain Shreyas Goenka Divyam Bapna Nikhil Karamchandani Jaya Nair 6 2 0 16 Nov 2021