Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1407.4443
Cited By
On the Complexity of Best Arm Identification in Multi-Armed Bandit Models
16 July 2014
E. Kaufmann
Olivier Cappé
Aurélien Garivier
Re-assign community
ArXiv
PDF
HTML
Papers citing
"On the Complexity of Best Arm Identification in Multi-Armed Bandit Models"
50 / 147 papers shown
Title
Near Optimal Best Arm Identification for Clustered Bandits
Yash
Nikhil Karamchandani
Avishek Ghosh
23
0
0
15 May 2025
Sample Complexity of Identifying the Nonredundancy of Nontransitive Games in Dueling Bandits
Shang Lu
Shuji Kijima
40
0
0
08 May 2025
On Stopping Times of Power-one Sequential Tests: Tight Lower and Upper Bounds
Shubhada Agrawal
Aaditya Ramdas
29
0
0
28 Apr 2025
On the Problem of Best Arm Retention
Houshuang Chen
Yuchen He
Chihao Zhang
39
0
0
16 Apr 2025
Cost-Aware Optimal Pairwise Pure Exploration
Di Wu
Chengshuai Shi
Ruida Zhou
Cong Shen
41
0
0
10 Mar 2025
Online Clustering with Bandit Information
G Dhinesh Chandran
Srinivas Reddy Kota
Srikrishna Bhashyam
69
0
0
20 Jan 2025
Enhancing Preference-based Linear Bandits via Human Response Time
Shen Li
Yuyang Zhang
Zhaolin Ren
Claire Liang
Na Li
J. Shah
42
0
0
03 Jan 2025
AExGym: Benchmarks and Environments for Adaptive Experimentation
Jimmy Wang
Ethan Che
Daniel R. Jiang
Hongseok Namkoong
42
0
0
08 Aug 2024
On Speeding Up Language Model Evaluation
Jin Peng Zhou
Christian K. Belardi
Ruihan Wu
Travis Zhang
Carla P. Gomes
Wen Sun
Kilian Q. Weinberger
58
1
0
08 Jul 2024
Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis
Qining Zhang
Honghao Wei
Lei Ying
OffRL
67
1
0
11 Jun 2024
Adaptive Online Experimental Design for Causal Discovery
Muhammad Qasim Elahi
Lai Wei
Murat Kocaoglu
Mahsa Ghasemi
CML
41
1
0
19 May 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
Nicolas Nguyen
Imad Aouali
András Gyorgy
Claire Vernade
42
2
0
08 Feb 2024
HoloBeam: Learning Optimal Beamforming in Far-Field Holographic Metasurface Transceivers
D. Ghosh
M. Hanawal
Nikola Zlatanov
15
0
0
30 Dec 2023
Best Arm Identification in Batched Multi-armed Bandit Problems
Sheng Cao
Simai He
Ruoqing Jiang
Jin Xu
Hongsong Yuan
15
1
0
21 Dec 2023
Locally Optimal Fixed-Budget Best Arm Identification in Two-Armed Gaussian Bandits with Unknown Variances
Masahiro Kato
38
3
0
20 Dec 2023
Bandit Pareto Set Identification: the Fixed Budget Setting
Cyrille Kone
Emilie Kaufmann
Laura Richert
43
3
0
07 Nov 2023
An Anytime Algorithm for Good Arm Identification
Marc Jourdan
Clémence Réda
30
2
0
16 Oct 2023
CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption
Shubhada Agrawal
Timothée Mathieu
D. Basu
Odalric-Ambrym Maillard
30
2
0
28 Sep 2023
On the Complexity of Differentially Private Best-Arm Identification with Fixed Confidence
Achraf Azize
Marc Jourdan
Aymen Al Marjani
D. Basu
44
3
0
05 Sep 2023
Thompson Sampling for Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit
Shintaro Nakamura
Masashi Sugiyama
16
4
0
20 Aug 2023
More PAC-Bayes bounds: From bounded losses, to losses with general tail behaviors, to anytime validity
Borja Rodríguez Gálvez
Ragnar Thobaben
Mikael Skoglund
30
9
0
21 Jun 2023
Sequential Best-Arm Identification with Application to Brain-Computer Interface
Xiaoping Zhou
Botao Hao
Jian Kang
Tor Lattimore
Lexin Li
35
2
0
17 May 2023
Bayesian Synthetic Likelihood
David T. Frazier
Christopher C. Drovandi
David J. Nott
39
217
0
09 May 2023
Best Arm Identification with Fairness Constraints on Subpopulations
Yuhang Wu
Zeyu Zheng
Tingyu Zhu
19
8
0
08 Apr 2023
Instance-dependent Sample Complexity Bounds for Zero-sum Matrix Games
Arnab Maiti
Kevin G. Jamieson
Lillian J. Ratliff
36
6
0
19 Mar 2023
Open Problem: Optimal Best Arm Identification with Fixed Budget
Chao Qin
27
18
0
02 Mar 2023
Genetic multi-armed bandits: a reinforcement learning approach for discrete optimization via simulation
D. Preil
M. Krapp
AI4CE
14
1
0
15 Feb 2023
Evaluating COVID-19 vaccine allocation policies using Bayesian
m
m
m
-top exploration
Alexandra Cimpean
T. Verstraeten
L. Willem
N. Hens
Ann Nowé
Pieter J. K. Libin
21
2
0
30 Jan 2023
UB3: Best Beam Identification in Millimeter Wave Systems via Pure Exploration Unimodal Bandits
D. Ghosh
Haseen Rahman
M. Hanawal
Nikola Zlatanov
13
1
0
26 Dec 2022
Constrained Pure Exploration Multi-Armed Bandits with a Fixed Budget
Fathima Zarin Faizal
Jayakrishnan Nair
16
7
0
27 Nov 2022
Bayesian Fixed-Budget Best-Arm Identification
Alexia Atsidakou
S. Katariya
Sujay Sanghavi
B. Kveton
33
11
0
15 Nov 2022
Adaptive Data Depth via Multi-Armed Bandits
Tavor Z. Baharav
T. Lai
23
1
0
08 Nov 2022
Adaptive Experimental Design and Counterfactual Inference
Tanner Fiez
Sergio Gamez
Arick Chen
Houssam Nassif
Lalit P. Jain
25
7
0
25 Oct 2022
Anytime-valid off-policy inference for contextual bandits
Ian Waudby-Smith
Lili Wu
Aaditya Ramdas
Nikos Karampatziakis
Paul Mineiro
OffRL
43
25
0
19 Oct 2022
Federated Best Arm Identification with Heterogeneous Clients
Zhirui Chen
P. Karthik
Vincent Y. F. Tan
Yeow Meng Chee
FedML
39
8
0
14 Oct 2022
Thompson Sampling with Virtual Helping Agents
Kartikey Pant
Amod Hegde
K. V. Srinivas
17
0
0
16 Sep 2022
Best Arm Identification with Contextual Information under a Small Gap
Masahiro Kato
Masaaki Imaizumi
Takuya Ishihara
T. Kitagawa
27
2
0
15 Sep 2022
Almost Cost-Free Communication in Federated Best Arm Identification
Kota Srinivas Reddy
P. Karthik
Vincent Y. F. Tan
FedML
36
11
0
19 Aug 2022
Best Policy Identification in Linear MDPs
Jerome Taupin
Yassir Jedra
Alexandre Proutière
44
4
0
11 Aug 2022
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference
Debangshu Banerjee
Avishek Ghosh
Sayak Ray Chowdhury
Aditya Gopalan
35
9
0
23 Jul 2022
Unsupervised Crowdsourcing with Accuracy and Cost Guarantees
Yash Didwania
Jayakrishnan Nair
N. Hemachandra
21
1
0
05 Jul 2022
Active Learning with Safety Constraints
Romain Camilleri
Andrew Wagenmaker
Jamie Morgenstern
Lalit P. Jain
Kevin G. Jamieson
28
12
0
22 Jun 2022
On the Finite-Time Performance of the Knowledge Gradient Algorithm
Yanwen Li
Siyang Gao
32
4
0
14 Jun 2022
Best Arm Identification in Restless Markov Multi-Armed Bandits
P. Karthik
Kota Srinivas Reddy
Vincent Y. F. Tan
30
4
0
29 Mar 2022
Approximate Function Evaluation via Multi-Armed Bandits
Tavor Z. Baharav
Gary Cheng
Mert Pilanci
David Tse
19
6
0
18 Mar 2022
Meta-Learning for Simple Regret Minimization
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
S. Katariya
22
10
0
25 Feb 2022
Exploiting Correlation to Achieve Faster Learning Rates in Low-Rank Preference Bandits
Suprovat Ghoshal
Aadirupa Saha
23
11
0
23 Feb 2022
Adaptive Experimentation in the Presence of Exogenous Nonstationary Variation
Chao Qin
Daniel Russo
58
6
0
18 Feb 2022
Best Arm Identification with Safety Constraints
Zhenlin Wang
Andrew Wagenmaker
Kevin G. Jamieson
27
21
0
23 Nov 2021
Sequential Community Mode Estimation
S. Jain
Shreyas Goenka
Divyam Bapna
Nikhil Karamchandani
Jaya Nair
6
2
0
16 Nov 2021
1
2
3
Next