ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1407.4443
  4. Cited By
On the Complexity of Best Arm Identification in Multi-Armed Bandit
  Models

On the Complexity of Best Arm Identification in Multi-Armed Bandit Models

16 July 2014
E. Kaufmann
Olivier Cappé
Aurélien Garivier
ArXivPDFHTML

Papers citing "On the Complexity of Best Arm Identification in Multi-Armed Bandit Models"

50 / 147 papers shown
Title
Near Optimal Best Arm Identification for Clustered Bandits
Near Optimal Best Arm Identification for Clustered Bandits
Yash
Nikhil Karamchandani
Avishek Ghosh
23
0
0
15 May 2025
Sample Complexity of Identifying the Nonredundancy of Nontransitive Games in Dueling Bandits
Sample Complexity of Identifying the Nonredundancy of Nontransitive Games in Dueling Bandits
Shang Lu
Shuji Kijima
40
0
0
08 May 2025
On Stopping Times of Power-one Sequential Tests: Tight Lower and Upper Bounds
On Stopping Times of Power-one Sequential Tests: Tight Lower and Upper Bounds
Shubhada Agrawal
Aaditya Ramdas
29
0
0
28 Apr 2025
On the Problem of Best Arm Retention
On the Problem of Best Arm Retention
Houshuang Chen
Yuchen He
Chihao Zhang
39
0
0
16 Apr 2025
Cost-Aware Optimal Pairwise Pure Exploration
Di Wu
Chengshuai Shi
Ruida Zhou
Cong Shen
41
0
0
10 Mar 2025
Online Clustering with Bandit Information
Online Clustering with Bandit Information
G Dhinesh Chandran
Srinivas Reddy Kota
Srikrishna Bhashyam
69
0
0
20 Jan 2025
Enhancing Preference-based Linear Bandits via Human Response Time
Enhancing Preference-based Linear Bandits via Human Response Time
Shen Li
Yuyang Zhang
Zhaolin Ren
Claire Liang
Na Li
J. Shah
42
0
0
03 Jan 2025
AExGym: Benchmarks and Environments for Adaptive Experimentation
AExGym: Benchmarks and Environments for Adaptive Experimentation
Jimmy Wang
Ethan Che
Daniel R. Jiang
Hongseok Namkoong
42
0
0
08 Aug 2024
On Speeding Up Language Model Evaluation
On Speeding Up Language Model Evaluation
Jin Peng Zhou
Christian K. Belardi
Ruihan Wu
Travis Zhang
Carla P. Gomes
Wen Sun
Kilian Q. Weinberger
58
1
0
08 Jul 2024
Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis
Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis
Qining Zhang
Honghao Wei
Lei Ying
OffRL
67
1
0
11 Jun 2024
Adaptive Online Experimental Design for Causal Discovery
Adaptive Online Experimental Design for Causal Discovery
Muhammad Qasim Elahi
Lai Wei
Murat Kocaoglu
Mahsa Ghasemi
CML
41
1
0
19 May 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
Nicolas Nguyen
Imad Aouali
András Gyorgy
Claire Vernade
42
2
0
08 Feb 2024
HoloBeam: Learning Optimal Beamforming in Far-Field Holographic
  Metasurface Transceivers
HoloBeam: Learning Optimal Beamforming in Far-Field Holographic Metasurface Transceivers
D. Ghosh
M. Hanawal
Nikola Zlatanov
15
0
0
30 Dec 2023
Best Arm Identification in Batched Multi-armed Bandit Problems
Best Arm Identification in Batched Multi-armed Bandit Problems
Sheng Cao
Simai He
Ruoqing Jiang
Jin Xu
Hongsong Yuan
15
1
0
21 Dec 2023
Locally Optimal Fixed-Budget Best Arm Identification in Two-Armed
  Gaussian Bandits with Unknown Variances
Locally Optimal Fixed-Budget Best Arm Identification in Two-Armed Gaussian Bandits with Unknown Variances
Masahiro Kato
38
3
0
20 Dec 2023
Bandit Pareto Set Identification: the Fixed Budget Setting
Bandit Pareto Set Identification: the Fixed Budget Setting
Cyrille Kone
Emilie Kaufmann
Laura Richert
43
3
0
07 Nov 2023
An Anytime Algorithm for Good Arm Identification
An Anytime Algorithm for Good Arm Identification
Marc Jourdan
Clémence Réda
30
2
0
16 Oct 2023
CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded
  Stochastic Corruption
CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption
Shubhada Agrawal
Timothée Mathieu
D. Basu
Odalric-Ambrym Maillard
30
2
0
28 Sep 2023
On the Complexity of Differentially Private Best-Arm Identification with
  Fixed Confidence
On the Complexity of Differentially Private Best-Arm Identification with Fixed Confidence
Achraf Azize
Marc Jourdan
Aymen Al Marjani
D. Basu
44
3
0
05 Sep 2023
Thompson Sampling for Real-Valued Combinatorial Pure Exploration of
  Multi-Armed Bandit
Thompson Sampling for Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit
Shintaro Nakamura
Masashi Sugiyama
16
4
0
20 Aug 2023
More PAC-Bayes bounds: From bounded losses, to losses with general tail
  behaviors, to anytime validity
More PAC-Bayes bounds: From bounded losses, to losses with general tail behaviors, to anytime validity
Borja Rodríguez Gálvez
Ragnar Thobaben
Mikael Skoglund
30
9
0
21 Jun 2023
Sequential Best-Arm Identification with Application to Brain-Computer
  Interface
Sequential Best-Arm Identification with Application to Brain-Computer Interface
Xiaoping Zhou
Botao Hao
Jian Kang
Tor Lattimore
Lexin Li
35
2
0
17 May 2023
Bayesian Synthetic Likelihood
Bayesian Synthetic Likelihood
David T. Frazier
Christopher C. Drovandi
David J. Nott
39
217
0
09 May 2023
Best Arm Identification with Fairness Constraints on Subpopulations
Best Arm Identification with Fairness Constraints on Subpopulations
Yuhang Wu
Zeyu Zheng
Tingyu Zhu
19
8
0
08 Apr 2023
Instance-dependent Sample Complexity Bounds for Zero-sum Matrix Games
Instance-dependent Sample Complexity Bounds for Zero-sum Matrix Games
Arnab Maiti
Kevin G. Jamieson
Lillian J. Ratliff
36
6
0
19 Mar 2023
Open Problem: Optimal Best Arm Identification with Fixed Budget
Open Problem: Optimal Best Arm Identification with Fixed Budget
Chao Qin
27
18
0
02 Mar 2023
Genetic multi-armed bandits: a reinforcement learning approach for
  discrete optimization via simulation
Genetic multi-armed bandits: a reinforcement learning approach for discrete optimization via simulation
D. Preil
M. Krapp
AI4CE
14
1
0
15 Feb 2023
Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top exploration
Evaluating COVID-19 vaccine allocation policies using Bayesian mmm-top exploration
Alexandra Cimpean
T. Verstraeten
L. Willem
N. Hens
Ann Nowé
Pieter J. K. Libin
21
2
0
30 Jan 2023
UB3: Best Beam Identification in Millimeter Wave Systems via Pure
  Exploration Unimodal Bandits
UB3: Best Beam Identification in Millimeter Wave Systems via Pure Exploration Unimodal Bandits
D. Ghosh
Haseen Rahman
M. Hanawal
Nikola Zlatanov
13
1
0
26 Dec 2022
Constrained Pure Exploration Multi-Armed Bandits with a Fixed Budget
Constrained Pure Exploration Multi-Armed Bandits with a Fixed Budget
Fathima Zarin Faizal
Jayakrishnan Nair
16
7
0
27 Nov 2022
Bayesian Fixed-Budget Best-Arm Identification
Bayesian Fixed-Budget Best-Arm Identification
Alexia Atsidakou
S. Katariya
Sujay Sanghavi
B. Kveton
33
11
0
15 Nov 2022
Adaptive Data Depth via Multi-Armed Bandits
Adaptive Data Depth via Multi-Armed Bandits
Tavor Z. Baharav
T. Lai
23
1
0
08 Nov 2022
Adaptive Experimental Design and Counterfactual Inference
Adaptive Experimental Design and Counterfactual Inference
Tanner Fiez
Sergio Gamez
Arick Chen
Houssam Nassif
Lalit P. Jain
25
7
0
25 Oct 2022
Anytime-valid off-policy inference for contextual bandits
Anytime-valid off-policy inference for contextual bandits
Ian Waudby-Smith
Lili Wu
Aaditya Ramdas
Nikos Karampatziakis
Paul Mineiro
OffRL
43
25
0
19 Oct 2022
Federated Best Arm Identification with Heterogeneous Clients
Federated Best Arm Identification with Heterogeneous Clients
Zhirui Chen
P. Karthik
Vincent Y. F. Tan
Yeow Meng Chee
FedML
39
8
0
14 Oct 2022
Thompson Sampling with Virtual Helping Agents
Thompson Sampling with Virtual Helping Agents
Kartikey Pant
Amod Hegde
K. V. Srinivas
17
0
0
16 Sep 2022
Best Arm Identification with Contextual Information under a Small Gap
Best Arm Identification with Contextual Information under a Small Gap
Masahiro Kato
Masaaki Imaizumi
Takuya Ishihara
T. Kitagawa
27
2
0
15 Sep 2022
Almost Cost-Free Communication in Federated Best Arm Identification
Almost Cost-Free Communication in Federated Best Arm Identification
Kota Srinivas Reddy
P. Karthik
Vincent Y. F. Tan
FedML
36
11
0
19 Aug 2022
Best Policy Identification in Linear MDPs
Best Policy Identification in Linear MDPs
Jerome Taupin
Yassir Jedra
Alexandre Proutière
44
4
0
11 Aug 2022
Exploration in Linear Bandits with Rich Action Sets and its Implications
  for Inference
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference
Debangshu Banerjee
Avishek Ghosh
Sayak Ray Chowdhury
Aditya Gopalan
35
9
0
23 Jul 2022
Unsupervised Crowdsourcing with Accuracy and Cost Guarantees
Unsupervised Crowdsourcing with Accuracy and Cost Guarantees
Yash Didwania
Jayakrishnan Nair
N. Hemachandra
21
1
0
05 Jul 2022
Active Learning with Safety Constraints
Active Learning with Safety Constraints
Romain Camilleri
Andrew Wagenmaker
Jamie Morgenstern
Lalit P. Jain
Kevin G. Jamieson
28
12
0
22 Jun 2022
On the Finite-Time Performance of the Knowledge Gradient Algorithm
On the Finite-Time Performance of the Knowledge Gradient Algorithm
Yanwen Li
Siyang Gao
32
4
0
14 Jun 2022
Best Arm Identification in Restless Markov Multi-Armed Bandits
Best Arm Identification in Restless Markov Multi-Armed Bandits
P. Karthik
Kota Srinivas Reddy
Vincent Y. F. Tan
30
4
0
29 Mar 2022
Approximate Function Evaluation via Multi-Armed Bandits
Approximate Function Evaluation via Multi-Armed Bandits
Tavor Z. Baharav
Gary Cheng
Mert Pilanci
David Tse
19
6
0
18 Mar 2022
Meta-Learning for Simple Regret Minimization
Meta-Learning for Simple Regret Minimization
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
S. Katariya
22
10
0
25 Feb 2022
Exploiting Correlation to Achieve Faster Learning Rates in Low-Rank
  Preference Bandits
Exploiting Correlation to Achieve Faster Learning Rates in Low-Rank Preference Bandits
Suprovat Ghoshal
Aadirupa Saha
23
11
0
23 Feb 2022
Adaptive Experimentation in the Presence of Exogenous Nonstationary
  Variation
Adaptive Experimentation in the Presence of Exogenous Nonstationary Variation
Chao Qin
Daniel Russo
58
6
0
18 Feb 2022
Best Arm Identification with Safety Constraints
Best Arm Identification with Safety Constraints
Zhenlin Wang
Andrew Wagenmaker
Kevin G. Jamieson
27
21
0
23 Nov 2021
Sequential Community Mode Estimation
Sequential Community Mode Estimation
S. Jain
Shreyas Goenka
Divyam Bapna
Nikhil Karamchandani
Jaya Nair
6
2
0
16 Nov 2021
123
Next