Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.16073
Cited By
Optimal Best-arm Identification in Linear Bandits
29 June 2020
Yassir Jedra
Alexandre Proutiere
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Optimal Best-arm Identification in Linear Bandits"
48 / 48 papers shown
Title
Pure Exploration with Feedback Graphs
Alessio Russo
Yichen Song
Aldo Pacchiano
48
0
0
10 Mar 2025
Cost-Aware Optimal Pairwise Pure Exploration
Di Wu
Chengshuai Shi
Ruida Zhou
Cong Shen
41
0
0
10 Mar 2025
Sequential Learning of the Pareto Front for Multi-objective Bandits
Elise Crépon
Aurélien Garivier
Wouter M. Koolen
47
1
0
29 Jan 2025
Online Clustering with Bandit Information
G Dhinesh Chandran
Srinivas Reddy Kota
Srikrishna Bhashyam
71
0
0
20 Jan 2025
Best-Arm Identification in Unimodal Bandits
Riccardo Poiani
Marc Jourdan
E. Kaufmann
Rémy Degenne
32
0
0
04 Nov 2024
Near Optimal Pure Exploration in Logistic Bandits
Eduardo Ochoa Rivera
Ambuj Tewari
30
0
0
28 Oct 2024
Optimal Batched Linear Bandits
Xuanfei Ren
Tianyuan Jin
Pan Xu
40
2
0
06 Jun 2024
Efficient Prompt Optimization Through the Lens of Best Arm Identification
Chengshuai Shi
Kun Yang
Zihan Chen
Jundong Li
Jing Yang
Cong Shen
50
6
0
15 Feb 2024
Optimal Thresholding Linear Bandit
Eduardo Ochoa Rivera
Ambuj Tewari
20
0
0
11 Feb 2024
Experiment Planning with Function Approximation
Aldo Pacchiano
Jonathan Lee
Emma Brunskill
OffRL
37
3
0
10 Jan 2024
Data-driven optimal stopping: A pure exploration analysis
Soren Christensen
Niklas Dexheimer
Claudia Strauch
49
2
0
10 Dec 2023
Fixed-Budget Best-Arm Identification in Sparse Linear Bandits
Recep Can Yavas
Vincent Y. F. Tan
22
2
0
01 Nov 2023
Towards Instance-Optimality in Online PAC Reinforcement Learning
Aymen Al Marjani
Andrea Tirinzoni
Emilie Kaufmann
OffRL
18
4
0
31 Oct 2023
Pure Exploration in Asynchronous Federated Bandits
Zichen Wang
Chuanhao Li
Chenyu Song
Lianghui Wang
Quanquan Gu
Huazheng Wang
FedML
38
1
0
17 Oct 2023
Optimal Exploration is no harder than Thompson Sampling
Zhaoqi Li
Kevin Jamieson
Lalit P. Jain
27
2
0
09 Oct 2023
Experimental Designs for Heteroskedastic Variance
Justin Weltz
Tanner Fiez
Alex Volfovsky
Eric B. Laber
Blake Mason
Houssam Nassif
Lalit P. Jain
32
3
0
06 Oct 2023
Thompson Exploration with Best Challenger Rule in Best Arm Identification
Jongyeong Lee
Junya Honda
Masashi Sugiyama
33
3
0
01 Oct 2023
Price of Safety in Linear Best Arm Identification
Xuedong Shang
Igor Colin
M. Barlier
Hamza Cherkaoui
LLMSV
27
3
0
15 Sep 2023
Pure Exploration under Mediators' Feedback
Riccardo Poiani
Alberto Maria Metelli
Marcello Restelli
27
1
0
29 Aug 2023
Certified Multi-Fidelity Zeroth-Order Optimization
Étienne de Montbrun
Sébastien Gerchinovitz
11
1
0
02 Aug 2023
A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity
Zhihan Xiong
Romain Camilleri
Maryam Fazel
Lalit P. Jain
Kevin G. Jamieson
15
1
0
27 Jul 2023
Pure Exploration in Bandits with Linear Constraints
Emil Carlsson
Debabrota Basu
Fredrik D. Johansson
Devdatt Dubhashi
42
2
0
22 Jun 2023
Cooperative Thresholded Lasso for Sparse Linear Bandit
Haniyeh Barghi
Xiaotong Cheng
S. Maghsudi
34
0
0
30 May 2023
Best Arm Identification in Bandits with Limited Precision Sampling
Kota Srinivas Reddy
P. Karthik
Nikhil Karamchandani
Jayakrishnan Nair
32
2
0
10 May 2023
Estimating Optimal Policy Value in General Linear Contextual Bandits
Jonathan Lee
Weihao Kong
Aldo Pacchiano
Vidya Muthukumar
Emma Brunskill
28
0
0
19 Feb 2023
Active learning for data streams: a survey
Davide Cacciarelli
M. Kulahci
30
40
0
17 Feb 2023
Multi-task Representation Learning for Pure Exploration in Linear Bandits
Yihan Du
Longbo Huang
Wen Sun
27
4
0
09 Feb 2023
Best Arm Identification in Stochastic Bandits: Beyond
β
−
β-
β
−
optimality
Arpan Mukherjee
A. Tajer
33
3
0
10 Jan 2023
Contextual Bandits in a Survey Experiment on Charitable Giving: Within-Experiment Outcomes versus Policy Learning
Susan Athey
Undral Byambadalai
Vitor Hadad
Sanath Kumar Krishnamurthy
Weiwen Leung
Joseph Jay Williams
35
13
0
22 Nov 2022
Best Policy Identification in Linear MDPs
Jerome Taupin
Yassir Jedra
Alexandre Proutiere
44
4
0
11 Aug 2022
SPRT-based Efficient Best Arm Identification in Stochastic Bandits
Arpan Mukherjee
A. Tajer
16
5
0
22 Jul 2022
Choosing Answers in
ε
\varepsilon
ε
-Best-Answer Identification for Linear Bandits
Marc Jourdan
Rémy Degenne
19
1
0
09 Jun 2022
Information-Directed Selection for Top-Two Algorithms
Wei You
Chao Qin
Zihao Wang
Shuoguang Yang
38
13
0
24 May 2022
On Elimination Strategies for Bandit Fixed-Confidence Identification
Andrea Tirinzoni
Rémy Degenne
47
7
0
22 May 2022
On the complexity of All
ε
\varepsilon
ε
-Best Arms Identification
Aymen Al Marjani
Tomás Kocák
Aurélien Garivier
16
4
0
13 Feb 2022
Optimal Clustering with Bandit Feedback
Junwen Yang
Zixin Zhong
Vincent Y. F. Tan
19
12
0
09 Feb 2022
Learning Optimal Antenna Tilt Control Policies: A Contextual Linear Bandit Approach
Filippo Vannella
Alexandre Proutiere
Yassir Jedra
Jaeseong Jeong
25
7
0
06 Jan 2022
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification
Clémence Réda
Andrea Tirinzoni
Rémy Degenne
31
9
0
02 Nov 2021
Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs
Han Zhong
Jiayi Huang
Lin F. Yang
Liwei Wang
24
7
0
26 Oct 2021
Design of Experiments for Stochastic Contextual Linear Bandits
Andrea Zanette
Kefan Dong
Jonathan Lee
Emma Brunskill
OffRL
26
17
0
21 Jul 2021
The Role of Contextual Information in Best Arm Identification
Masahiro Kato
Kaito Ariu
43
18
0
26 Jun 2021
Fixed-Budget Best-Arm Identification in Structured Bandits
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
15
23
0
09 Jun 2021
Minimax Optimal Fixed-Budget Best Arm Identification in Linear Bandits
Junwen Yang
Vincent Y. F. Tan
20
24
0
27 May 2021
Pure Exploration with Structured Preference Feedback
Shubham Gupta
Aadirupa Saha
S. Katariya
35
0
0
12 Apr 2021
Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP
Zihan Zhang
Jiaqi Yang
Xiangyang Ji
S. Du
71
38
0
29 Jan 2021
Combinatorial Pure Exploration with Full-bandit Feedback and Beyond: Solving Combinatorial Optimization under Uncertainty with Limited Observation
Yuko Kuroki
Junya Honda
Masashi Sugiyama
OffRL
24
1
0
31 Dec 2020
Best Arm Identification in Graphical Bilinear Bandits
Geovani Rizk
Albert Thomas
Igor Colin
R. Laraki
Y. Chevaleyre
13
6
0
14 Dec 2020
Thresholded Lasso Bandit
Kaito Ariu
Kenshi Abe
Alexandre Proutiere
27
17
0
22 Oct 2020
1