Optimal Best-arm Identification in Linear Bandits

29 June 2020

Papers citing "Optimal Best-arm Identification in Linear Bandits"

48 / 48 papers shown

Title
Pure Exploration with Feedback Graphs Alessio Russo Yichen Song Aldo Pacchiano 48 0 0 10 Mar 2025
Cost-Aware Optimal Pairwise Pure Exploration Di Wu Chengshuai Shi Ruida Zhou Cong Shen 41 0 0 10 Mar 2025
Sequential Learning of the Pareto Front for Multi-objective Bandits Elise Crépon Aurélien Garivier Wouter M. Koolen 47 1 0 29 Jan 2025
Online Clustering with Bandit Information G Dhinesh Chandran Srinivas Reddy Kota Srikrishna Bhashyam 71 0 0 20 Jan 2025
Best-Arm Identification in Unimodal Bandits Riccardo Poiani Marc Jourdan E. Kaufmann Rémy Degenne 32 0 0 04 Nov 2024
Near Optimal Pure Exploration in Logistic Bandits Eduardo Ochoa Rivera Ambuj Tewari 30 0 0 28 Oct 2024
Optimal Batched Linear Bandits Xuanfei Ren Tianyuan Jin Pan Xu 40 2 0 06 Jun 2024
Efficient Prompt Optimization Through the Lens of Best Arm Identification Chengshuai Shi Kun Yang Zihan Chen Jundong Li Jing Yang Cong Shen 50 6 0 15 Feb 2024
Optimal Thresholding Linear Bandit Eduardo Ochoa Rivera Ambuj Tewari 20 0 0 11 Feb 2024
Experiment Planning with Function Approximation Aldo Pacchiano Jonathan Lee Emma Brunskill OffRL 37 3 0 10 Jan 2024
Data-driven optimal stopping: A pure exploration analysis Soren Christensen Niklas Dexheimer Claudia Strauch 49 2 0 10 Dec 2023
Fixed-Budget Best-Arm Identification in Sparse Linear Bandits Recep Can Yavas Vincent Y. F. Tan 22 2 0 01 Nov 2023
Towards Instance-Optimality in Online PAC Reinforcement Learning Aymen Al Marjani Andrea Tirinzoni Emilie Kaufmann OffRL 18 4 0 31 Oct 2023
Pure Exploration in Asynchronous Federated Bandits Zichen Wang Chuanhao Li Chenyu Song Lianghui Wang Quanquan Gu Huazheng Wang FedML 38 1 0 17 Oct 2023
Optimal Exploration is no harder than Thompson Sampling Zhaoqi Li Kevin Jamieson Lalit P. Jain 27 2 0 09 Oct 2023
Experimental Designs for Heteroskedastic Variance Justin Weltz Tanner Fiez Alex Volfovsky Eric B. Laber Blake Mason Houssam Nassif Lalit P. Jain 32 3 0 06 Oct 2023
Thompson Exploration with Best Challenger Rule in Best Arm Identification Jongyeong Lee Junya Honda Masashi Sugiyama 33 3 0 01 Oct 2023
Price of Safety in Linear Best Arm Identification Xuedong Shang Igor Colin M. Barlier Hamza Cherkaoui LLMSV 27 3 0 15 Sep 2023
Pure Exploration under Mediators' Feedback Riccardo Poiani Alberto Maria Metelli Marcello Restelli 27 1 0 29 Aug 2023
Certified Multi-Fidelity Zeroth-Order Optimization Étienne de Montbrun Sébastien Gerchinovitz 11 1 0 02 Aug 2023
A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity Zhihan Xiong Romain Camilleri Maryam Fazel Lalit P. Jain Kevin G. Jamieson 15 1 0 27 Jul 2023
Pure Exploration in Bandits with Linear Constraints Emil Carlsson Debabrota Basu Fredrik D. Johansson Devdatt Dubhashi 42 2 0 22 Jun 2023
Cooperative Thresholded Lasso for Sparse Linear Bandit Haniyeh Barghi Xiaotong Cheng S. Maghsudi 34 0 0 30 May 2023
Best Arm Identification in Bandits with Limited Precision Sampling Kota Srinivas Reddy P. Karthik Nikhil Karamchandani Jayakrishnan Nair 32 2 0 10 May 2023
Estimating Optimal Policy Value in General Linear Contextual Bandits Jonathan Lee Weihao Kong Aldo Pacchiano Vidya Muthukumar Emma Brunskill 28 0 0 19 Feb 2023
Active learning for data streams: a survey Davide Cacciarelli M. Kulahci 30 40 0 17 Feb 2023
Multi-task Representation Learning for Pure Exploration in Linear Bandits Yihan Du Longbo Huang Wen Sun 27 4 0 09 Feb 2023
Best Arm Identification in Stochastic Bandits: Beyond $\beta$-optimality Arpan Mukherjee A. Tajer 33 3 0 10 Jan 2023
Contextual Bandits in a Survey Experiment on Charitable Giving: Within-Experiment Outcomes versus Policy Learning Susan Athey Undral Byambadalai Vitor Hadad Sanath Kumar Krishnamurthy Weiwen Leung Joseph Jay Williams 35 13 0 22 Nov 2022
Best Policy Identification in Linear MDPs Jerome Taupin Yassir Jedra Alexandre Proutiere 44 4 0 11 Aug 2022
SPRT-based Efficient Best Arm Identification in Stochastic Bandits Arpan Mukherjee A. Tajer 16 5 0 22 Jul 2022
Choosing Answers in $\varepsilon$-Best-Answer Identification for Linear Bandits Marc Jourdan Rémy Degenne 19 1 0 09 Jun 2022
Information-Directed Selection for Top-Two Algorithms Wei You Chao Qin Zihao Wang Shuoguang Yang 38 13 0 24 May 2022
On Elimination Strategies for Bandit Fixed-Confidence Identification Andrea Tirinzoni Rémy Degenne 47 7 0 22 May 2022
On the complexity of All $\varepsilon$-Best Arms Identification Aymen Al Marjani Tomás Kocák Aurélien Garivier 16 4 0 13 Feb 2022
Optimal Clustering with Bandit Feedback Junwen Yang Zixin Zhong Vincent Y. F. Tan 19 12 0 09 Feb 2022
Learning Optimal Antenna Tilt Control Policies: A Contextual Linear Bandit Approach Filippo Vannella Alexandre Proutiere Yassir Jedra Jaeseong Jeong 25 7 0 06 Jan 2022
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification Clémence Réda Andrea Tirinzoni Rémy Degenne 31 9 0 02 Nov 2021
Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs Han Zhong Jiayi Huang Lin F. Yang Liwei Wang 24 7 0 26 Oct 2021
Design of Experiments for Stochastic Contextual Linear Bandits Andrea Zanette Kefan Dong Jonathan Lee Emma Brunskill OffRL 26 17 0 21 Jul 2021
The Role of Contextual Information in Best Arm Identification Masahiro Kato Kaito Ariu 43 18 0 26 Jun 2021
Fixed-Budget Best-Arm Identification in Structured Bandits Javad Azizi B. Kveton Mohammad Ghavamzadeh 15 23 0 09 Jun 2021
Minimax Optimal Fixed-Budget Best Arm Identification in Linear Bandits Junwen Yang Vincent Y. F. Tan 20 24 0 27 May 2021
Pure Exploration with Structured Preference Feedback Shubham Gupta Aadirupa Saha S. Katariya 35 0 0 12 Apr 2021
Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP Zihan Zhang Jiaqi Yang Xiangyang Ji S. Du 71 38 0 29 Jan 2021
Combinatorial Pure Exploration with Full-bandit Feedback and Beyond: Solving Combinatorial Optimization under Uncertainty with Limited Observation Yuko Kuroki Junya Honda Masashi Sugiyama OffRL 24 1 0 31 Dec 2020
Best Arm Identification in Graphical Bilinear Bandits Geovani Rizk Albert Thomas Igor Colin R. Laraki Y. Chevaleyre 13 6 0 14 Dec 2020
Thresholded Lasso Bandit Kaito Ariu Kenshi Abe Alexandre Proutiere 27 17 0 22 Oct 2020