Simple Bayesian Algorithms for Best Arm Identification

26 February 2016

Papers citing "Simple Bayesian Algorithms for Best Arm Identification"

43 / 43 papers shown

Title
On the Problem of Best Arm Retention Houshuang Chen Yuchen He Chihao Zhang 39 0 0 16 Apr 2025
Identifying the Best Transition Law Mehrasa Ahmadipour Elise Crépon Aurélien Garivier 80 0 0 17 Feb 2025
Stochastically Constrained Best Arm Identification with Thompson Sampling Le Yang Siyang Gao Cheng Li Yi Wang 30 0 0 08 Jan 2025
AExGym: Benchmarks and Environments for Adaptive Experimentation Jimmy Wang Ethan Che Daniel R. Jiang Hongseok Namkoong 47 0 0 08 Aug 2024
Online Bandit Learning with Offline Preference Data for Improved RLHF Akhil Agnihotri Rahul Jain Deepak Ramachandran Zheng Wen OffRL 42 2 0 13 Jun 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits Nicolas Nguyen Imad Aouali András Gyorgy Claire Vernade 42 2 0 08 Feb 2024
Best Arm Identification in Batched Multi-armed Bandit Problems Sheng Cao Simai He Ruoqing Jiang Jin Xu Hongsong Yuan 15 1 0 21 Dec 2023
Locally Optimal Fixed-Budget Best Arm Identification in Two-Armed Gaussian Bandits with Unknown Variances Masahiro Kato 38 3 0 20 Dec 2023
Adaptive maximization of social welfare Nicolò Cesa-Bianchi Roberto Colomboni Maximilian Kasy 30 0 0 14 Oct 2023
On the Complexity of Differentially Private Best-Arm Identification with Fixed Confidence Achraf Azize Marc Jourdan Aymen Al Marjani D. Basu 44 3 0 05 Sep 2023
Practical Batch Bayesian Sampling Algorithms for Online Adaptive Traffic Experimentation Zezhong Zhang T. Yuan 15 0 0 24 May 2023
Sequential Best-Arm Identification with Application to Brain-Computer Interface Xiaoping Zhou Botao Hao Jian Kang Tor Lattimore Lexin Li 35 2 0 17 May 2023
Scale-Adaptive Balancing of Exploration and Exploitation in Classical Planning Stephen Wissow Masataro Asai 32 2 0 16 May 2023
Getting to "rate-optimal'' in ranking & selection Harun Avci B. Nelson A. Wachter 22 0 0 04 Feb 2023
Evaluating COVID-19 vaccine allocation policies using Bayesian $m$ -top exploration Alexandra Cimpean T. Verstraeten L. Willem N. Hens Ann Nowé Pieter J. K. Libin 21 2 0 30 Jan 2023
Best Arm Identification in Stochastic Bandits: Beyond $β-$ optimality Arpan Mukherjee A. Tajer 33 3 0 10 Jan 2023
Contextual Bandits in a Survey Experiment on Charitable Giving: Within-Experiment Outcomes versus Policy Learning Susan Athey Undral Byambadalai Vitor Hadad Sanath Kumar Krishnamurthy Weiwen Leung Joseph Jay Williams 35 13 0 22 Nov 2022
Bayesian Fixed-Budget Best-Arm Identification Alexia Atsidakou S. Katariya Sujay Sanghavi B. Kveton 33 11 0 15 Nov 2022
Thompson Sampling with Virtual Helping Agents Kartikey Pant Amod Hegde K. V. Srinivas 19 0 0 16 Sep 2022
Best Arm Identification with Contextual Information under a Small Gap Masahiro Kato Masaaki Imaizumi Takuya Ishihara T. Kitagawa 27 2 0 15 Sep 2022
SPRT-based Efficient Best Arm Identification in Stochastic Bandits Arpan Mukherjee A. Tajer 16 5 0 22 Jul 2022
On the Finite-Time Performance of the Knowledge Gradient Algorithm Yanwen Li Siyang Gao 38 4 0 14 Jun 2022
Top Two Algorithms Revisited Marc Jourdan Rémy Degenne Dorian Baudry R. D. Heide E. Kaufmann 26 38 0 13 Jun 2022
Rate-Constrained Remote Contextual Bandits Francesco Pase Deniz Gündüz M. Zorzi 34 8 0 26 Apr 2022
An Analysis of Ensemble Sampling Chao Qin Zheng Wen Xiuyuan Lu Benjamin Van Roy 32 21 0 02 Mar 2022
Partial Likelihood Thompson Sampling Han Wu Stefan Wager LM&MA 30 1 0 02 Mar 2022
Meta-Learning for Simple Regret Minimization Javad Azizi B. Kveton Mohammad Ghavamzadeh S. Katariya 22 10 0 25 Feb 2022
Adaptive Experimentation in the Presence of Exogenous Nonstationary Variation Chao Qin Daniel Russo 58 6 0 18 Feb 2022
Selecting the Best Optimizing System Nian Si Zeyu Zheng 23 1 0 09 Jan 2022
Algorithms for Adaptive Experiments that Trade-off Statistical Analysis with Reward: Combining Uniform Random Assignment and Reward Maximization Tong Li Jacob Nogas Haochen Song Harsh Kumar A. Durand Anna N. Rafferty Nina Deliu S. Villar Joseph Jay Williams 29 5 0 15 Dec 2021
Vector Optimization with Stochastic Bandit Feedback Shiao Liu Jian Huang 33 8 0 23 Oct 2021
Policy Choice and Best Arm Identification: Asymptotic Analysis of Exploration Sampling Kaito Ariu Masahiro Kato Junpei Komiyama K. McAlinn Chao Qin 65 24 0 16 Sep 2021
Bayesian decision-making under misspecified priors with applications to meta-learning Max Simchowitz Christopher Tosh A. Krishnamurthy Daniel J. Hsu Thodoris Lykouris Miroslav Dudík Robert Schapire 40 49 0 03 Jul 2021
Navigating to the Best Policy in Markov Decision Processes Aymen Al Marjani Aurélien Garivier Alexandre Proutiere 35 21 0 05 Jun 2021
Optimal Algorithms for Range Searching over Multi-Armed Bandits Siddharth Barman Ramakrishnan Krishnamurthy S. Rahul 20 0 0 04 May 2021
Challenges in Statistical Analysis of Data Collected by a Bandit Algorithm: An Empirical Exploration in Applications to Adaptively Randomized Experiments Joseph Jay Williams Jacob Nogas Nina Deliu Hammad Shaikh S. Villar A. Durand Anna N. Rafferty AAML 22 10 0 22 Mar 2021
Online Multi-Armed Bandits with Adaptive Inference Maria Dimakopoulou Zhimei Ren Zhengyuan Zhou 29 34 0 25 Feb 2021
Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization Mack Sweeney M. Adelsberg Kathryn B. Laskey C. Domeniconi 21 1 0 07 Oct 2020
Are sample means in multi-armed bandits positively or negatively biased? Jaehyeok Shin Aaditya Ramdas Alessandro Rinaldo 11 35 0 27 May 2019
Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling E. Kaufmann Wouter M. Koolen Aurélien Garivier 16 25 0 04 Jun 2018
Improving the Expected Improvement Algorithm Chao Qin Diego Klabjan Daniel Russo 27 135 0 29 May 2017
Learning the distribution with largest mean: two bandit frameworks E. Kaufmann Aurélien Garivier 24 19 0 31 Jan 2017
On Bayesian index policies for sequential resource allocation E. Kaufmann 41 84 0 06 Jan 2016