ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.08448
  4. Cited By
Simple Bayesian Algorithms for Best Arm Identification

Simple Bayesian Algorithms for Best Arm Identification

26 February 2016
Daniel Russo
ArXivPDFHTML

Papers citing "Simple Bayesian Algorithms for Best Arm Identification"

43 / 43 papers shown
Title
On the Problem of Best Arm Retention
On the Problem of Best Arm Retention
Houshuang Chen
Yuchen He
Chihao Zhang
39
0
0
16 Apr 2025
Identifying the Best Transition Law
Identifying the Best Transition Law
Mehrasa Ahmadipour
Elise Crépon
Aurélien Garivier
80
0
0
17 Feb 2025
Stochastically Constrained Best Arm Identification with Thompson Sampling
Stochastically Constrained Best Arm Identification with Thompson Sampling
Le Yang
Siyang Gao
Cheng Li
Yi Wang
30
0
0
08 Jan 2025
AExGym: Benchmarks and Environments for Adaptive Experimentation
AExGym: Benchmarks and Environments for Adaptive Experimentation
Jimmy Wang
Ethan Che
Daniel R. Jiang
Hongseok Namkoong
47
0
0
08 Aug 2024
Online Bandit Learning with Offline Preference Data for Improved RLHF
Online Bandit Learning with Offline Preference Data for Improved RLHF
Akhil Agnihotri
Rahul Jain
Deepak Ramachandran
Zheng Wen
OffRL
42
2
0
13 Jun 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
Nicolas Nguyen
Imad Aouali
András Gyorgy
Claire Vernade
42
2
0
08 Feb 2024
Best Arm Identification in Batched Multi-armed Bandit Problems
Best Arm Identification in Batched Multi-armed Bandit Problems
Sheng Cao
Simai He
Ruoqing Jiang
Jin Xu
Hongsong Yuan
15
1
0
21 Dec 2023
Locally Optimal Fixed-Budget Best Arm Identification in Two-Armed
  Gaussian Bandits with Unknown Variances
Locally Optimal Fixed-Budget Best Arm Identification in Two-Armed Gaussian Bandits with Unknown Variances
Masahiro Kato
38
3
0
20 Dec 2023
Adaptive maximization of social welfare
Adaptive maximization of social welfare
Nicolò Cesa-Bianchi
Roberto Colomboni
Maximilian Kasy
30
0
0
14 Oct 2023
On the Complexity of Differentially Private Best-Arm Identification with
  Fixed Confidence
On the Complexity of Differentially Private Best-Arm Identification with Fixed Confidence
Achraf Azize
Marc Jourdan
Aymen Al Marjani
D. Basu
44
3
0
05 Sep 2023
Practical Batch Bayesian Sampling Algorithms for Online Adaptive Traffic
  Experimentation
Practical Batch Bayesian Sampling Algorithms for Online Adaptive Traffic Experimentation
Zezhong Zhang
T. Yuan
15
0
0
24 May 2023
Sequential Best-Arm Identification with Application to Brain-Computer
  Interface
Sequential Best-Arm Identification with Application to Brain-Computer Interface
Xiaoping Zhou
Botao Hao
Jian Kang
Tor Lattimore
Lexin Li
35
2
0
17 May 2023
Scale-Adaptive Balancing of Exploration and Exploitation in Classical
  Planning
Scale-Adaptive Balancing of Exploration and Exploitation in Classical Planning
Stephen Wissow
Masataro Asai
32
2
0
16 May 2023
Getting to "rate-optimal'' in ranking & selection
Getting to "rate-optimal'' in ranking & selection
Harun Avci
B. Nelson
A. Wachter
22
0
0
04 Feb 2023
Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top exploration
Evaluating COVID-19 vaccine allocation policies using Bayesian mmm-top exploration
Alexandra Cimpean
T. Verstraeten
L. Willem
N. Hens
Ann Nowé
Pieter J. K. Libin
21
2
0
30 Jan 2023
Best Arm Identification in Stochastic Bandits: Beyond $β-$optimality
Best Arm Identification in Stochastic Bandits: Beyond β−β-β−optimality
Arpan Mukherjee
A. Tajer
33
3
0
10 Jan 2023
Contextual Bandits in a Survey Experiment on Charitable Giving:
  Within-Experiment Outcomes versus Policy Learning
Contextual Bandits in a Survey Experiment on Charitable Giving: Within-Experiment Outcomes versus Policy Learning
Susan Athey
Undral Byambadalai
Vitor Hadad
Sanath Kumar Krishnamurthy
Weiwen Leung
Joseph Jay Williams
35
13
0
22 Nov 2022
Bayesian Fixed-Budget Best-Arm Identification
Bayesian Fixed-Budget Best-Arm Identification
Alexia Atsidakou
S. Katariya
Sujay Sanghavi
B. Kveton
33
11
0
15 Nov 2022
Thompson Sampling with Virtual Helping Agents
Thompson Sampling with Virtual Helping Agents
Kartikey Pant
Amod Hegde
K. V. Srinivas
19
0
0
16 Sep 2022
Best Arm Identification with Contextual Information under a Small Gap
Best Arm Identification with Contextual Information under a Small Gap
Masahiro Kato
Masaaki Imaizumi
Takuya Ishihara
T. Kitagawa
27
2
0
15 Sep 2022
SPRT-based Efficient Best Arm Identification in Stochastic Bandits
SPRT-based Efficient Best Arm Identification in Stochastic Bandits
Arpan Mukherjee
A. Tajer
16
5
0
22 Jul 2022
On the Finite-Time Performance of the Knowledge Gradient Algorithm
On the Finite-Time Performance of the Knowledge Gradient Algorithm
Yanwen Li
Siyang Gao
38
4
0
14 Jun 2022
Top Two Algorithms Revisited
Top Two Algorithms Revisited
Marc Jourdan
Rémy Degenne
Dorian Baudry
R. D. Heide
E. Kaufmann
26
38
0
13 Jun 2022
Rate-Constrained Remote Contextual Bandits
Rate-Constrained Remote Contextual Bandits
Francesco Pase
Deniz Gündüz
M. Zorzi
34
8
0
26 Apr 2022
An Analysis of Ensemble Sampling
An Analysis of Ensemble Sampling
Chao Qin
Zheng Wen
Xiuyuan Lu
Benjamin Van Roy
32
21
0
02 Mar 2022
Partial Likelihood Thompson Sampling
Partial Likelihood Thompson Sampling
Han Wu
Stefan Wager
LM&MA
30
1
0
02 Mar 2022
Meta-Learning for Simple Regret Minimization
Meta-Learning for Simple Regret Minimization
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
S. Katariya
22
10
0
25 Feb 2022
Adaptive Experimentation in the Presence of Exogenous Nonstationary
  Variation
Adaptive Experimentation in the Presence of Exogenous Nonstationary Variation
Chao Qin
Daniel Russo
58
6
0
18 Feb 2022
Selecting the Best Optimizing System
Selecting the Best Optimizing System
Nian Si
Zeyu Zheng
23
1
0
09 Jan 2022
Algorithms for Adaptive Experiments that Trade-off Statistical Analysis
  with Reward: Combining Uniform Random Assignment and Reward Maximization
Algorithms for Adaptive Experiments that Trade-off Statistical Analysis with Reward: Combining Uniform Random Assignment and Reward Maximization
Tong Li
Jacob Nogas
Haochen Song
Harsh Kumar
A. Durand
Anna N. Rafferty
Nina Deliu
S. Villar
Joseph Jay Williams
29
5
0
15 Dec 2021
Vector Optimization with Stochastic Bandit Feedback
Vector Optimization with Stochastic Bandit Feedback
Shiao Liu
Jian Huang
33
8
0
23 Oct 2021
Policy Choice and Best Arm Identification: Asymptotic Analysis of
  Exploration Sampling
Policy Choice and Best Arm Identification: Asymptotic Analysis of Exploration Sampling
Kaito Ariu
Masahiro Kato
Junpei Komiyama
K. McAlinn
Chao Qin
65
24
0
16 Sep 2021
Bayesian decision-making under misspecified priors with applications to
  meta-learning
Bayesian decision-making under misspecified priors with applications to meta-learning
Max Simchowitz
Christopher Tosh
A. Krishnamurthy
Daniel J. Hsu
Thodoris Lykouris
Miroslav Dudík
Robert Schapire
40
49
0
03 Jul 2021
Navigating to the Best Policy in Markov Decision Processes
Navigating to the Best Policy in Markov Decision Processes
Aymen Al Marjani
Aurélien Garivier
Alexandre Proutiere
35
21
0
05 Jun 2021
Optimal Algorithms for Range Searching over Multi-Armed Bandits
Optimal Algorithms for Range Searching over Multi-Armed Bandits
Siddharth Barman
Ramakrishnan Krishnamurthy
S. Rahul
20
0
0
04 May 2021
Challenges in Statistical Analysis of Data Collected by a Bandit
  Algorithm: An Empirical Exploration in Applications to Adaptively Randomized
  Experiments
Challenges in Statistical Analysis of Data Collected by a Bandit Algorithm: An Empirical Exploration in Applications to Adaptively Randomized Experiments
Joseph Jay Williams
Jacob Nogas
Nina Deliu
Hammad Shaikh
S. Villar
A. Durand
Anna N. Rafferty
AAML
22
10
0
22 Mar 2021
Online Multi-Armed Bandits with Adaptive Inference
Online Multi-Armed Bandits with Adaptive Inference
Maria Dimakopoulou
Zhimei Ren
Zhengyuan Zhou
29
34
0
25 Feb 2021
Effects of Model Misspecification on Bayesian Bandits: Case Studies in
  UX Optimization
Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization
Mack Sweeney
M. Adelsberg
Kathryn B. Laskey
C. Domeniconi
21
1
0
07 Oct 2020
Are sample means in multi-armed bandits positively or negatively biased?
Are sample means in multi-armed bandits positively or negatively biased?
Jaehyeok Shin
Aaditya Ramdas
Alessandro Rinaldo
11
35
0
27 May 2019
Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling
Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling
E. Kaufmann
Wouter M. Koolen
Aurélien Garivier
16
25
0
04 Jun 2018
Improving the Expected Improvement Algorithm
Improving the Expected Improvement Algorithm
Chao Qin
Diego Klabjan
Daniel Russo
27
135
0
29 May 2017
Learning the distribution with largest mean: two bandit frameworks
Learning the distribution with largest mean: two bandit frameworks
E. Kaufmann
Aurélien Garivier
24
19
0
31 Jan 2017
On Bayesian index policies for sequential resource allocation
On Bayesian index policies for sequential resource allocation
E. Kaufmann
41
84
0
06 Jan 2016
1