1311.1894
Optimality of Thompson Sampling for Gaussian Bandits Depends on Priors
8 November 2013
Junya Honda
Akimichi Takemura
Papers citing
"Optimality of Thompson Sampling for Gaussian Bandits Depends on Priors"
14 / 14 papers shown
Title
A General Recipe for the Analysis of Randomized Multi-Armed Bandit Algorithms
Dorian Baudry
Kazuya Suzuki
Junya Honda
37
5
0
10 Mar 2023
Evaluating COVID-19 vaccine allocation policies using Bayesian m-top exploration
Alexandra Cimpean
T. Verstraeten
L. Willem
N. Hens
Ann Nowé
Pieter J. K. Libin
26
2
0
30 Jan 2023
Gaussian Imagination in Bandit Learning
Yueyang Liu
Adithya M. Devraj
Benjamin Van Roy
Kuang Xu
45
7
0
06 Jan 2022
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Runzhe Wan
Linjuan Ge
Rui Song
40
28
0
13 Aug 2021
Metalearning Linear Bandits by Prior Update
Amit Peleg
Naama Pearl
Ron Meir
63
18
0
12 Jul 2021
Meta Dynamic Pricing: Transfer Learning Across Experiments
Hamsa Bastani
D. Simchi-Levi
Ruihao Zhu
56
89
0
28 Feb 2019
Leveraging Statistical Multi-Agent Online Planning with Emergent Value Function Approximation
Thomy Phan
Lenz Belzner
Thomas Gabor
Kyrill Schmid
OffRL
37
15
0
17 Apr 2018
A Scale Free Algorithm for Stochastic Bandits with Bounded Kurtosis
Tor Lattimore
40
19
0
27 Mar 2017
On Bayesian index policies for sequential resource allocation
E. Kaufmann
48
84
0
06 Jan 2016
A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit
Giuseppe Burtini
Jason L. Loeppky
Ramon Lawrence
46
119
0
02 Oct 2015
Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays
Junpei Komiyama
Junya Honda
Hiroshi Nakagawa
32
134
0
02 Jun 2015
Asymptotic Behavior of Minimal-Exploration Allocation Policies: Almost Sure, Arbitrarily Slow Growing Regret
Wesley Cowan
M. Katehakis
46
14
0
12 May 2015
An Asymptotically Optimal Policy for Uniform Bandits of Unknown Support
Wesley Cowan
M. Katehakis
34
27
0
08 May 2015
Normal Bandits of Unknown Means and Variances: Asymptotic Optimality, Finite Horizon Regret Bounds, and a Solution to an Open Problem
Wesley Cowan
Junya Honda
M. Katehakis
37
22
0
22 Apr 2015