Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.03162
Cited By
Bayesian bandits: balancing the exploration-exploitation tradeoff via double sampling
10 September 2017
Iñigo Urteaga
C. Wiggins
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Bayesian bandits: balancing the exploration-exploitation tradeoff via double sampling"
7 / 7 papers shown
Title
An Information-Theoretic Analysis of Thompson Sampling
Daniel Russo
Benjamin Van Roy
153
425
0
21 Mar 2014
Thompson Sampling for 1-Dimensional Exponential Family Bandits
N. Korda
E. Kaufmann
Rémi Munos
71
155
0
12 Jul 2013
Learning to Optimize Via Posterior Sampling
Daniel Russo
Benjamin Van Roy
195
701
0
11 Jan 2013
Further Optimal Regret Bounds for Thompson Sampling
Shipra Agrawal
Navin Goyal
107
442
0
15 Sep 2012
Thompson Sampling for Contextual Bandits with Linear Payoffs
Shipra Agrawal
Navin Goyal
195
1,000
0
15 Sep 2012
A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences
Odalric-Ambrym Maillard
Rémi Munos
Gilles Stoltz
90
146
0
29 May 2011
A Contextual-Bandit Approach to Personalized News Article Recommendation
Lihong Li
Wei Chu
John Langford
Robert Schapire
459
2,951
0
28 Feb 2010
1