Bayesian bandits: balancing the exploration-exploitation tradeoff via double sampling

10 September 2017

Papers citing "Bayesian bandits: balancing the exploration-exploitation tradeoff via double sampling"


Title
An Information-Theoretic Analysis of Thompson Sampling | Daniel Russo, Benjamin Van Roy | 153 425 0 | 21 Mar 2014
Thompson Sampling for 1-Dimensional Exponential Family Bandits | N. Korda, E. Kaufmann, Rémi Munos | 71 155 0 | 12 Jul 2013
Learning to Optimize Via Posterior Sampling | Daniel Russo, Benjamin Van Roy | 195 701 0 | 11 Jan 2013
Further Optimal Regret Bounds for Thompson Sampling | Shipra Agrawal, Navin Goyal | 107 442 0 | 15 Sep 2012
Thompson Sampling for Contextual Bandits with Linear Payoffs | Shipra Agrawal, Navin Goyal | 195 1,000 0 | 15 Sep 2012
A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences | Odalric-Ambrym Maillard, Rémi Munos, Gilles Stoltz | 90 146 0 | 29 May 2011
A Contextual-Bandit Approach to Personalized News Article Recommendation | Lihong Li, Wei Chu, John Langford, Robert Schapire | 459 2,951 0 | 28 Feb 2010