Old Dog Learns New Tricks: Randomized UCB for Bandit Problems

11 October 2019

Papers citing "Old Dog Learns New Tricks: Randomized UCB for Bandit Problems"

9 / 9 papers shown

Title
Random Latent Exploration for Deep Reinforcement Learning Srinath Mahankali Zhang-Wei Hong Ayush Sekhari Alexander Rakhlin Pulkit Agrawal 199 3 0 18 Jul 2024
Perturbed-History Exploration in Stochastic Linear Bandits Branislav Kveton Csaba Szepesvári Mohammad Ghavamzadeh Craig Boutilier 31 42 0 21 Mar 2019
Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits Branislav Kveton Csaba Szepesvári Sharan Vaswani Zheng Wen Mohammad Ghavamzadeh Tor Lattimore 135 70 0 13 Nov 2018
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling C. Riquelme George Tucker Jasper Snoek BDL 66 365 0 26 Feb 2018
Ensemble Sampling Xiuyuan Lu Benjamin Van Roy 123 119 0 20 May 2017
Bootstrapped Thompson Sampling and Deep Exploration Ian Osband Benjamin Van Roy 138 105 0 01 Jul 2015
Thompson Sampling for Complex Bandit Problems Aditya Gopalan Shie Mannor Yishay Mansour 144 202 0 03 Nov 2013
Thompson Sampling for Contextual Bandits with Linear Payoffs Shipra Agrawal Navin Goyal 195 997 0 15 Sep 2012
The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond Aurélien Garivier Olivier Cappé 164 612 0 12 Feb 2011