Old Dog Learns New Tricks: Randomized UCB for Bandit Problems


11 October 2019
Sharan Vaswani
Abbas Mehrabian
A. Durand
Branislav Kveton

Papers citing "Old Dog Learns New Tricks: Randomized UCB for Bandit Problems"

Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
199
3
0
18 Jul 2024
Perturbed-History Exploration in Stochastic Linear Bandits
Branislav Kveton
Csaba Szepesvári
Mohammad Ghavamzadeh
Craig Boutilier
31
42
0
21 Mar 2019
Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits
Branislav Kveton
Csaba Szepesvári
Sharan Vaswani
Zheng Wen
Mohammad Ghavamzadeh
Tor Lattimore
135
70
0
13 Nov 2018
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
C. Riquelme
George Tucker
Jasper Snoek
BDL
66
365
0
26 Feb 2018
Ensemble Sampling
Xiuyuan Lu
Benjamin Van Roy
123
119
0
20 May 2017
Bootstrapped Thompson Sampling and Deep Exploration
Ian Osband
Benjamin Van Roy
138
105
0
01 Jul 2015
Thompson Sampling for Complex Bandit Problems
Aditya Gopalan
Shie Mannor
Yishay Mansour
144
202
0
03 Nov 2013
Thompson Sampling for Contextual Bandits with Linear Payoffs
Shipra Agrawal
Navin Goyal
195
997
0
15 Sep 2012
The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond
Aurélien Garivier
Olivier Cappé
164
612
0
12 Feb 2011