ResearchTrend.AI
The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond

12 February 2011
Aurélien Garivier
Olivier Cappé
arXiv:1102.2490

Papers citing "The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond"

4 / 4 papers shown
Diversity-Aware Reinforcement Learning for de novo Drug Design
Hampus Gummesson Svensson
C. Tyrchan
Ola Engkvist
M. Chehreghani
44
2
0
14 Oct 2024
Contextual Bandits for Unbounded Context Distributions
Puning Zhao
Xiaogang Xu
Zhe Liu
Huiwen Wu
Qin Zhang
Zong Ke
Tianhang Zheng
177
6
0
19 Aug 2024
Optimal Batched Best Arm Identification
Tianyuan Jin
Yu Yang
Jing Tang
Xiaokui Xiao
Pan Xu
62
3
0
21 Oct 2023
Selective Uncertainty Propagation in Offline RL
Sanath Kumar Krishnamurthy
Shrey Modi
Tanmay Gangwani
S. Katariya
Branislav Kveton
A. Rangi
OffRL
128
0
0
01 Feb 2023