The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond

12 February 2011

Papers citing "The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond"

4 / 4 papers shown

Title
Diversity-Aware Reinforcement Learning for de novo Drug Design Hampus Gummesson Svensson C. Tyrchan Ola Engkvist M. Chehreghani 44 2 0 14 Oct 2024
Contextual Bandits for Unbounded Context Distributions Puning Zhao Xiaogang Xu Zhe Liu Huiwen Wu Qin Zhang Zong Ke Tianhang Zheng 177 6 0 19 Aug 2024
Optimal Batched Best Arm Identification Tianyuan Jin Yu Yang Jing Tang Xiaokui Xiao Pan Xu 62 3 0 21 Oct 2023
Selective Uncertainty Propagation in Offline RL Sanath Kumar Krishnamurthy Shrey Modi Tanmay Gangwani S. Katariya Branislav Kveton A. Rangi OffRL 128 0 0 01 Feb 2023