Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning

15 June 2023

Papers citing "Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning"

3 / 3 papers shown

Title
Toward Efficient Exploration by Large Language Model Agents Dilip Arumugam Thomas L. Griffiths LLMAG 94 1 0 29 Apr 2025
Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits Ziyi Huang Henry Lam Haofeng Zhang 33 0 0 20 Jun 2024
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes Chen-Yu Wei Mehdi Jafarnia-Jahromi Haipeng Luo Hiteshi Sharma R. Jain 107 100 0 15 Oct 2019