Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.08803
Cited By
Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning
15 June 2023
Amin Karbasi
Nikki Lijing Kuang
Yi Ma
Siddharth Mitra
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning"
3 / 3 papers shown
Title
Toward Efficient Exploration by Large Language Model Agents
Dilip Arumugam
Thomas L. Griffiths
LLMAG
94
1
0
29 Apr 2025
Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits
Ziyi Huang
Henry Lam
Haofeng Zhang
33
0
0
20 Jun 2024
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
107
100
0
15 Oct 2019
1