ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.08803
  4. Cited By
Langevin Thompson Sampling with Logarithmic Communication: Bandits and
  Reinforcement Learning

Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning

15 June 2023
Amin Karbasi
Nikki Lijing Kuang
Yi Ma
Siddharth Mitra
    OffRL
ArXivPDFHTML

Papers citing "Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning"

3 / 3 papers shown
Title
Toward Efficient Exploration by Large Language Model Agents
Toward Efficient Exploration by Large Language Model Agents
Dilip Arumugam
Thomas L. Griffiths
LLMAG
94
1
0
29 Apr 2025
Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits
Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits
Ziyi Huang
Henry Lam
Haofeng Zhang
33
0
0
20 Jun 2024
Model-free Reinforcement Learning in Infinite-horizon Average-reward
  Markov Decision Processes
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
107
100
0
15 Oct 2019
1