ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.18246
  4. Cited By
Provable and Practical: Efficient Exploration in Reinforcement Learning
  via Langevin Monte Carlo

Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo

29 May 2023
Haque Ishfaq
Qingfeng Lan
Pan Xu
A. R. Mahmood
Doina Precup
Anima Anandkumar
Kamyar Azizzadenesheli
    BDL
    OffRL
ArXivPDFHTML

Papers citing "Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo"

20 / 20 papers shown
Title
Toward Efficient Exploration by Large Language Model Agents
Toward Efficient Exploration by Large Language Model Agents
Dilip Arumugam
Thomas L. Griffiths
LLMAG
89
0
0
29 Apr 2025
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
Yexin Li
Pring Wong
Hanfang Zhang
Shuo Chen
Siyuan Qi
OffRL
54
0
0
23 Mar 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel
Luca Viano
V. Cevher
74
0
0
27 Feb 2025
Muti-Fidelity Prediction and Uncertainty Quantification with Laplace Neural Operators for Parametric Partial Differential Equations
Muti-Fidelity Prediction and Uncertainty Quantification with Laplace Neural Operators for Parametric Partial Differential Equations
Haoyang Zheng
Guang Lin
AI4CE
48
0
0
01 Feb 2025
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Haque Ishfaq
Guangyuan Wang
Sami Nur Islam
Doina Precup
52
2
0
29 Jan 2025
Upper and Lower Bounds for Distributionally Robust Off-Dynamics
  Reinforcement Learning
Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning
Zhishuai Liu
Weixin Wang
Pan Xu
28
1
0
30 Sep 2024
Random Latent Exploration for Deep Reinforcement Learning
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
33
3
0
18 Jul 2024
More Efficient Randomized Exploration for Reinforcement Learning via
  Approximate Sampling
More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling
Haque Ishfaq
Yixin Tan
Yu Yang
Qingfeng Lan
Jianfeng Lu
A. Rupam Mahmood
Doina Precup
Pan Xu
32
4
0
18 Jun 2024
Sequential Decision Making with Expert Demonstrations under Unobserved
  Heterogeneity
Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity
Vahid Balazadeh Meresht
Keertana Chidambaram
Viet Nguyen
Rahul G. Krishnan
Vasilis Syrgkanis
39
0
0
10 Apr 2024
Prior-dependent analysis of posterior sampling reinforcement learning
  with function approximation
Prior-dependent analysis of posterior sampling reinforcement learning with function approximation
Yingru Li
Zhi-Quan Luo
19
1
0
17 Mar 2024
Distributionally Robust Off-Dynamics Reinforcement Learning: Provable
  Efficiency with Linear Function Approximation
Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation
Zhishuai Liu
Pan Xu
OOD
OffRL
34
8
0
23 Feb 2024
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice
  via HyperAgent
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Yingru Li
Jiawei Xu
Lei Han
Zhi-Quan Luo
BDL
OffRL
18
6
0
05 Feb 2024
Accelerating Approximate Thompson Sampling with Underdamped Langevin
  Monte Carlo
Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo
Haoyang Zheng
Wei Deng
Christian Moya
Guang Lin
22
6
0
22 Jan 2024
Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling
  on Sparse Hypergraphs
Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs
Tianyuan Jin
Hao-Lun Hsu
William Chang
Pan Xu
13
1
0
24 Dec 2023
Posterior Sampling with Delayed Feedback for Reinforcement Learning with
  Linear Function Approximation
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation
Nikki Lijing Kuang
Ming Yin
Mengdi Wang
Yu-Xiang Wang
Yian Ma
24
6
0
29 Oct 2023
Faster Convergence of Stochastic Gradient Langevin Dynamics for
  Non-Log-Concave Sampling
Faster Convergence of Stochastic Gradient Langevin Dynamics for Non-Log-Concave Sampling
Difan Zou
Pan Xu
Quanquan Gu
33
35
0
19 Oct 2020
D2RL: Deep Dense Architectures in Reinforcement Learning
D2RL: Deep Dense Architectures in Reinforcement Learning
Samarth Sinha
Homanga Bharadhwaj
A. Srinivas
Animesh Garg
OffRL
AI4CE
43
56
0
19 Oct 2020
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
90
145
0
04 May 2020
On the Convergence of Stochastic Gradient MCMC Algorithms with
  High-Order Integrators
On the Convergence of Stochastic Gradient MCMC Algorithms with High-Order Integrators
Changyou Chen
Nan Ding
Lawrence Carin
32
158
0
21 Oct 2016
MCMC using Hamiltonian dynamics
MCMC using Hamiltonian dynamics
Radford M. Neal
179
3,262
0
09 Jun 2012
1