ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.02811
  4. Cited By
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for
  Addressing Value Estimation Errors

Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors

9 January 2020
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
    OffRL
ArXivPDFHTML

Papers citing "Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors"

50 / 68 papers shown
Title
Secure Physical Layer Communications for Low-Altitude Economy Networking: A Survey
Secure Physical Layer Communications for Low-Altitude Economy Networking: A Survey
Lingyi Cai
Jiacheng Wang
Ruichen Zhang
Y. Zhang
Tao Jiang
Dusit Niyato
Xianbin Wang
Abbas Jamalipour
X. Shen
49
0
0
12 Apr 2025
A Reactive Framework for Whole-Body Motion Planning of Mobile Manipulators Combining Reinforcement Learning and SDF-Constrained Quadratic Programmi
A Reactive Framework for Whole-Body Motion Planning of Mobile Manipulators Combining Reinforcement Learning and SDF-Constrained Quadratic Programmi
Chenyu Zhang
Shiying Sun
Kuan Liu
Chuanbao Zhou
Xiaoguang Zhao
M. Tan
Yuanmin Huang
55
0
0
31 Mar 2025
A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective
A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective
Zhuoren Li
Guizhe Jin
Ran Yu
Z. Chen
Nan I. Li
...
Lu Xiong
Bo Leng
Jia Hu
Ilya Kolmanovsky
Dimitar Filev
54
0
0
31 Mar 2025
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
Yexin Li
Pring Wong
Hanfang Zhang
Shuo Chen
Siyuan Qi
OffRL
54
0
0
23 Mar 2025
Transferable Latent-to-Latent Locomotion Policy for Efficient and Versatile Motion Control of Diverse Legged Robots
Transferable Latent-to-Latent Locomotion Policy for Efficient and Versatile Motion Control of Diverse Legged Robots
Ziang Zheng
Guojian Zhan
Bin Shuai
Shengtao Qin
J. Li
Tao Zhang
Shengbo Eben Li
44
0
0
22 Mar 2025
Adaptive Nesterov Accelerated Distributional Deep Hedging for Efficient Volatility Risk Management
Adaptive Nesterov Accelerated Distributional Deep Hedging for Efficient Volatility Risk Management
Lei Zhao
Lin Cai
Wu-Sheng Lu
47
0
0
25 Feb 2025
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Haque Ishfaq
Guangyuan Wang
Sami Nur Islam
Doina Precup
60
2
0
29 Jan 2025
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration
Yirui Zhou
Xiaowei Liu
Xiaofeng Zhang
Yangchun Zhang
37
0
0
22 Jan 2025
Distributional Reinforcement Learning based Integrated Decision Making
  and Control for Autonomous Surface Vehicles
Distributional Reinforcement Learning based Integrated Decision Making and Control for Autonomous Surface Vehicles
Xi Lin
Paul Szenher
Yewei Huang
Brendan Englot
81
1
0
12 Dec 2024
Conformal Symplectic Optimization for Stable Reinforcement Learning
Conformal Symplectic Optimization for Stable Reinforcement Learning
Yao Lyu
Xiangteng Zhang
Shengbo Eben Li
Jingliang Duan
Letian Tao
Qing Xu
Lei He
Keqiang Li
68
0
0
03 Dec 2024
Risk-sensitive control as inference with Rényi divergence
Risk-sensitive control as inference with Rényi divergence
Kaito Ito
Kenji Kashima
36
1
0
04 Nov 2024
A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement
  Learning and Application in UAV Hovering
A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering
Qihan Qi
Xinsong Yang
Gang Xia
Daniel W. C. Ho
Pengyang Tang
36
0
0
09 Oct 2024
Solving Multi-Goal Robotic Tasks with Decision Transformer
Solving Multi-Goal Robotic Tasks with Decision Transformer
Paul Gajewski
Dominik Zurek
Marcin Pietroñ
Kamil Faber
OffRL
32
1
0
08 Oct 2024
Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning
Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning
Jonas Günster
Puze Liu
Jan Peters
Davide Tateo
OffRL
23
2
0
18 Sep 2024
Parallel Distributional Deep Reinforcement Learning for Mapless
  Navigation of Terrestrial Mobile Robots
Parallel Distributional Deep Reinforcement Learning for Mapless Navigation of Terrestrial Mobile Robots
V. A. Kich
A. H. Kolling
J. C. Jesus
Gabriel V. Heisler
Hiago Jacobs
...
André da Silva Kelbouscas
Akihisa Ohya
Ricardo B. Grando
Paulo Lilles Jorge Drews-Jr
D. T. Gamarra
30
3
0
11 Aug 2024
How to Choose a Reinforcement-Learning Algorithm
How to Choose a Reinforcement-Learning Algorithm
Fabian Bongratz
Vladimir Golkov
Lukas Mautner
Luca Della Libera
Frederik Heetmeyer
Felix Czaja
Julian Rodemann
Daniel Cremers
34
1
0
30 Jul 2024
Rocket Landing Control with Random Annealing Jump Start Reinforcement
  Learning
Rocket Landing Control with Random Annealing Jump Start Reinforcement Learning
Yuxuan Jiang
Yujie Yang
Zhiqian Lan
Guojian Zhan
Shengbo Eben Li
Qi Sun
Jian Ma
Tianwen Yu
Changwu Zhang
38
1
0
21 Jul 2024
OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
Yu-Juan Luo
Tianying Ji
Gang Hua
Jianwei Zhang
Huazhe Xu
Xianyuan Zhan
OffRL
64
3
0
29 May 2024
Diffusion Actor-Critic with Entropy Regulator
Diffusion Actor-Critic with Entropy Regulator
Yinuo Wang
Likun Wang
Yuxuan Jiang
Wenjun Zou
Tong Liu
...
Wenxuan Wang
Liming Xiao
Jiang Wu
Jingliang Duan
Shengbo Eben Li
DiffM
42
10
0
24 May 2024
Control Policy Correction Framework for Reinforcement Learning-based
  Energy Arbitrage Strategies
Control Policy Correction Framework for Reinforcement Learning-based Energy Arbitrage Strategies
Seyed soroush Karimi madahi
Gargya Gokhale
Marie-Sophie Verwee
Bert Claessens
Chris Develder
39
4
0
29 Apr 2024
Canonical Form of Datatic Description in Control Systems
Canonical Form of Datatic Description in Control Systems
Guojian Zhan
Ziang Zheng
Shengbo Eben Li
30
1
0
04 Mar 2024
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User
  Experiences in Recommender Systems
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems
Changshuo Zhang
Sirui Chen
Xiao Zhang
Sunhao Dai
Weijie Yu
Jun Xu
OffRL
35
1
0
17 Jan 2024
Distributional Reinforcement Learning-based Energy Arbitrage Strategies
  in Imbalance Settlement Mechanism
Distributional Reinforcement Learning-based Energy Arbitrage Strategies in Imbalance Settlement Mechanism
Seyed soroush Karimi madahi
Bert Claessens
Chris Develder
29
1
0
23 Dec 2023
Communication-Efficient Soft Actor-Critic Policy Collaboration via
  Regulated Segment Mixture in Internet of Vehicles
Communication-Efficient Soft Actor-Critic Policy Collaboration via Regulated Segment Mixture in Internet of Vehicles
Xiaoxue Yu
Rongpeng Li
Chengchao Liang
Zhifeng Zhao
33
0
0
15 Dec 2023
Mitigating Estimation Errors by Twin TD-Regularized Actor and Critic for
  Deep Reinforcement Learning
Mitigating Estimation Errors by Twin TD-Regularized Actor and Critic for Deep Reinforcement Learning
Junmin Zhong
Ruofan Wu
Jennie Si
OffRL
15
1
0
07 Nov 2023
Training Multi-layer Neural Networks on Ising Machine
Training Multi-layer Neural Networks on Ising Machine
Xujie Song
Tong Liu
Shengbo Eben Li
Jingliang Duan
Wenxuan Wang
Keqiang Li
31
0
0
06 Nov 2023
RiskQ: Risk-sensitive Multi-Agent Reinforcement Learning Value
  Factorization
RiskQ: Risk-sensitive Multi-Agent Reinforcement Learning Value Factorization
Siqi Shen
Chennan Ma
Chao Li
Weiquan Liu
Yongquan Fu
Songzhu Mei
Xinwang Liu
Cheng-Yu Wang
20
10
0
03 Nov 2023
Optimization Landscape of Policy Gradient Methods for Discrete-time
  Static Output Feedback
Optimization Landscape of Policy Gradient Methods for Discrete-time Static Output Feedback
Jingliang Duan
Jie Li
Xuyang Chen
Kai Zhao
Shengbo Eben Li
Lin Zhao
18
5
0
29 Oct 2023
Bridging the Gap between Newton-Raphson Method and Regularized Policy
  Iteration
Bridging the Gap between Newton-Raphson Method and Regularized Policy Iteration
Zeyang Li
Chuxiong Hu
Yunan Wang
Guojian Zhan
Jie Li
Shengbo Eben Li
32
0
0
11 Oct 2023
Distributional Soft Actor-Critic with Three Refinements
Distributional Soft Actor-Critic with Three Refinements
Jingliang Duan
Wenxuan Wang
Liming Xiao
Jiaxin Gao
Shengbo Eben Li
Chang Liu
Ya-Qin Zhang
Bo Cheng
Keqiang Li
OODD
OffRL
22
2
0
09 Oct 2023
Hybrid of representation learning and reinforcement learning for dynamic
  and complex robotic motion planning
Hybrid of representation learning and reinforcement learning for dynamic and complex robotic motion planning
Chengmin Zhou
Xin Lu
Jiapeng Dai
Bingding Huang
Xiaoxu Liu
Pasi Fränti
24
2
0
07 Sep 2023
Parallel Distributional Prioritized Deep Reinforcement Learning for
  Unmanned Aerial Vehicles
Parallel Distributional Prioritized Deep Reinforcement Learning for Unmanned Aerial Vehicles
A. H. Kolling
V. A. Kich
J. C. Jesus
Andressa Cavalcante da Silva
Ricardo B. Grando
Paulo L. J. Drews-Jr
D. T. Gamarra
25
3
0
01 Sep 2023
PACER: A Fully Push-forward-based Distributional Reinforcement Learning
  Algorithm
PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm
Wensong Bai
Chao Zhang
Yichao Fu
Lingwei Peng
Hui Qian
Bin Dai
26
1
0
11 Jun 2023
DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative
  Inference
DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference
Ziyang Zhang
Yang Zhao
Huan Li
Changyao Lin
Jie Liu
40
13
0
02 Jun 2023
Distributional Reinforcement Learning with Dual Expectile-Quantile
  Regression
Distributional Reinforcement Learning with Dual Expectile-Quantile Regression
Sami Jullien
Romain Deffayet
J. Renders
Paul T. Groth
Maarten de Rijke
OOD
77
1
0
26 May 2023
Train a Real-world Local Path Planner in One Hour via Partially
  Decoupled Reinforcement Learning and Vectorized Diversity
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity
Jinghao Xin
Jinwoo Kim
Zehan Li
Ning Li
OffRL
28
3
0
07 May 2023
Invariance to Quantile Selection in Distributional Continuous Control
Invariance to Quantile Selection in Distributional Continuous Control
Felix Grün
Muhammad Saif-ur-Rehman
Tobias Glasmachers
Ioannis Iossifidis
15
0
0
29 Dec 2022
Smoothing Policy Iteration for Zero-sum Markov Games
Smoothing Policy Iteration for Zero-sum Markov Games
Yangang Ren
Yao Lyu
Wenxuan Wang
Sheng Li
Zeyang Li
Jingliang Duan
31
1
0
03 Dec 2022
A Survey on Reinforcement Learning in Aviation Applications
A Survey on Reinforcement Learning in Aviation Applications
Pouria Razzaghi
Amin Tabrizian
Wei Guo
Shulu Chen
Abenezer Taye
Ellis E. Thompson
Alexis Bregeon
Ali Baheri
Peng Wei
OffRL
23
52
0
03 Nov 2022
Safe Model-Based Reinforcement Learning with an Uncertainty-Aware
  Reachability Certificate
Safe Model-Based Reinforcement Learning with an Uncertainty-Aware Reachability Certificate
Dongjie Yu
Wenjun Zou
Yujie Yang
Haitong Ma
Sheng Li
Jingliang Duan
Jianyu Chen
29
14
0
14 Oct 2022
Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous
  Driving via Semantic Masked World Model
Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model
Zeyu Gao
Yao Mu
Chen Chen
Yangang Ren
Shengbo Eben Li
Ping Luo
Yanfeng Lu
17
27
0
08 Oct 2022
Synthesize Efficient Safety Certificates for Learning-Based Safe Control
  using Magnitude Regularization
Synthesize Efficient Safety Certificates for Learning-Based Safe Control using Magnitude Regularization
Haotian Zheng
Haitong Ma
Sifa Zheng
Shengbo Eben Li
Jianqiang Wang
15
1
0
23 Sep 2022
Revisiting Discrete Soft Actor-Critic
Revisiting Discrete Soft Actor-Critic
Haibin Zhou
Zichuan Lin
Junyou Li
Qiang Fu
Wei Yang
Deheng Ye
46
12
0
21 Sep 2022
On the Optimization Landscape of Dynamic Output Feedback: A Case Study
  for Linear Quadratic Regulator
On the Optimization Landscape of Dynamic Output Feedback: A Case Study for Linear Quadratic Regulator
Jingliang Duan
Wenhan Cao
Yanggu Zheng
Lin Zhao
20
3
0
12 Sep 2022
A Risk-Sensitive Approach to Policy Optimization
A Risk-Sensitive Approach to Policy Optimization
Jared Markowitz
Ryan W. Gardner
Ashley J. Llorens
R. Arora
I-J. Wang
OffRL
29
6
0
19 Aug 2022
Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous
  Control
Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control
T. Kanazawa
Haiyan Wang
Chetan Gupta
UQCV
27
4
0
27 Jul 2022
Reachability Constrained Reinforcement Learning
Reachability Constrained Reinforcement Learning
Dongjie Yu
Haitong Ma
Sheng Li
Jianyu Chen
63
54
0
16 May 2022
Diverse Imitation Learning via Self-Organizing Generative Models
Diverse Imitation Learning via Self-Organizing Generative Models
Arash Vahabpour
Tianyi Wang
Qiujing Lu
Omead Brandon Pooladzandi
V. Roychowdhury
SSL
26
1
0
06 May 2022
MicroRacer: a didactic environment for Deep Reinforcement Learning
MicroRacer: a didactic environment for Deep Reinforcement Learning
Andrea Asperti
Marco Del Brutto
27
0
0
20 Mar 2022
Conservative Distributional Reinforcement Learning with Safety
  Constraints
Conservative Distributional Reinforcement Learning with Safety Constraints
Hengrui Zhang
Youfang Lin
Sheng Han
Shuo Wang
Kai Lv
OffRL
21
5
0
18 Jan 2022
12
Next