ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.01290
  4. Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
ArXivPDFHTML

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 1,658 papers shown
Title
Autonomous Driving at Unsignalized Intersections: A Review of
  Decision-Making Challenges and Reinforcement Learning-Based Solutions
Autonomous Driving at Unsignalized Intersections: A Review of Decision-Making Challenges and Reinforcement Learning-Based Solutions
Mohammad K. Al-Sharman
Luc Edes
Bert Sun
Vishal Jayakumar
Mohamed A. Daoud
Derek Rayside
W. Melek
33
1
0
20 Sep 2024
Using High-Level Patterns to Estimate How Humans Predict a Robot will Behave
Using High-Level Patterns to Estimate How Humans Predict a Robot will Behave
Sagar Parekh
Lauren Bramblett
Nicola Bezzo
Dylan P. Losey
37
0
0
20 Sep 2024
Human-Robot Cooperative Distribution Coupling for Hamiltonian-Constrained Social Navigation
Human-Robot Cooperative Distribution Coupling for Hamiltonian-Constrained Social Navigation
Weizheng Wang
Chao Yu
Yu Wang
Byung-Cheol Min
236
2
0
20 Sep 2024
Improving Soft-Capture Phase Success in Space Debris Removal Missions:
  Leveraging Deep Reinforcement Learning and Tactile Feedback
Improving Soft-Capture Phase Success in Space Debris Removal Missions: Leveraging Deep Reinforcement Learning and Tactile Feedback
Bahador Beigomi
Zheng H. Zhu
39
0
0
18 Sep 2024
An Enhanced-State Reinforcement Learning Algorithm for Multi-Task Fusion in Large-Scale Recommender Systems
An Enhanced-State Reinforcement Learning Algorithm for Multi-Task Fusion in Large-Scale Recommender Systems
Peng Liu
Jiawei Zhu
Cong Xu
Ming Zhao
Bin Wang
36
1
0
18 Sep 2024
Automating proton PBS treatment planning for head and neck cancers using
  policy gradient-based deep reinforcement learning
Automating proton PBS treatment planning for head and neck cancers using policy gradient-based deep reinforcement learning
Qingqing Wang
Chang Chang
OffRL
26
0
0
17 Sep 2024
SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning
SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning
Amogh Joshi
Adarsh Kosta
Kaushik Roy
OffRL
55
2
0
16 Sep 2024
KAN v.s. MLP for Offline Reinforcement Learning
KAN v.s. MLP for Offline Reinforcement Learning
Haihong Guo
Fengxin Li
Jiao Li
Hongyan Liu
OffRL
38
0
0
15 Sep 2024
Learning Causally Invariant Reward Functions from Diverse Demonstrations
Learning Causally Invariant Reward Functions from Diverse Demonstrations
Ivan Ovinnikov
Eugene Bykovets
J. M. Buhmann
CML
40
0
0
12 Sep 2024
Autonomous loading of ore piles with Load-Haul-Dump machines using Deep
  Reinforcement Learning
Autonomous loading of ore piles with Load-Haul-Dump machines using Deep Reinforcement Learning
Rodrigo Salas
Francisco Leiva
Javier Ruiz-del-Solar
OffRL
40
0
0
11 Sep 2024
Multi-Type Preference Learning: Empowering Preference-Based
  Reinforcement Learning with Equal Preferences
Multi-Type Preference Learning: Empowering Preference-Based Reinforcement Learning with Equal Preferences
Z. Liu
Junjie Xu
Xingjiao Wu
J. Yang
Liang He
28
0
0
11 Sep 2024
Combating Spatial Disorientation in a Dynamic Self-Stabilization Task
  Using AI Assistants
Combating Spatial Disorientation in a Dynamic Self-Stabilization Task Using AI Assistants
Sheikh Mannan
Paige Hansen
Vivekanand Pandey Vimal
Hannah N. Davies
Paul DiZio
Nikhil Krishnaswamy
45
1
0
09 Sep 2024
Simplex-enabled Safe Continual Learning Machine
Simplex-enabled Safe Continual Learning Machine
H. Cao
Y. Mao
Yihao Cai
L. Sha
Marco Caccamo
44
3
0
05 Sep 2024
Compatible Gradient Approximations for Actor-Critic Algorithms
Compatible Gradient Approximations for Actor-Critic Algorithms
Baturay Saglam
Dionysis Kalogerias
39
0
0
02 Sep 2024
Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control
Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control
Zihao Sheng
Zilin Huang
Sikai Chen
41
9
0
30 Aug 2024
RAIN: Reinforcement Algorithms for Improving Numerical Weather and Climate Models
RAIN: Reinforcement Algorithms for Improving Numerical Weather and Climate Models
Pritthijit Nath
Henry Moss
Emily Shuckburgh
Mark Webb
AI4Cl
AI4CE
56
0
0
28 Aug 2024
Reduce, Reuse, Recycle: Categories for Compositional Reinforcement Learning
Reduce, Reuse, Recycle: Categories for Compositional Reinforcement Learning
Georgios Bakirtzis
M. Savvas
Ruihan Zhao
Sandeep Chinchali
Ufuk Topcu
47
2
0
23 Aug 2024
SAMBO-RL: Shifts-aware Model-based Offline Reinforcement Learning
SAMBO-RL: Shifts-aware Model-based Offline Reinforcement Learning
Wang Luo
Haoran Li
Zicheng Zhang
Congying Han
Jiayu Lv
Tiande Guo
OffRL
50
1
0
23 Aug 2024
Advances in Preference-based Reinforcement Learning: A Review
Advances in Preference-based Reinforcement Learning: A Review
Youssef Abdelkareem
Shady Shehata
Fakhri Karray
OffRL
56
9
0
21 Aug 2024
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
95
3
0
20 Aug 2024
q-exponential family for policy optimization
q-exponential family for policy optimization
Lingwei Zhu
Haseeb Shah
Han Wang
Yukie Nagai
Martha White
OffRL
78
0
0
14 Aug 2024
A Single Goal is All You Need: Skills and Exploration Emerge from
  Contrastive RL without Rewards, Demonstrations, or Subgoals
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
Grace Liu
Michael Tang
Benjamin Eysenbach
OffRL
58
1
0
11 Aug 2024
CURLing the Dream: Contrastive Representations for World Modeling in
  Reinforcement Learning
CURLing the Dream: Contrastive Representations for World Modeling in Reinforcement Learning
V. A. Kich
J. A. Bottega
Raul Steinmetz
Ricardo B. Grando
Ayano Yorozu
Akihisa Ohya
OffRL
51
0
0
11 Aug 2024
Achieving Human Level Competitive Robot Table Tennis
Achieving Human Level Competitive Robot Table Tennis
David B. DÁmbrosio
Saminda Abeyruwan
L. Graesser
Atil Iscen
H. B. Amor
...
Vikas Sindhwani
Vincent Vanhoucke
Grace Vesom
P. Xu
Pannag R Sanketi
95
14
0
07 Aug 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
53
6
0
06 Aug 2024
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
Seyeon Kim
Joonhun Lee
Namhoon Cho
Sungjun Han
Seungeon Baek
60
0
0
05 Aug 2024
A Survey on Self-play Methods in Reinforcement Learning
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
64
8
0
02 Aug 2024
MuJoCo MPC for Humanoid Control: Evaluation on HumanoidBench
MuJoCo MPC for Humanoid Control: Evaluation on HumanoidBench
Moritz Meser
Aditya Bhatt
Boris Belousov
Jan Peters
29
2
0
01 Aug 2024
SAPG: Split and Aggregate Policy Gradients
SAPG: Split and Aggregate Policy Gradients
Jayesh Singla
Ananye Agarwal
Deepak Pathak
OffRL
OnRL
42
3
0
29 Jul 2024
Reinforcement Learning for Sustainable Energy: A Survey
Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse
Felix Kleuker
Márton Fejér
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
OffRL
AI4CE
47
1
0
26 Jul 2024
Functional Acceleration for Policy Mirror Descent
Functional Acceleration for Policy Mirror Descent
Veronica Chelu
Doina Precup
34
0
0
23 Jul 2024
Random Latent Exploration for Deep Reinforcement Learning
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
38
3
0
18 Jul 2024
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
Yi Zhang
Ruihong Qiu
Jiajun Liu
Sen Wang
OffRL
26
0
0
18 Jul 2024
Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion
Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion
Yongyuan Liang
Tingqiang Xu
Kaizhe Hu
Guangqi Jiang
Furong Huang
Huazhe Xu
VGen
LM&Ro
DiffM
55
2
0
15 Jul 2024
Preserving the Privacy of Reward Functions in MDPs through Deception
Preserving the Privacy of Reward Functions in MDPs through Deception
Shashank Reddy Chirra
Pradeep Varakantham
P. Paruchuri
45
0
0
13 Jul 2024
RoboMorph: Evolving Robot Morphology using Large Language Models
RoboMorph: Evolving Robot Morphology using Large Language Models
Kevin Qiu
Krzysztof Ciebiera
Krzysztof Ciebiera
Marek Cygan
Marek Cygan
Łukasz Kuciński
LM&Ro
54
0
0
11 Jul 2024
Frequency and Generalisation of Periodic Activation Functions in Reinforcement Learning
Frequency and Generalisation of Periodic Activation Functions in Reinforcement Learning
Augustine N. Mavor-Parker
Matthew J. Sargent
Caswell Barry
Lewis D. Griffin
Clare Lyle
52
2
0
09 Jul 2024
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Alexander David Goldie
Chris Xiaoxuan Lu
Matthew Jackson
Shimon Whiteson
Jakob N. Foerster
49
3
0
09 Jul 2024
Enhanced Safety in Autonomous Driving: Integrating Latent State
  Diffusion Model for End-to-End Navigation
Enhanced Safety in Autonomous Driving: Integrating Latent State Diffusion Model for End-to-End Navigation
Detian Chu
Linyuan Bai
Jianuo Huang
Zhenlong Fang
Peng Zhang
Wei Kang
Haifeng Lin
50
2
0
08 Jul 2024
FOSP: Fine-tuning Offline Safe Policy through World Models
FOSP: Fine-tuning Offline Safe Policy through World Models
Chenyang Cao
Yucheng Xin
Silang Wu
Longxiang He
Zichen Yan
Junbo Tan
Xueqian Wang
OffRL
69
0
0
06 Jul 2024
The Impact of Quantization and Pruning on Deep Reinforcement Learning
  Models
The Impact of Quantization and Pruning on Deep Reinforcement Learning Models
Heng Lu
Mehdi Alemi
Reza Rawassizadeh
47
1
0
05 Jul 2024
Simplifying Deep Temporal Difference Learning
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
62
17
0
05 Jul 2024
Generation of Geodesics with Actor-Critic Reinforcement Learning to Predict Midpoints
Generation of Geodesics with Actor-Critic Reinforcement Learning to Predict Midpoints
Kazumi Kasaura
38
0
0
02 Jul 2024
To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning
To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning
Tao Ma
Xuzhi Yang
Zoltan Szabo
OffRL
73
0
0
01 Jul 2024
Residual-MPPI: Online Policy Customization for Continuous Control
Residual-MPPI: Online Policy Customization for Continuous Control
Pengcheng Wang
Chenran Li
Catherine Weaver
Kenta Kawamoto
Masayoshi Tomizuka
Chen Tang
Wei Zhan
OffRL
39
3
0
01 Jul 2024
Benchmarks for Reinforcement Learning with Biased Offline Data and
  Imperfect Simulators
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators
Ori Linial
Guy Tennenholtz
Uri Shalit
OffRL
53
1
0
30 Jun 2024
Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with
  Energy-Based Models
Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models
Sangwoong Yoon
Himchan Hwang
Dohyun Kwon
Yung-Kyun Noh
Frank C. Park
44
3
0
30 Jun 2024
Preference Elicitation for Offline Reinforcement Learning
Preference Elicitation for Offline Reinforcement Learning
Alizée Pace
Bernhard Schölkopf
Gunnar Rätsch
Giorgia Ramponi
OffRL
69
1
0
26 Jun 2024
Tolerance of Reinforcement Learning Controllers against Deviations in
  Cyber Physical Systems
Tolerance of Reinforcement Learning Controllers against Deviations in Cyber Physical Systems
Changjian Zhang
Parv Kapoor
Eunsuk Kang
Romulo Meira-Goes
David Garlan
Akila Ganlath
Shatadal Mishra
N. Ammar
47
0
0
24 Jun 2024
Learning Autonomous Race Driving with Action Mapping Reinforcement
  Learning
Learning Autonomous Race Driving with Action Mapping Reinforcement Learning
Yuanda Wang
Xin Yuan
Changyin Sun
47
1
0
21 Jun 2024
Previous
123456...323334
Next