ResearchTrend.AI

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 4,128 papers shown
Learning to Shape by Grinding: Cutting-surface-aware Model-based Reinforcement Learning
Takumi Hachimine
Jun Morimoto
Takamitsu Matsubara
68
5
0
04 Aug 2023
End-to-End Reinforcement Learning of Koopman Models for Economic Nonlinear Model Predictive Control
Daniel Mayfrank
Alexander Mitsos
Manuel Dahmen
71
3
0
03 Aug 2023
Improving Wind Resistance Performance of Cascaded PID Controlled Quadcopters using Residual Reinforcement Learning
Yu Ishihara
Yuichi Hazama
Kousuke Suzuki
Jerry Jun Yokono
K. Sabe
Kenta Kawamoto
26
0
0
03 Aug 2023
Avoidance Navigation Based on Offline Pre-Training Reinforcement Learning
W. Yang
Hao Lei
OffRL
84
1
0
03 Aug 2023
Improving Generalization in Visual Reinforcement Learning via Conflict-aware Gradient Agreement Augmentation
Siao Liu
Zhaoyu Chen
Yang Liu
Yuzheng Wang
Dingkang Yang
...
Ziqing Zhou
Xie Yi
Wei Li
Wenqiang Zhang
Zhongxue Gan
118
24
0
02 Aug 2023
Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark and Case Study for Robotics Manipulation
Zhehua Zhou
Jiayang Song
Xuan Xie
Zhan Shu
Lei Ma
Dikai Liu
Jianxiong Yin
Simon See
64
20
0
31 Jul 2023
End-to-End Reinforcement Learning for Torque Based Variable Height Hopping
Raghav Soni
Daniel Harnack
Hauke Isermann
Sotaro Fushimi
Shivesh Kumar
Frank Kirchner
90
9
0
31 Jul 2023
Value-Informed Skill Chaining for Policy Learning of Long-Horizon Tasks with Surgical Robot
Tao Huang
Kai-xiang Chen
Wang Wei
Jianan Li
Yonghao Long
Qi Dou
OffRL
78
7
0
31 Jul 2023
Rating-based Reinforcement Learning
Devin White
Mingkang Wu
Ellen R. Novoseller
Vernon J. Lawhern
Nicholas R. Waytowich
Yongcan Cao
ALM
78
9
0
30 Jul 2023
Primitive Skill-based Robot Learning from Human Evaluative Feedback
Ayano Hiranaka
Minjune Hwang
Sharon Lee
Chen Wang
Li Fei-Fei
Jiajun Wu
Ruohan Zhang
OffRL
86
12
0
28 Jul 2023
Autonomous Payload Thermal Control
Alejandro D. Mousist
21
0
0
28 Jul 2023
Improvable Gap Balancing for Multi-Task Learning
Yanqi Dai
Nanyi Fei
Zhiwu Lu
77
5
0
28 Jul 2023
Reinforcement Learning by Guided Safe Exploration
Qisong Yang
T. D. Simão
N. Jansen
Simon Tindemans
M. Spaan
OffRL, OnRL
76
5
0
26 Jul 2023
A Constraint Enforcement Deep Reinforcement Learning Framework for Optimal Energy Storage Systems Dispatch
Shengren Hou
Edgar Mauricio Salazar Duque
Peter Palensky
Pedro P. Vergara
35
4
0
26 Jul 2023
Sim-to-Real Model-Based and Model-Free Deep Reinforcement Learning for Tactile Pushing
Max Yang
Yijiong Lin
Alex Church
John Lloyd
Dandan Zhang
David A.W. Barton
Nathan Lepora
OffRL
100
12
0
26 Jul 2023
Offline Reinforcement Learning with On-Policy Q-Function Regularization
Laixi Shi
Robert Dadashi
Yuejie Chi
Pablo Samuel Castro
Matthieu Geist
OffRL
70
5
0
25 Jul 2023
A behavioural transformer for effective collaboration between a robot and a non-stationary human
Ruaridh Mon-Williams
Theodoros Stouraitis
S. Vijayakumar
86
2
0
25 Jul 2023
Communication-Efficient Orchestrations for URLLC Service via Hierarchical Reinforcement Learning
Wei Shi
Milad Ganjalizadeh
H. S. Ghadikolaei
M. Petrova
AI4CE
28
2
0
25 Jul 2023
DIP-RL: Demonstration-Inferred Preference Learning in Minecraft
Ellen R. Novoseller
Vinicius G. Goecks
David Watkins
J. Miller
Nicholas R. Waytowich
OffRL
64
3
0
22 Jul 2023
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs
Qingyang Zhang
Yiming Yang
Jingqing Ruan
Xuantang Xiong
Dengpeng Xing
Bo Xu
63
1
0
22 Jul 2023
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Akash Velu
Skanda Vaidyanath
Dilip Arumugam
OffRL
76
1
0
21 Jul 2023
JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Kaiwen Wang
Junxiong Wang
Yueying Li
Nathan Kallus
Immanuel Trummer
Wen Sun
GP
122
2
0
21 Jul 2023
Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs
Jiayu Chen
Jingdi Chen
Tian-Shing Lan
Vaneet Aggarwal
57
13
0
21 Jul 2023
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization
Xiangsen Wang
Haoran Xu
Yinan Zheng
Xianyuan Zhan
OffRL
98
26
0
21 Jul 2023
Model-based Offline Reinforcement Learning with Count-based Conservatism
Byeongchang Kim
Min Hwan Oh
OffRL
51
12
0
21 Jul 2023
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback
M. Torné
Max Balsells
Zihan Wang
Samedh Desai
Tao Chen
Pulkit Agrawal
Abhishek Gupta
92
8
0
20 Jul 2023
Reparameterized Policy Learning for Multimodal Trajectory Optimization
Zhiao Huang
Litian Liang
Z. Ling
Xuanlin Li
Chuang Gan
H. Su
112
11
0
20 Jul 2023
Technical Challenges of Deploying Reinforcement Learning Agents for Game Testing in AAA Games
Jonas Gillberg
Joakim Bergdahl
Alessandro Sestini
Andy Eakins
Linus Gisslén
OffRL
166
7
0
19 Jul 2023
IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit based on Analyses of Interestingness
Pedro Sequeira
Melinda Gervasio
63
2
0
18 Jul 2023
Basal-Bolus Advisor for Type 1 Diabetes (T1D) Patients Using Multi-Agent Reinforcement Learning (RL) Methodology
Mehrad Jaloli
M. Cescon
OffRL
42
5
0
17 Jul 2023
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Ruida Zhou
Tao-Wen Liu
Min Cheng
D. Kalathil
P. R. Kumar
Chao Tian
104
25
0
17 Jul 2023
Image-based Regularization for Action Smoothness in Autonomous Miniature Racing Car with Deep Reinforcement Learning
Hoang-Giang Cao
I. Lee
Bo-Jiun Hsu
Zheng-Yi Lee
Yu-Wei Shih
Hsueh-Cheng Wang
I-Chen Wu
77
2
0
17 Jul 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
125
5
0
16 Jul 2023
Bayesian inference for data-efficient, explainable, and safe robotic motion planning: A review
Chengmin Zhou
Chao Wang
Haseeb Hassan
H. Shah
Bingding Huang
Pasi Fränti
3DV
103
3
0
16 Jul 2023
Seeing is not Believing: Robust Reinforcement Learning against Spurious Correlation
Wenhao Ding
Laixi Shi
Yuejie Chi
Ding Zhao
OOD
105
21
0
15 Jul 2023
RL-ViGen: A Reinforcement Learning Benchmark for Visual Generalization
Zhecheng Yuan
Sizhe Yang
Pu Hua
C. Chang
Kaizhe Hu
Huazhe Xu
OOD, OffRL
112
20
0
15 Jul 2023
SafeDreamer: Safe Reinforcement Learning with World Models
Weidong Huang
Jiaming Ji
Borong Zhang
Chunhe Xia
Yao-Chun Yang
OffRL
81
19
0
14 Jul 2023
Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning
Marcel Hussing
Jorge Armando Mendez Mendez
Anisha Singrodia
Cassandra Kent
Eric Eaton
OffRL
107
7
0
13 Jul 2023
Hybrid Control Policy for Artificial Pancreas via Ensemble Deep Reinforcement Learning
Wenzhou Lv
Tianyu Wu
Luolin Xiong
Liang Wu
Jianglei Zhou
Yang Tang
Feng Qian
55
2
0
13 Jul 2023
Budgeting Counterfactual for Offline RL
Yao Liu
Pratik Chaudhari
Rasool Fakoor
OffRL
68
3
0
12 Jul 2023
PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks
I. Char
J. Schneider
80
4
0
12 Jul 2023
Bag of Views: An Appearance-based Approach to Next-Best-View Planning for 3D Reconstruction
Sara Hatami Gazani
Matthew Tucsok
I. Mantegh
Homayoun Najjaran
53
4
0
11 Jul 2023
Boosting Feedback Efficiency of Interactive Reinforcement Learning by Adaptive Learning from Scores
Shukai Liu
Chenming Wu
Ying Li
Liang Zhang
88
0
0
11 Jul 2023
A Versatile Door Opening System with Mobile Manipulator through Adaptive Position-Force Control and Reinforcement Learning
Gyuree Kang
Hyunki Seong
Daegyu Lee
David Hyunchul Shim
50
6
0
10 Jul 2023
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
S. E. Ada
Erhan Öztop
Emre Ugur
OffRL
162
23
0
10 Jul 2023
SAR: Generalization of Physiological Agility and Dexterity via Synergistic Action Representation
C. Berg
Vittorio Caggiano
Vikash Kumar
54
15
0
07 Jul 2023
Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning
Seungyong Moon
Junyoung Yeom
Bumsoo Park
Hyun Oh Song
OffRL
95
5
0
07 Jul 2023
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
Idan Shenfeld
Zhang-Wei Hong
Aviv Tamar
Pulkit Agrawal
38
14
0
06 Jul 2023
Learning to Solve Tasks with Exploring Prior Behaviours
Ruiqi Zhu
Siyuan Li
Tianhong Dai
Chongjie Zhang
Oya Celiktutan
109
4
0
06 Jul 2023
Hierarchical Empowerment: Towards Tractable Empowerment-Based Skill Learning
Andrew Levy
Sreehari Rammohan
A. Allievi
S. Niekum
George Konidaris
62
5
0
06 Jul 2023