Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
v1
v2 (latest)
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 4,128 papers shown
Title
Learning to Shape by Grinding: Cutting-surface-aware Model-based Reinforcement Learning
Takumi Hachimine
Jun Morimoto
Takamitsu Matsubara
68
5
0
04 Aug 2023
End-to-End Reinforcement Learning of Koopman Models for Economic Nonlinear Model Predictive Control
Daniel Mayfrank
Alexander Mitsos
Manuel Dahmen
71
3
0
03 Aug 2023
Improving Wind Resistance Performance of Cascaded PID Controlled Quadcopters using Residual Reinforcement Learning
Yu Ishihara
Yuichi Hazama
Kousuke Suzuki
Jerry Jun Yokono
K. Sabe
Kenta Kawamoto
26
0
0
03 Aug 2023
Avoidance Navigation Based on Offline Pre-Training Reinforcement Learning
W. Yang
Hao Lei
OffRL
84
1
0
03 Aug 2023
Improving Generalization in Visual Reinforcement Learning via Conflict-aware Gradient Agreement Augmentation
Siao Liu
Zhaoyu Chen
Yang Liu
Yuzheng Wang
Dingkang Yang
...
Ziqing Zhou
Xie Yi
Wei Li
Wenqiang Zhang
Zhongxue Gan
118
24
0
02 Aug 2023
Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark and Case Study for Robotics Manipulation
Zhehua Zhou
Jiayang Song
Xuan Xie
Zhan Shu
Lei Ma
Dikai Liu
Jianxiong Yin
Simon See
64
20
0
31 Jul 2023
End-to-End Reinforcement Learning for Torque Based Variable Height Hopping
Raghav Soni
Daniel Harnack
Hauke Isermann
Sotaro Fushimi
Shivesh Kumar
Frank Kirchner
90
9
0
31 Jul 2023
Value-Informed Skill Chaining for Policy Learning of Long-Horizon Tasks with Surgical Robot
Tao Huang
Kai-xiang Chen
Wang Wei
Jianan Li
Yonghao Long
Qi Dou
OffRL
78
7
0
31 Jul 2023
Rating-based Reinforcement Learning
Devin White
Mingkang Wu
Ellen R. Novoseller
Vernon J. Lawhern
Nicholas R. Waytowich
Yongcan Cao
ALM
78
9
0
30 Jul 2023
Primitive Skill-based Robot Learning from Human Evaluative Feedback
Ayano Hiranaka
Minjune Hwang
Sharon Lee
Chen Wang
Li Fei-Fei
Jiajun Wu
Ruohan Zhang
OffRL
86
12
0
28 Jul 2023
Autonomous Payload Thermal Control
Alejandro D. Mousist
21
0
0
28 Jul 2023
Improvable Gap Balancing for Multi-Task Learning
Yanqi Dai
Nanyi Fei
Zhiwu Lu
77
5
0
28 Jul 2023
Reinforcement Learning by Guided Safe Exploration
Qisong Yang
T. D. Simão
N. Jansen
Simon Tindemans
M. Spaan
OffRL
OnRL
76
5
0
26 Jul 2023
A Constraint Enforcement Deep Reinforcement Learning Framework for Optimal Energy Storage Systems Dispatch
Shengren Hou
Edgar Mauricio Salazar Duque
Peter Palensky
Pedro P. Vergara
35
4
0
26 Jul 2023
Sim-to-Real Model-Based and Model-Free Deep Reinforcement Learning for Tactile Pushing
Max Yang
Yijiong Lin
Alex Church
John Lloyd
Dandan Zhang
David A.W. Barton
Nathan Lepora
OffRL
100
12
0
26 Jul 2023
Offline Reinforcement Learning with On-Policy Q-Function Regularization
Laixi Shi
Robert Dadashi
Yuejie Chi
Pablo Samuel Castro
Matthieu Geist
OffRL
70
5
0
25 Jul 2023
A behavioural transformer for effective collaboration between a robot and a non-stationary human
Ruaridh Mon-Williams
Theodoros Stouraitis
S. Vijayakumar
86
2
0
25 Jul 2023
Communication-Efficient Orchestrations for URLLC Service via Hierarchical Reinforcement Learning
Wei Shi
Milad Ganjalizadeh
H. S. Ghadikolaei
M. Petrova
AI4CE
28
2
0
25 Jul 2023
DIP-RL: Demonstration-Inferred Preference Learning in Minecraft
Ellen R. Novoseller
Vinicius G. Goecks
David Watkins
J. Miller
Nicholas R. Waytowich
OffRL
64
3
0
22 Jul 2023
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs
Qingyang Zhang
Yiming Yang
Jingqing Ruan
Xuantang Xiong
Dengpeng Xing
Bo Xu
63
1
0
22 Jul 2023
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Akash Velu
Skanda Vaidyanath
Dilip Arumugam
OffRL
76
1
0
21 Jul 2023
JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Kaiwen Wang
Junxiong Wang
Yueying Li
Nathan Kallus
Immanuel Trummer
Wen Sun
GP
122
2
0
21 Jul 2023
Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs
Jiayu Chen
Jingdi Chen
Tian-Shing Lan
Vaneet Aggarwal
57
13
0
21 Jul 2023
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization
Xiangsen Wang
Haoran Xu
Yinan Zheng
Xianyuan Zhan
OffRL
98
26
0
21 Jul 2023
Model-based Offline Reinforcement Learning with Count-based Conservatism
Byeongchang Kim
Min Hwan Oh
OffRL
51
12
0
21 Jul 2023
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback
M. Torné
Max Balsells
Zihan Wang
Samedh Desai
Tao Chen
Pulkit Agrawal
Abhishek Gupta
92
8
0
20 Jul 2023
Reparameterized Policy Learning for Multimodal Trajectory Optimization
Zhiao Huang
Litian Liang
Z. Ling
Xuanlin Li
Chuang Gan
H. Su
112
11
0
20 Jul 2023
Technical Challenges of Deploying Reinforcement Learning Agents for Game Testing in AAA Games
Jonas Gillberg
Joakim Bergdahl
Alessandro Sestini
Andy Eakins
Linus Gisslén
OffRL
166
7
0
19 Jul 2023
IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit based on Analyses of Interestingness
Pedro Sequeira
Melinda Gervasio
63
2
0
18 Jul 2023
Basal-Bolus Advisor for Type 1 Diabetes (T1D) Patients Using Multi-Agent Reinforcement Learning (RL) Methodology
Mehrad Jaloli
M. Cescon
OffRL
42
5
0
17 Jul 2023
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Ruida Zhou
Tao-Wen Liu
Min Cheng
D. Kalathil
P. R. Kumar
Chao Tian
104
25
0
17 Jul 2023
Image-based Regularization for Action Smoothness in Autonomous Miniature Racing Car with Deep Reinforcement Learning
Hoang-Giang Cao
I. Lee
Bo-Jiun Hsu
Zheng-Yi Lee
Yu-Wei Shih
Hsueh-Cheng Wang
I-Chen Wu
77
2
0
17 Jul 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
125
5
0
16 Jul 2023
Bayesian inference for data-efficient, explainable, and safe robotic motion planning: A review
Chengmin Zhou
Chao Wang
Haseeb Hassan
H. Shah
Bingding Huang
Pasi Fränti
3DV
103
3
0
16 Jul 2023
Seeing is not Believing: Robust Reinforcement Learning against Spurious Correlation
Wenhao Ding
Laixi Shi
Yuejie Chi
Ding Zhao
OOD
105
21
0
15 Jul 2023
RL-ViGen: A Reinforcement Learning Benchmark for Visual Generalization
Zhecheng Yuan
Sizhe Yang
Pu Hua
C. Chang
Kaizhe Hu
Huazhe Xu
OOD
OffRL
112
20
0
15 Jul 2023
SafeDreamer: Safe Reinforcement Learning with World Models
Weidong Huang
Jiaming Ji
Borong Zhang
Chunhe Xia
Yao-Chun Yang
OffRL
81
19
0
14 Jul 2023
Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning
Marcel Hussing
Jorge Armando Mendez Mendez
Anisha Singrodia
Cassandra Kent
Eric Eaton
OffRL
107
7
0
13 Jul 2023
Hybrid Control Policy for Artificial Pancreas via Ensemble Deep Reinforcement Learning
Wenzhou Lv
Tianyu Wu
Luolin Xiong
Liang Wu
Jianglei Zhou
Yang Tang
Feng Qian
55
2
0
13 Jul 2023
Budgeting Counterfactual for Offline RL
Yao Liu
Pratik Chaudhari
Rasool Fakoor
OffRL
68
3
0
12 Jul 2023
PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks
I. Char
J. Schneider
80
4
0
12 Jul 2023
Bag of Views: An Appearance-based Approach to Next-Best-View Planning for 3D Reconstruction
Sara Hatami Gazani
Matthew Tucsok
I. Mantegh
Homayoun Najjaran
53
4
0
11 Jul 2023
Boosting Feedback Efficiency of Interactive Reinforcement Learning by Adaptive Learning from Scores
Shukai Liu
Chenming Wu
Ying Li
Liang Zhang
88
0
0
11 Jul 2023
A Versatile Door Opening System with Mobile Manipulator through Adaptive Position-Force Control and Reinforcement Learning
Gyuree Kang
Hyunki Seong
Daegyu Lee
David Hyunchul Shim
50
6
0
10 Jul 2023
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
S. E. Ada
Erhan Öztop
Emre Ugur
OffRL
162
23
0
10 Jul 2023
SAR: Generalization of Physiological Agility and Dexterity via Synergistic Action Representation
C. Berg
Vittorio Caggiano
Vikash Kumar
54
15
0
07 Jul 2023
Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning
Seungyong Moon
Junyoung Yeom
Bumsoo Park
Hyun Oh Song
OffRL
95
5
0
07 Jul 2023
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
Idan Shenfeld
Zhang-Wei Hong
Aviv Tamar
Pulkit Agrawal
38
14
0
06 Jul 2023
Learning to Solve Tasks with Exploring Prior Behaviours
Ruiqi Zhu
Siyuan Li
Tianhong Dai
Chongjie Zhang
Oya Celiktutan
109
4
0
06 Jul 2023
Hierarchical Empowerment: Towards Tractable Empowerment-Based Skill Learning
Andrew Levy
Sreehari Rammohan
A. Allievi
S. Niekum
George Konidaris
62
5
0
06 Jul 2023
Previous
1
2
3
...
30
31
32
...
81
82
83
Next