Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
v1
v2 (latest)
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 4,130 papers shown
Title
ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI
Stone Tao
Fanbo Xiang
Arth Shukla
Yuzhe Qin
Xander Hinrichsen
...
Shan Luo
Roberto Calandra
Rui Chen
Shan Luo
Hao Su
120
38
0
01 Oct 2024
Continuously Improving Mobile Manipulation with Autonomous Real-World RL
Russell Mendonca
Emmanuel Panov
Bernadette Bucher
Jiuguang Wang
Deepak Pathak
OffRL
91
7
0
30 Sep 2024
Inferring Preferences from Demonstrations in Multi-objective Reinforcement Learning
Junlin Lu
Patrick Mannion
Karl Mason
66
1
0
30 Sep 2024
Enabling Multi-Robot Collaboration from Single-Human Guidance
Zhengran Ji
Lingyu Zhang
Paul Sajda
Boyuan Chen
79
2
0
30 Sep 2024
Task-Agnostic Pre-training and Task-Guided Fine-tuning for Versatile Diffusion Planner
Chenyou Fan
Chenjia Bai
Zhao Shan
Haoran He
Yang Zhang
Zhen Wang
113
3
0
30 Sep 2024
Constrained Reinforcement Learning for Safe Heat Pump Control
Baohe Zhang
Lilli Frison
Thomas Brox
Joschka Bödecker
AI4CE
80
0
0
29 Sep 2024
Focus On What Matters: Separated Models For Visual-Based RL Generalization
Di Zhang
Bowen Lv
Hai Zhang
Feifan Yang
Junqiao Zhao
Hang Yu
Chang Huang
Hongtu Zhou
Chen Ye
Changjun Jiang
92
3
0
29 Sep 2024
LiRA: Light-Robust Adversary for Model-based Reinforcement Learning in Real World
Taisuke Kobayashi
140
2
0
29 Sep 2024
CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models
Kanghyun Ryu
Qiayuan Liao
Zhongyu Li
Koushil Sreenath
Negar Mehr
Negar Mehr
LM&Ro
362
4
0
27 Sep 2024
Model-Free versus Model-Based Reinforcement Learning for Fixed-Wing UAV Attitude Control Under Varying Wind Conditions
David Olivares
Pierre Fournier
Pavan Vasishta
Julien Marzat
85
0
0
26 Sep 2024
Behavior evolution-inspired approach to walking gait reinforcement training for quadruped robots
Yu Wang
Wenchuan Jia
Yi Sun
Dong He
59
0
0
25 Sep 2024
FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning
Jiaheng Hu
Rose Hendrix
Ali Farhadi
Aniruddha Kembhavi
Roberto Martín-Martín
Peter Stone
Kuo-Hao Zeng
Kiana Ehsani
133
15
0
25 Sep 2024
The Roles of Generative Artificial Intelligence in Internet of Electric Vehicles
Hanwen Zhang
Dusit Niyato
Wei Zhang
Changyuan Zhao
Hongyang Du
Abbas Jamalipour
Sumei Sun
Yiyang Pei
AI4CE
75
2
0
24 Sep 2024
SurgIRL: Towards Life-Long Learning for Surgical Automation by Incremental Reinforcement Learning
Yun-Jie Ho
Zih-Yun Chiu
Yuheng Zhi
Michael C. Yip
OffRL
116
0
0
24 Sep 2024
Context-Based Meta Reinforcement Learning for Robust and Adaptable Peg-in-Hole Assembly Tasks
Ahmed Shokry
Walid Gomaa
Tobias Zaenker
Murad Dawood
Shady A. Maged
Mohammed I. Awad
Maren Bennewitz
Maren Bennewitz
OffRL
85
0
0
24 Sep 2024
Safe Navigation for Robotic Digestive Endoscopy via Human Intervention-based Reinforcement Learning
Min Tan
Yushun Tao
Boyun Zheng
GaoSheng Xie
Lijuan Feng
Zeyang Xia
Jing Xiong
103
0
0
24 Sep 2024
COSBO: Conservative Offline Simulation-Based Policy Optimization
E. Kargar
Ville Kyrki
OffRL
71
0
0
22 Sep 2024
R-AIF: Solving Sparse-Reward Robotic Tasks from Pixels with Active Inference and World Models
Viet Dung Nguyen
Zhizhuo Yang
Christopher L. Buckley
Alexander Ororbia
96
4
0
21 Sep 2024
RPAF: A Reinforcement Prediction-Allocation Framework for Cache Allocation in Large-Scale Recommender Systems
Shuo Su
Xiaoshuang Chen
Yao Wang
Yulin Wu
Ziqiang Zhang
Kaiqiao Zhan
Ben Wang
Kun Gai
AI4TS
80
1
0
20 Sep 2024
Autonomous Driving at Unsignalized Intersections: A Review of Decision-Making Challenges and Reinforcement Learning-Based Solutions
Mohammad K. Al-Sharman
Luc Edes
Bert Sun
Vishal Jayakumar
Mohamed A. Daoud
Derek Rayside
W. Melek
87
2
0
20 Sep 2024
Using High-Level Patterns to Estimate How Humans Predict a Robot will Behave
Sagar Parekh
Lauren Bramblett
Nicola Bezzo
Dylan P. Losey
109
0
0
20 Sep 2024
Human-Robot Cooperative Distribution Coupling for Hamiltonian-Constrained Social Navigation
Weizheng Wang
Chao Yu
Yu Wang
Byung-Cheol Min
427
2
0
20 Sep 2024
Infrastructure-less UWB-based Active Relative Localization
Valerio Brunacci
Alberto Dionigi
Alessio De Angelis
Gabriele Costante
77
1
0
19 Sep 2024
Fine Manipulation Using a Tactile Skin: Learning in Simulation and Sim-to-Real Transfer
Ulf Kasolowsky
Berthold Bäuml
65
2
0
19 Sep 2024
Improving Soft-Capture Phase Success in Space Debris Removal Missions: Leveraging Deep Reinforcement Learning and Tactile Feedback
Bahador Beigomi
Zheng H. Zhu
65
0
0
18 Sep 2024
Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning
Jonas Günster
Puze Liu
Jan Peters
Davide Tateo
OffRL
64
3
0
18 Sep 2024
Representing Positional Information in Generative World Models for Object Manipulation
Stefano Ferraro
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Sai Rajeswar
LM&Ro
OCL
91
0
0
18 Sep 2024
Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning
Claude Formanek
Louise Beyers
C. Tilbury
Jonathan P. Shock
Arnu Pretorius
OffRL
88
2
0
18 Sep 2024
Discovering Conceptual Knowledge with Analytic Ontology Templates for Articulated Objects
Jianhua Sun
Yuxuan Li
Longfei Xu
Jiude Wei
Liang Chai
Cewu Lu
87
2
0
18 Sep 2024
An Enhanced-State Reinforcement Learning Algorithm for Multi-Task Fusion in Large-Scale Recommender Systems
Peng Liu
Jiawei Zhu
Cong Xu
Ming Zhao
Bin Wang
118
1
0
18 Sep 2024
Automating proton PBS treatment planning for head and neck cancers using policy gradient-based deep reinforcement learning
Qingqing Wang
Chang Chang
OffRL
32
1
0
17 Sep 2024
MoDex: Planning High-Dimensional Dexterous Control via Learning Neural Internal Models
Tong Wu
Shoujie Li
Chuqiao Lyu
Kit-Wa Sou
Wang Sing Chan
Wenbo Ding
113
0
0
17 Sep 2024
SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning
Amogh Joshi
Adarsh Kosta
Kaushik Roy
OffRL
109
2
0
16 Sep 2024
KAN v.s. MLP for Offline Reinforcement Learning
Haihong Guo
Fengxin Li
Jiao Li
Hongyan Liu
OffRL
80
0
0
15 Sep 2024
Vision-driven UAV River Following: Benchmarking with Safe Reinforcement Learning
Zihan Wang
N. Mahmoudian
82
2
0
13 Sep 2024
xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing
Haoyi Niu
Qimao Chen
Tenglong Liu
Jianxiong Li
Guyue Zhou
Yi Zhang
Jianming Hu
Xianyuan Zhan
104
0
0
13 Sep 2024
Stochastic Reinforcement Learning with Stability Guarantees for Control of Unknown Nonlinear Systems
Thanin Quartz
Ruikun Zhou
H. Sterck
Jun Liu
29
1
0
12 Sep 2024
Towards Online Safety Corrections for Robotic Manipulation Policies
Ariana Spalter
Mark Roberts
Laura M. Hiatt
OffRL
OnRL
30
0
0
12 Sep 2024
Composing Option Sequences by Adaptation: Initial Results
Charles A. Meehan
Paul Rademacher
Mark Roberts
Laura M. Hiatt
52
0
0
12 Sep 2024
Learning Causally Invariant Reward Functions from Diverse Demonstrations
Ivan Ovinnikov
Eugene Bykovets
J. M. Buhmann
CML
96
0
0
12 Sep 2024
Autonomous loading of ore piles with Load-Haul-Dump machines using Deep Reinforcement Learning
Rodrigo Salas
Francisco Leiva
Javier Ruiz-del-Solar
OffRL
44
0
0
11 Sep 2024
Multi-Type Preference Learning: Empowering Preference-Based Reinforcement Learning with Equal Preferences
Z. Liu
Junjie Xu
Xingjiao Wu
J. Yang
Liang He
92
0
0
11 Sep 2024
Combating Spatial Disorientation in a Dynamic Self-Stabilization Task Using AI Assistants
Sheikh Mannan
Paige Hansen
Vivekanand Pandey Vimal
Hannah N. Davies
Paul DiZio
Nikhil Krishnaswamy
62
1
0
09 Sep 2024
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Zhao Shan
Chenyou Fan
Shuang Qiu
Jiyuan Shi
Chenjia Bai
112
4
0
09 Sep 2024
Reward-Directed Score-Based Diffusion Models via q-Learning
Xuefeng Gao
Jiale Zha
X. Zhou
DiffM
75
3
0
07 Sep 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Hongyao Tang
Glen Berseth
OffRL
103
2
0
07 Sep 2024
Advancing Multi-Organ Disease Care: A Hierarchical Multi-Agent Reinforcement Learning Framework
Daniel J. Tan
Qianyi Xu
K. See
Dilruk Perera
Mengling Feng
AI4CE
34
0
0
06 Sep 2024
Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance
Renming Huang
Shaochong Liu
Yunqiang Pei
Peng Wang
Guoqing Wang
Yang Yang
Hengtao Shen
OffRL
93
0
0
06 Sep 2024
Simplex-enabled Safe Continual Learning Machine
H. Cao
Y. Mao
Yihao Cai
L. Sha
Marco Caccamo
80
3
0
05 Sep 2024
RoboKoop: Efficient Control Conditioned Representations from Visual Input in Robotics using Koopman Operator
Hemant Kumawat
Biswadeep Chakraborty
Saibal Mukhopadhyay
131
3
0
04 Sep 2024
Previous
1
2
3
...
11
12
13
...
81
82
83
Next