1801.01290
Cited By
v1
v2 (latest)
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 4,130 papers shown
Title
Robust Contact-rich Manipulation through Implicit Motor Adaptation
Teng Xue
Amirreza Razmjoo
Suhan Shetty
Sylvain Calinon
193
1
0
16 Dec 2024
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
186
1
0
16 Dec 2024
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors
Fengshuo Bai
Runze Liu
Yali Du
Ying Wen
Yaodong Yang
AAML
130
5
0
14 Dec 2024
Advances in Transformers for Robotic Applications: A Review
Nikunj Sanghai
Nik Bear Brown
AI4CE
150
0
0
13 Dec 2024
Distributional Reinforcement Learning based Integrated Decision Making and Control for Autonomous Surface Vehicles
Xi Lin
Paul Szenher
Yewei Huang
Brendan Englot
113
1
0
12 Dec 2024
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
Zhen Liu
Tim Z. Xiao
Weiyang Liu
Yoshua Bengio
Dinghuai Zhang
283
6
0
10 Dec 2024
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
Max Sobol Mark
Tian Gao
Georgia Gabriela Sampaio
Mohan Kumar Srirama
Archit Sharma
Chelsea Finn
Aviral Kumar
OffRL
OnRL
192
10
0
09 Dec 2024
ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks
Arth Shukla
Stone Tao
Hao Su
198
6
0
09 Dec 2024
Learning Speed-Adaptive Walking Agent Using Imitation Learning with Physics-Informed Simulation
Yi-Hung Chiu
Ung Hee Lee
Changseob Song
Manaen Hu
Inseung Kang
AI4CE
76
1
0
05 Dec 2024
Inverse Delayed Reinforcement Learning
S. Zhan
Qingyuan Wu
Zhian Ruan
Frank Yang
Philip Wang
Yixuan Wang
Ruochen Jiao
Chao Huang
Qi Zhu
146
0
0
04 Dec 2024
MEP-Net: Generating Solutions to Scientific Problems with Limited Knowledge by Maximum Entropy Principle
Wuyue Yang
Liangrong Peng
Guojie Li
L. Hong
72
0
0
03 Dec 2024
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Cevahir Köprülü
Po-han Li
Tianyu Qiu
Ruihan Zhao
T. Westenbroek
David Fridovich-Keil
Sandeep Chinchali
Ufuk Topcu
OffRL
144
0
0
02 Dec 2024
RoboHanger: Learning Generalizable Robotic Hanger Insertion for Diverse Garments
Yuxing Chen
Songlin Wei
Bowen Xiao
Jiangran Lyu
Jiayi Chen
Feng Zhu
Hongan Wang
143
0
0
02 Dec 2024
A Cross-Scene Benchmark for Open-World Drone Active Tracking
Haowei Sun
Jinwu Hu
Zhirui Zhang
Haoyuan Tian
Xinze Xie
Yufeng Wang
Zhuliang Yu
Xiaohua Xie
Mingkui Tan
139
0
0
01 Dec 2024
Supervised Learning-enhanced Multi-Group Actor Critic for Live Stream Allocation in Feed
Jingxin Liu
Xiang Gao
Yisha Li
Xin Li
Haiyang Lu
Ben Wang
OffRL
130
0
0
28 Nov 2024
Application of Soft Actor-Critic Algorithms in Optimizing Wastewater Treatment with Time Delays Integration
Esmaeel Mohammadi
D. O. Arroyo
A. A. Hansen
Mikkel Stokholm-Bjerregaard
S. Gros
Akhil S. Anand
Petar Durdevic
97
0
0
27 Nov 2024
Monocular Obstacle Avoidance Based on Inverse PPO for Fixed-wing UAVs
Haochen Chai
Meimei Su
Yang Lyu
Zhunga Liu
Chunhui Zhao
Quan Pan
119
0
0
27 Nov 2024
Object-centric proto-symbolic behavioural reasoning from pixels
R. S. V. Bergen
Justus F. Hübotter
Pablo Lanillos
LM&Ro
OCL
214
1
0
26 Nov 2024
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
Shiron Thalagala
Pak Kin Wong
Xiaozheng Wang
Tianang Sun
OffRL
181
0
0
24 Nov 2024
Safe Multi-Agent Reinforcement Learning with Convergence to Generalized Nash Equilibrium
Zeyang Li
Navid Azizan
135
1
0
22 Nov 2024
Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward
Zhiwei Jia
Yuesong Nan
Huixi Zhao
Gengdai Liu
EGVM
203
1
0
22 Nov 2024
Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation
Huy Le
Miroslav Gabriel
Tai Hoang
Gerhard Neumann
Ngo Anh Vien
188
1
0
22 Nov 2024
Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems
Egor E. Nuzhin
Nikolai V. Brilliantov
103
1
0
21 Nov 2024
ReinFog: A DRL Empowered Framework for Resource Management in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
106
1
0
20 Nov 2024
SuPLE: Robot Learning with Lyapunov Rewards
Phu Nguyen
Daniel Polani
Stas Tiomkin
109
0
0
20 Nov 2024
UBSoft: A Simulation Platform for Robotic Skill Learning in Unbounded Soft Environments
Chunru Lin
Jugang Fan
Yian Wang
Zeyuan Yang
Zhehuan Chen
Lixing Fang
Tsun-Hsuan Wang
Zhou Xian
Chuang Gan
128
2
0
19 Nov 2024
SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks
Yongyan Wen
Siyuan Li
Rongchang Zuo
Lei Yuan
Hangyu Mao
P. Liu
140
0
0
19 Nov 2024
A Pre-Trained Graph-Based Model for Adaptive Sequencing of Educational Documents
Jean Vassoyan
Anan Schütt
Jill-Jênn Vie
Arun-Balajiee Lekshmi-Narayanan
Elisabeth André
Nicolas Vayatis
AI4Ed
127
0
0
18 Nov 2024
Continual Task Learning through Adaptive Policy Self-Composition
Shengchao Hu
Yuhang Zhou
Ziqing Fan
Jifeng Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
130
0
0
18 Nov 2024
Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation
Zhihong Liu
Long Qian
Zeyang Liu
Lipeng Wan
Xingyu Chen
Xuguang Lan
OffRL
151
3
0
18 Nov 2024
Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
Feng Chen
Fuguang Han
Cong Guan
Lei Yuan
Zhilong Zhang
Yang Yu
Zongzhang Zhang
98
1
0
16 Nov 2024
Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation
Yihong Guo
Yixuan Wang
Yuanyuan Shi
Pan Xu
Anqi Liu
77
5
0
15 Nov 2024
Precision-Focused Reinforcement Learning Model for Robotic Object Pushing
Lara Bergmann
David P. Leins
R. Haschke
Klaus Neumann
95
3
0
13 Nov 2024
Robotic Control Optimization Through Kernel Selection in Safe Bayesian Optimization
Lihao Zheng
Hongxuan Wang
Xiaocong Li
Jun Ma
P. Vadakkepat
48
1
0
12 Nov 2024
'Explaining RL Decisions with Trajectories': A Reproducibility Study
Karim Abdel Sadek
Matteo Nulli
Joan Velja
Jort Vincenti
59
0
0
11 Nov 2024
OCMDP: Observation-Constrained Markov Decision Process
Taiyi Wang
Jianheng Liu
Bryan Lee
Zhihao Wu
Yu Wu
126
1
0
11 Nov 2024
GSL-PCD: Improving Generalist-Specialist Learning with Point Cloud Feature-based Task Partitioning
Xiu Yuan
78
0
0
11 Nov 2024
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
A. Jain
Harley Wiltzer
Jesse Farebrother
Irina Rish
Glen Berseth
Sanjiban Choudhury
143
2
0
11 Nov 2024
Grounding Video Models to Actions through Goal Conditioned Exploration
Yunhao Luo
Yilun Du
LM&Ro
VGen
161
5
0
11 Nov 2024
State Chrono Representation for Enhancing Generalization in Reinforcement Learning
Jianda Chen
Wen Zheng Terence Ng
Zichen Chen
Sinno Jialin Pan
Tianwei Zhang
OffRL
78
0
0
09 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
102
6
0
08 Nov 2024
Evaluating Robustness of Reinforcement Learning Algorithms for Autonomous Shipping
Bavo Lesy
Ali Anwar
Siegfried Mercelis
69
0
0
07 Nov 2024
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
Heyang Zhao
Chenlu Ye
Quanquan Gu
Tong Zhang
OffRL
234
6
0
07 Nov 2024
Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset
Alexandre Galashov
Michalis K. Titsias
Andras Gyorgy
Clare Lyle
Razvan Pascanu
Yee Whye Teh
Maneesh Sahani
107
4
0
06 Nov 2024
Risk-sensitive control as inference with Rényi divergence
Kaito Ito
Kenji Kashima
71
1
0
04 Nov 2024
Show, Don't Tell: Learning Reward Machines from Demonstrations for Reinforcement Learning-Based Cardiac Pacemaker Synthesis
John Komp
Dananjay Srinivas
Maria Leonor Pacheco
Ashutosh Trivedi
AI4TS
56
0
0
04 Nov 2024
Diversity Progress for Goal Selection in Discriminability-Motivated RL
Erik M. Lintunen
Nadia M. Ady
Christian Guckelsberger
84
1
0
03 Nov 2024
Task-Aware Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
Ziqing Fan
Shengchao Hu
Yuhang Zhou
Li Shen
Ya Zhang
Yanfeng Wang
Dacheng Tao
OffRL
117
0
0
02 Nov 2024
Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Tian Xu
Zhilong Zhang
Ruishuo Chen
Yihao Sun
Yang Yu
88
1
0
01 Nov 2024
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions
Rui Yang
Jie Wang
Guoping Wu
Yangqiu Song
AAML
OffRL
145
3
0
01 Nov 2024