Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
v1
v2 (latest)
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 4,130 papers shown
Title
Deep Reinforcement Learning for Autonomous Vehicle Intersection Navigation
Badr Ben Elallid
H. E. Alaoui
N. Benamar
39
4
0
30 Sep 2023
Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning
Daoce Wang
Chi Jin
OffRL
DiffM
101
35
0
29 Sep 2023
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness
Xiaoyu Wen
Xudong Yu
Rui Yang
Chenjia Bai
Zhen Wang
OffRL
OnRL
81
10
0
29 Sep 2023
TranDRL: A Transformer-Driven Deep Reinforcement Learning Enabled Prescriptive Maintenance Framework
Yang Zhao
Jiaxi Yang
Wenbo Wang
Helin Yang
Dusit Niyato
AI4TS
AI4CE
102
9
0
29 Sep 2023
On Generating Explanations for Reinforcement Learning Policies: An Empirical Study
Mikihisa Yuasa
Huy T. Tran
R. Sreenivas
FAtt
LRM
151
1
0
29 Sep 2023
Predicting Object Interactions with Behavior Primitives: An Application in Stowing Tasks
Haonan Chen
Yilong Niu
Kaiwen Hong
Shuijing Liu
Yixuan Wang
Yunzhu Li
Katherine Driggs-Campbell
77
13
0
28 Sep 2023
HyperPPO: A scalable method for finding small policies for robotic control
Luming Tang
Zhehui Huang
Gaurav Sukhatme
76
4
0
28 Sep 2023
RLLTE: Long-Term Evolution Project of Reinforcement Learning
Tao Lv
Zequn Zhang
Yang Xu
Shihao Luo
Bo Li
Xin Jin
Wenjun Zeng
OffRL
82
1
0
28 Sep 2023
Intrinsic Language-Guided Exploration for Complex Long-Horizon Robotic Manipulation Tasks
Wenke Huang
Filippos Christianos
Zhibin Li
78
9
0
28 Sep 2023
Learning to Terminate in Object Navigation
Yuhang Song
Anh Nguyen
Chun-Yi Lee
64
3
0
28 Sep 2023
Task-Oriented Koopman-Based Control with Contrastive Encoder
Xubo Lyu
Hanyang Hu
Seth Siriya
Ye Pu
Mo Chen
90
8
0
28 Sep 2023
Distill Knowledge in Multi-task Reinforcement Learning with Optimal-Transport Regularization
Bang Giang Le
Viet-Cuong Ta
OT
85
1
0
27 Sep 2023
Maximum diffusion reinforcement learning
Thomas A. Berrueta
Allison Pinosky
Todd Murphey
AI4CE
DiffM
114
5
0
26 Sep 2023
V2X-Lead: LiDAR-based End-to-End Autonomous Driving with Vehicle-to-Everything Communication Integration
Zhi-Guo Deng
Yanjun Shi
Weiming Shen
123
0
0
26 Sep 2023
Zero-Shot Reinforcement Learning from Low Quality Data
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
OnRL
92
4
0
26 Sep 2023
Tempo Adaptation in Non-stationary Reinforcement Learning
Hyunin Lee
Yuhao Ding
Jongmin Lee
Ming Jin
Javad Lavaei
Somayeh Sojoudi
81
3
0
26 Sep 2023
Effective Multi-Agent Deep Reinforcement Learning Control with Relative Entropy Regularization
Chenyang Miao
Yunduan Cui
Huiyun Li
Xin Wu
128
5
0
26 Sep 2023
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Nate Rahn
P. DÓro
Harley Wiltzer
Pierre-Luc Bacon
Marc G. Bellemare
98
3
0
26 Sep 2023
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning
Lukas Schneider
Jonas Frey
Takahiro Miki
Marco Hutter
89
12
0
25 Sep 2023
Enhancing data efficiency in reinforcement learning: a novel imagination mechanism based on mesh information propagation
Zihang Wang
Maowei Jiang
AI4CE
84
0
0
25 Sep 2023
Hierarchical Reinforcement Learning Based on Planning Operators
Jing Zhang
Emmanuel Dean
Karinne Ramirez-Amaro
OffRL
76
3
0
25 Sep 2023
Stackelberg Driver Model for Continual Policy Improvement in Scenario-Based Closed-Loop Autonomous Driving
Haoyi Niu
Qimao Chen
Yingyue Li
Yi Zhang
Jianming Hu
111
3
0
25 Sep 2023
Continual Driving Policy Optimization with Closed-Loop Individualized Curricula
Haoyi Niu
Yi Tian Xu
Xingjian Jiang
Jianming Hu
154
3
0
25 Sep 2023
ODE-based Recurrent Model-free Reinforcement Learning for POMDPs
Xu Zhao
Duzhen Zhang
Liyuan Han
Tielin Zhang
Bo Xu
88
7
0
25 Sep 2023
Policy Stitching: Learning Transferable Robot Policies
Pingcheng Jian
Easop Lee
Zachary I. Bell
Michael M. Zavlanos
Boyuan Chen
OffRL
61
8
0
24 Sep 2023
A Neural-Guided Dynamic Symbolic Network for Exploring Mathematical Expressions from Data
Wenqiang Li
Weijun Li
Lina Yu
Min Wu
Linjun Sun
Jingyi Liu
Yanjie Li
Shu Wei
Yusong Deng
Meilan Hao
74
6
0
24 Sep 2023
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization
Hai Zhang
Hang Yu
Junqiao Zhao
Di Zhang
Chang Huang
Hongtu Zhou
Xiao Zhang
Chen Ye
97
10
0
22 Sep 2023
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
96
20
0
22 Sep 2023
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control
Botian Xu
Feng Gao
Chao Yu
Chao Yu
Yi Wu
Yu Wang
112
30
0
22 Sep 2023
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
Haoyi Niu
Tianying Ji
Bingqi Liu
Haocheng Zhao
Xiangyu Zhu
Jianying Zheng
Pengfei Huang
Guyue Zhou
Jianming Hu
Xianyuan Zhan
OffRL
OnRL
AI4CE
129
9
0
22 Sep 2023
Uncertainty-driven Exploration Strategies for Online Grasp Learning
Yitian Shi
Philipp Schillinger
Miroslav Gabriel
Alexander Kuss
Zohar Feldman
Hanna Ziesche
Ngo Anh Vien
OffRL
OnRL
61
4
0
21 Sep 2023
Learning to Recover for Safe Reinforcement Learning
Haoyu Wang
Xin Yuan
Qinqing Ren
63
0
0
21 Sep 2023
Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
Tianbao Xie
Siheng Zhao
Chen Henry Wu
Yitao Liu
Qian Luo
Victor Zhong
Yanchao Yang
Tao Yu
LM&Ro
139
65
0
20 Sep 2023
Multi-Step Model Predictive Safety Filters: Reducing Chattering by Increasing the Prediction Horizon
Federico Pizarro Bejarano
Lukas Brunke
Angela P. Schoellig
72
9
0
20 Sep 2023
Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling
Wenjun Huang
Yunduan Cui
Huiyun Li
Xin Wu
MU
128
0
0
20 Sep 2023
Memory-based Controllers for Efficient Data-driven Control of Soft Robots
Yuzhe Wu
Ehsan Nekouei
32
3
0
19 Sep 2023
Guided Online Distillation: Promoting Safe Reinforcement Learning by Offline Demonstration
Jinning Li
Xinyi Liu
Banghua Zhu
Jiantao Jiao
Masayoshi Tomizuka
Chen Tang
Wei Zhan
OffRL
OnRL
110
10
0
18 Sep 2023
Visual Forecasting as a Mid-level Representation for Avoidance
Hsuan-Kung Yang
Tsung-Chih Chiang
Ting-Ru Liu
Chun-Wei Huang
Jou-Min Liu
Chun-Yi Lee
77
0
0
17 Sep 2023
From Knowing to Doing: Learning Diverse Motor Skills through Instruction Learning
Linqi Ye
Jiayi Li
Yi Cheng
Xianhao Wang
Bin Liang
Yan Peng
67
7
0
17 Sep 2023
Projected Task-Specific Layers for Multi-Task Reinforcement Learning
Josselin Somerville Roberts
Julia Di
45
1
0
15 Sep 2023
A Bayesian Approach to Robust Inverse Reinforcement Learning
Ran Wei
Siliang Zeng
Chenliang Li
Alfredo García
Anthony D. McDonald
Mingyi Hong
OffRL
90
4
0
15 Sep 2023
VAPOR: Legged Robot Navigation in Outdoor Vegetation Using Offline Reinforcement Learning
K. Weerakoon
A. Sathyamoorthy
Mohamed Bashir Elnoor
Dinesh Manocha
OffRL
60
0
0
14 Sep 2023
Deep Reinforcement Learning-based Scheduling for Optimizing System Load and Response Time in Edge and Fog Computing Environments
Zhiyu Wang
M. Goudarzi
Mingming Gong
Rajkumar Buyya
86
65
0
14 Sep 2023
Safe Reinforcement Learning with Dual Robustness
Zeyang Li
Chuxiong Hu
Yunan Wang
Yujie Yang
Shengbo Eben Li
OffRL
67
8
0
13 Sep 2023
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
Hao Sun
Alihan Huyuk
M. Schaar
OffRL
LRM
98
30
0
13 Sep 2023
Learning topological operations on meshes with application to block decomposition of polygons
Arjun Narayanan
Yulong Pan
Per-Olof Persson
AI4CE
47
2
0
12 Sep 2023
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning
Chenxiao Gao
Chenyang Wu
Mingjun Cao
Rui Kong
Zongzhang Zhang
Yang Yu
OffRL
97
17
0
12 Sep 2023
Revisiting Energy Based Models as Policies: Ranking Noise Contrastive Estimation and Interpolating Energy Models
Sumeet Singh
Stephen Tu
Vikas Sindhwani
DiffM
113
8
0
11 Sep 2023
Signal Temporal Logic Neural Predictive Control
Yue Meng
Chuchu Fan
60
16
0
10 Sep 2023
Hybrid of representation learning and reinforcement learning for dynamic and complex robotic motion planning
Chengmin Zhou
Xin Lu
Jiapeng Dai
Bingding Huang
Xiaoxu Liu
Pasi Fränti
73
2
0
07 Sep 2023
Previous
1
2
3
...
28
29
30
...
81
82
83
Next