Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 1,676 papers shown
Title
Inverse Decision Modeling: Learning Interpretable Representations of Behavior
Daniel Jarrett
Alihan Huyuk
M. Schaar
AI4CE
27
27
0
28 Oct 2023
Learning to bag with a simulation-free reinforcement learning framework for robots
Francisco Munguia-Galeano
Jihong Zhu
Juan David Hernández
Ze Ji
35
0
0
22 Oct 2023
SAI: Solving AI Tasks with Systematic Artificial Intelligence in Communication Network
Lei Yao
Yong Zhang
Zilong Yan
Jialu Tian
31
3
0
13 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
38
34
0
13 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
39
1
0
12 Oct 2023
Bridging the Gap between Newton-Raphson Method and Regularized Policy Iteration
Zeyang Li
Chuxiong Hu
Yunan Wang
Guojian Zhan
Jie Li
Shengbo Eben Li
32
0
0
11 Oct 2023
Boosting Continuous Control with Consistency Policy
Yuhui Chen
Haoran Li
Dongbin Zhao
OffRL
46
20
0
10 Oct 2023
Human-Robot Gym: Benchmarking Reinforcement Learning in Human-Robot Collaboration
Jakob Thumm
Felix Trost
Matthias Althoff
OffRL
44
6
0
09 Oct 2023
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRL
OnRL
33
6
0
09 Oct 2023
Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Xiong-Hui Chen
Junyin Ye
Hang Zhao
Yi-Chen Li
Haoran Shi
...
Si-Hang Yang
Anqi Huang
Kai Xu
Zongzhang Zhang
Yang Yu
35
0
0
09 Oct 2023
Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks
Andrew Starnes
Anton Dereventsov
Clayton Webster
24
0
0
09 Oct 2023
Intelligent DRL-Based Adaptive Region of Interest for Delay-sensitive Telemedicine Applications
Abdulrahman Soliman
Amr M. Mohamed
Elias Yaacoub
Nikhil V. Navkar
A. Erbad
16
2
0
08 Oct 2023
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
Tuomas Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
53
48
0
06 Oct 2023
RTDK-BO: High Dimensional Bayesian Optimization with Reinforced Transformer Deep kernels
Alexander Shmakov
Avisek Naug
Vineet Gundecha
Sahand Ghorbanpour
Ricardo Luna Gutierrez
Ashwin Ramesh Babu
Antonio Guillen-Perez
Soumyendu Sarkar
39
11
0
05 Oct 2023
LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework
Woojun Kim
Jeonghye Kim
Young-Jin Sung
28
5
0
05 Oct 2023
Multi-Agent Reinforcement Learning for Power Grid Topology Optimization
E. V. D. Sar
Alessandro Zocca
Sandjai Bhulai
37
0
0
04 Oct 2023
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
Hao Sha
Yao Mu
Yuxuan Jiang
Li Chen
Chenfeng Xu
Ping Luo
Shengbo Eben Li
Masayoshi Tomizuka
Wei Zhan
Mingyu Ding
143
165
0
04 Oct 2023
A General Offline Reinforcement Learning Framework for Interactive Recommendation
Teng Xiao
Donglin Wang
OffRL
44
73
0
01 Oct 2023
On Generating Explanations for Reinforcement Learning Policies: An Empirical Study
Mikihisa Yuasa
Huy T. Tran
R. Sreenivas
FAtt
LRM
59
1
0
29 Sep 2023
Intrinsic Language-Guided Exploration for Complex Long-Horizon Robotic Manipulation Tasks
Wenke Huang
Filippos Christianos
Zhibin Li
49
8
0
28 Sep 2023
Task-Oriented Koopman-Based Control with Contrastive Encoder
Xubo Lyu
Hanyang Hu
Seth Siriya
Ye Pu
Mo Chen
36
6
0
28 Sep 2023
V2X-Lead: LiDAR-based End-to-End Autonomous Driving with Vehicle-to-Everything Communication Integration
Zhi-Guo Deng
Yanjun Shi
Weiming Shen
45
0
0
26 Sep 2023
Zero-Shot Reinforcement Learning from Low Quality Data
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
OnRL
46
1
0
26 Sep 2023
Effective Multi-Agent Deep Reinforcement Learning Control with Relative Entropy Regularization
Chenyang Miao
Yunduan Cui
Huiyun Li
Xin Wu
31
5
0
26 Sep 2023
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning
Lukas Schneider
Jonas Frey
Takahiro Miki
Marco Hutter
37
9
0
25 Sep 2023
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
29
17
0
22 Sep 2023
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
Haoyi Niu
Tianying Ji
Bingqi Liu
Haocheng Zhao
Xiangyu Zhu
Jianying Zheng
Pengfei Huang
Guyue Zhou
Jianming Hu
Xianyuan Zhan
OffRL
OnRL
AI4CE
34
7
0
22 Sep 2023
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control
Botian Xu
Feng Gao
Chao Yu
Chao Yu
Yi Wu
Yu Wang
41
28
0
22 Sep 2023
Learning to Recover for Safe Reinforcement Learning
Haoyu Wang
Xin Yuan
Qinqing Ren
39
0
0
21 Sep 2023
Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling
Wenjun Huang
Yunduan Cui
Huiyun Li
Xin Wu
MU
27
0
0
20 Sep 2023
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning
Chenxiao Gao
Chenyang Wu
Mingjun Cao
Rui Kong
Zongzhang Zhang
Yang Yu
OffRL
41
13
0
12 Sep 2023
Signal Temporal Logic Neural Predictive Control
Yue Meng
Chuchu Fan
34
15
0
10 Sep 2023
Addressing imperfect symmetry: A novel symmetry-learning actor-critic extension
Miguel Abreu
Luis Paulo Reis
N. Lau
41
6
0
06 Sep 2023
Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
Shyam Sundhar Ramesh
Pier Giuseppe Sessa
Yifan Hu
Andreas Krause
Ilija Bogunovic
OOD
47
10
0
05 Sep 2023
Leveraging Reward Consistency for Interpretable Feature Discovery in Reinforcement Learning
Qisen Yang
Huanqian Wang
Mukun Tong
Wenjie Shi
Gao Huang
Shiji Song
40
5
0
04 Sep 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRL
OnRL
35
8
0
04 Sep 2023
RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior Predictability
Chuning Zhu
Max Simchowitz
Siri Gadipudi
Abhishek Gupta
51
13
0
31 Aug 2023
D-VAT: End-to-End Visual Active Tracking for Micro Aerial Vehicles
Alberto Dionigi
Simone Felicioni
Mirko Leomanni
G. Costante
29
9
0
31 Aug 2023
Foundational Policy Acquisition via Multitask Learning for Motor Skill Generation
Satoshi Yamamori
Jun Morimoto
28
0
0
31 Aug 2023
Adversarial Style Transfer for Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
34
4
0
29 Aug 2023
R^3: On-device Real-Time Deep Reinforcement Learning for Autonomous Robotics
Zexin Li
Aritra Samanta
Yufei Li
Andrea Soltoggio
Hyoseung Kim
Cong Liu
39
6
0
29 Aug 2023
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
53
10
0
28 Aug 2023
FoX: Formation-aware exploration in multi-agent reinforcement learning
Yonghyeon Jo
Sunwoo Lee
Junghyuk Yum
Seungyul Han
35
5
0
22 Aug 2023
Impression-Aware Recommender Systems
F. B. P. Maurera
Maurizio Ferrari Dacrema
P. Castells
Paolo Cremonesi
AI4TS
47
2
0
15 Aug 2023
Hierarchical generative modelling for autonomous robots
Kai Yuan
Noor Sajid
Karl J. Friston
Zhibin Li
31
12
0
15 Aug 2023
RL-based Variable Horizon Model Predictive Control of Multi-Robot Systems using Versatile On-Demand Collision Avoidance
Shreyash Gupta
Abhinav Kumar
N. S. Tripathy
S. Shah
24
0
0
14 Aug 2023
IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse
Siyuan Li
Haoyang Li
Jin Zhang
Zhen Wang
Peng Liu
Chongjie Zhang
OffRL
33
1
0
14 Aug 2023
Synthesizing Programmatic Policies with Actor-Critic Algorithms and ReLU Networks
S. Orfanos
Levi H. S. Lelis
27
6
0
04 Aug 2023
Learning to Shape by Grinding: Cutting-surface-aware Model-based Reinforcement Learning
Takumi Hachimine
Jun Morimoto
Takamitsu Matsubara
24
5
0
04 Aug 2023
Improving Generalization in Visual Reinforcement Learning via Conflict-aware Gradient Agreement Augmentation
Siao Liu
Zhaoyu Chen
Yang Liu
Yuzheng Wang
Dingkang Yang
...
Ziqing Zhou
Xie Yi
Wei Li
Wenqiang Zhang
Zhongxue Gan
46
22
0
02 Aug 2023
Previous
1
2
3
...
8
9
10
...
32
33
34
Next