ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.01290
  4. Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
v1v2 (latest)

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
ArXiv (abs)PDFHTML

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 4,130 papers shown
Title
Robust Contact-rich Manipulation through Implicit Motor Adaptation
Robust Contact-rich Manipulation through Implicit Motor Adaptation
Teng Xue
Amirreza Razmjoo
Suhan Shetty
Sylvain Calinon
193
1
0
16 Dec 2024
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
186
1
0
16 Dec 2024
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted
  Behaviors
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors
Fengshuo Bai
Runze Liu
Yali Du
Ying Wen
Yaodong Yang
AAML
130
5
0
14 Dec 2024
Advances in Transformers for Robotic Applications: A Review
Advances in Transformers for Robotic Applications: A Review
Nikunj Sanghai
Nik Bear Brown
AI4CE
150
0
0
13 Dec 2024
Distributional Reinforcement Learning based Integrated Decision Making
  and Control for Autonomous Surface Vehicles
Distributional Reinforcement Learning based Integrated Decision Making and Control for Autonomous Surface Vehicles
Xi Lin
Paul Szenher
Yewei Huang
Brendan Englot
113
1
0
12 Dec 2024
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
Zhen Liu
Tim Z. Xiao
Weiyang Liu
Yoshua Bengio
Dinghuai Zhang
283
6
0
10 Dec 2024
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class
  and Backbone
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
Max Sobol Mark
Tian Gao
Georgia Gabriela Sampaio
Mohan Kumar Srirama
Archit Sharma
Chelsea Finn
Aviral Kumar
OffRLOnRL
192
10
0
09 Dec 2024
ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks
ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks
Arth Shukla
Stone Tao
Hao Su
198
6
0
09 Dec 2024
Learning Speed-Adaptive Walking Agent Using Imitation Learning with
  Physics-Informed Simulation
Learning Speed-Adaptive Walking Agent Using Imitation Learning with Physics-Informed Simulation
Yi-Hung Chiu
Ung Hee Lee
Changseob Song
Manaen Hu
Inseung Kang
AI4CE
76
1
0
05 Dec 2024
Inverse Delayed Reinforcement Learning
Inverse Delayed Reinforcement Learning
S. Zhan
Qingyuan Wu
Zhian Ruan
Frank Yang
Philip Wang
Yixuan Wang
Ruochen Jiao
Chao Huang
Qi Zhu
146
0
0
04 Dec 2024
MEP-Net: Generating Solutions to Scientific Problems with Limited
  Knowledge by Maximum Entropy Principle
MEP-Net: Generating Solutions to Scientific Problems with Limited Knowledge by Maximum Entropy Principle
Wuyue Yang
Liangrong Peng
Guojie Li
L. Hong
72
0
0
03 Dec 2024
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Cevahir Köprülü
Po-han Li
Tianyu Qiu
Ruihan Zhao
T. Westenbroek
David Fridovich-Keil
Sandeep Chinchali
Ufuk Topcu
OffRL
144
0
0
02 Dec 2024
RoboHanger: Learning Generalizable Robotic Hanger Insertion for Diverse Garments
RoboHanger: Learning Generalizable Robotic Hanger Insertion for Diverse Garments
Yuxing Chen
Songlin Wei
Bowen Xiao
Jiangran Lyu
Jiayi Chen
Feng Zhu
Hongan Wang
143
0
0
02 Dec 2024
A Cross-Scene Benchmark for Open-World Drone Active Tracking
A Cross-Scene Benchmark for Open-World Drone Active Tracking
Haowei Sun
Jinwu Hu
Zhirui Zhang
Haoyuan Tian
Xinze Xie
Yufeng Wang
Zhuliang Yu
Xiaohua Xie
Mingkui Tan
139
0
0
01 Dec 2024
Supervised Learning-enhanced Multi-Group Actor Critic for Live Stream Allocation in Feed
Supervised Learning-enhanced Multi-Group Actor Critic for Live Stream Allocation in Feed
Jingxin Liu
Xiang Gao
Yisha Li
Xin Li
Haiyang Lu
Ben Wang
OffRL
130
0
0
28 Nov 2024
Application of Soft Actor-Critic Algorithms in Optimizing Wastewater
  Treatment with Time Delays Integration
Application of Soft Actor-Critic Algorithms in Optimizing Wastewater Treatment with Time Delays Integration
Esmaeel Mohammadi
D. O. Arroyo
A. A. Hansen
Mikkel Stokholm-Bjerregaard
S. Gros
Akhil S. Anand
Petar Durdevic
97
0
0
27 Nov 2024
Monocular Obstacle Avoidance Based on Inverse PPO for Fixed-wing UAVs
Monocular Obstacle Avoidance Based on Inverse PPO for Fixed-wing UAVs
Haochen Chai
Meimei Su
Yang Lyu
Zhunga Liu
Chunhui Zhao
Quan Pan
119
0
0
27 Nov 2024
Object-centric proto-symbolic behavioural reasoning from pixels
Object-centric proto-symbolic behavioural reasoning from pixels
R. S. V. Bergen
Justus F. Hübotter
Pablo Lanillos
LM&RoOCL
214
1
0
26 Nov 2024
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
Shiron Thalagala
Pak Kin Wong
Xiaozheng Wang
Tianang Sun
OffRL
181
0
0
24 Nov 2024
Safe Multi-Agent Reinforcement Learning with Convergence to Generalized
  Nash Equilibrium
Safe Multi-Agent Reinforcement Learning with Convergence to Generalized Nash Equilibrium
Zeyang Li
Navid Azizan
135
1
0
22 Nov 2024
Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward
Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward
Zhiwei Jia
Yuesong Nan
Huixi Zhao
Gengdai Liu
EGVM
203
1
0
22 Nov 2024
Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation
Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation
Huy Le
Miroslav Gabriel
Tai Hoang
Gerhard Neumann
Ngo Anh Vien
188
1
0
22 Nov 2024
Umbrella Reinforcement Learning -- computationally efficient tool for
  hard non-linear problems
Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems
Egor E. Nuzhin
Nikolai V. Brilliantov
103
1
0
21 Nov 2024
ReinFog: A DRL Empowered Framework for Resource Management in Edge and
  Cloud Computing Environments
ReinFog: A DRL Empowered Framework for Resource Management in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
106
1
0
20 Nov 2024
SuPLE: Robot Learning with Lyapunov Rewards
SuPLE: Robot Learning with Lyapunov Rewards
Phu Nguyen
Daniel Polani
Stas Tiomkin
109
0
0
20 Nov 2024
UBSoft: A Simulation Platform for Robotic Skill Learning in Unbounded
  Soft Environments
UBSoft: A Simulation Platform for Robotic Skill Learning in Unbounded Soft Environments
Chunru Lin
Jugang Fan
Yian Wang
Zeyuan Yang
Zhehuan Chen
Lixing Fang
Tsun-Hsuan Wang
Zhou Xian
Chuang Gan
128
2
0
19 Nov 2024
SkillTree: Explainable Skill-Based Deep Reinforcement Learning for
  Long-Horizon Control Tasks
SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks
Yongyan Wen
Siyuan Li
Rongchang Zuo
Lei Yuan
Hangyu Mao
P. Liu
140
0
0
19 Nov 2024
A Pre-Trained Graph-Based Model for Adaptive Sequencing of Educational Documents
Jean Vassoyan
Anan Schütt
Jill-Jênn Vie
Arun-Balajiee Lekshmi-Narayanan
Elisabeth André
Nicolas Vayatis
AI4Ed
127
0
0
18 Nov 2024
Continual Task Learning through Adaptive Policy Self-Composition
Shengchao Hu
Yuhang Zhou
Ziqing Fan
Jifeng Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
130
0
0
18 Nov 2024
Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation
Zhihong Liu
Long Qian
Zeyang Liu
Lipeng Wan
Xingyu Chen
Xuguang Lan
OffRL
151
3
0
18 Nov 2024
Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
Feng Chen
Fuguang Han
Cong Guan
Lei Yuan
Zhilong Zhang
Yang Yu
Zongzhang Zhang
98
1
0
16 Nov 2024
Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward
  Augmented Imitation
Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation
Yihong Guo
Yixuan Wang
Yuanyuan Shi
Pan Xu
Anqi Liu
77
5
0
15 Nov 2024
Precision-Focused Reinforcement Learning Model for Robotic Object
  Pushing
Precision-Focused Reinforcement Learning Model for Robotic Object Pushing
Lara Bergmann
David P. Leins
R. Haschke
Klaus Neumann
95
3
0
13 Nov 2024
Robotic Control Optimization Through Kernel Selection in Safe Bayesian
  Optimization
Robotic Control Optimization Through Kernel Selection in Safe Bayesian Optimization
Lihao Zheng
Hongxuan Wang
Xiaocong Li
Jun Ma
P. Vadakkepat
48
1
0
12 Nov 2024
Éxplaining RL Decisions with Trajectories': A Reproducibility Study
Éxplaining RL Decisions with Trajectories': A Reproducibility Study
Karim Abdel Sadek
Matteo Nulli
Joan Velja
Jort Vincenti
59
0
0
11 Nov 2024
OCMDP: Observation-Constrained Markov Decision Process
OCMDP: Observation-Constrained Markov Decision Process
Taiyi Wang
Jianheng Liu
Bryan Lee
Zhihao Wu
Yu Wu
126
1
0
11 Nov 2024
GSL-PCD: Improving Generalist-Specialist Learning with Point Cloud
  Feature-based Task Partitioning
GSL-PCD: Improving Generalist-Specialist Learning with Point Cloud Feature-based Task Partitioning
Xiu Yuan
78
0
0
11 Nov 2024
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
A. Jain
Harley Wiltzer
Jesse Farebrother
Irina Rish
Glen Berseth
Sanjiban Choudhury
143
2
0
11 Nov 2024
Grounding Video Models to Actions through Goal Conditioned Exploration
Grounding Video Models to Actions through Goal Conditioned Exploration
Yunhao Luo
Yilun Du
LM&RoVGen
161
5
0
11 Nov 2024
State Chrono Representation for Enhancing Generalization in
  Reinforcement Learning
State Chrono Representation for Enhancing Generalization in Reinforcement Learning
Jianda Chen
Wen Zheng Terence Ng
Zichen Chen
Sinno Jialin Pan
Tianwei Zhang
OffRL
78
0
0
09 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and
  Distributed Computing: A Survey
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
102
6
0
08 Nov 2024
Evaluating Robustness of Reinforcement Learning Algorithms for
  Autonomous Shipping
Evaluating Robustness of Reinforcement Learning Algorithms for Autonomous Shipping
Bavo Lesy
Ali Anwar
Siegfried Mercelis
69
0
0
07 Nov 2024
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
Heyang Zhao
Chenlu Ye
Quanquan Gu
Tong Zhang
OffRL
234
6
0
07 Nov 2024
Non-Stationary Learning of Neural Networks with Automatic Soft Parameter
  Reset
Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset
Alexandre Galashov
Michalis K. Titsias
Andras Gyorgy
Clare Lyle
Razvan Pascanu
Yee Whye Teh
Maneesh Sahani
107
4
0
06 Nov 2024
Risk-sensitive control as inference with Rényi divergence
Risk-sensitive control as inference with Rényi divergence
Kaito Ito
Kenji Kashima
71
1
0
04 Nov 2024
Show, Don't Tell: Learning Reward Machines from Demonstrations for
  Reinforcement Learning-Based Cardiac Pacemaker Synthesis
Show, Don't Tell: Learning Reward Machines from Demonstrations for Reinforcement Learning-Based Cardiac Pacemaker Synthesis
John Komp
Dananjay Srinivas
Maria Leonor Pacheco
Ashutosh Trivedi
AI4TS
56
0
0
04 Nov 2024
Diversity Progress for Goal Selection in Discriminability-Motivated RL
Diversity Progress for Goal Selection in Discriminability-Motivated RL
Erik M. Lintunen
Nadia M. Ady
Christian Guckelsberger
84
1
0
03 Nov 2024
Task-Aware Harmony Multi-Task Decision Transformer for Offline
  Reinforcement Learning
Task-Aware Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
Ziqing Fan
Shengchao Hu
Yuhang Zhou
Li Shen
Ya Zhang
Yanfeng Wang
Dacheng Tao
OffRL
117
0
0
02 Nov 2024
Provably and Practically Efficient Adversarial Imitation Learning with
  General Function Approximation
Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Tian Xu
Zhilong Zhang
Ruishuo Chen
Yihao Sun
Yang Yu
88
1
0
01 Nov 2024
Uncertainty-based Offline Variational Bayesian Reinforcement Learning
  for Robustness under Diverse Data Corruptions
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions
Rui Yang
Jie Wang
Guoping Wu
Yangqiu Song
AAMLOffRL
145
3
0
01 Nov 2024
Previous
123...8910...818283
Next