ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.01290
  4. Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
v1v2 (latest)

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
ArXiv (abs)PDFHTML

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 4,130 papers shown
Title
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
Ghada Sokar
Rishabh Agarwal
Pablo Samuel Castro
Utku Evci
CLL
126
100
0
24 Feb 2023
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function
  Approximation
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation
Thanh Nguyen-Tang
R. Arora
OffRL
100
5
0
24 Feb 2023
Neural Laplace Control for Continuous-time Delayed Systems
Neural Laplace Control for Continuous-time Delayed Systems
Samuel Holt
Alihan Huyuk
Zhaozhi Qian
Hao Sun
M. Schaar
OffRL
107
10
0
24 Feb 2023
Why Target Networks Stabilise Temporal Difference Methods
Why Target Networks Stabilise Temporal Difference Methods
Matt Fellows
Matthew Smith
Shimon Whiteson
OODAAML
105
8
0
24 Feb 2023
Model-Based Uncertainty in Value Functions
Model-Based Uncertainty in Value Functions
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
115
15
0
24 Feb 2023
To the Noise and Back: Diffusion for Shared Autonomy
To the Noise and Back: Diffusion for Shared Autonomy
Takuma Yoneda
Luzhe Sun
Ge Yang
Bradly C. Stadie
Matthew R. Walter
DiffM
93
29
0
23 Feb 2023
Diverse Policy Optimization for Structured Action Space
Diverse Policy Optimization for Structured Action Space
Wenhao Li
Baoxiang Wang
Shanchao Yang
H. Zha
OffRL
75
1
0
23 Feb 2023
RoboNinja: Learning an Adaptive Cutting Policy for Multi-Material
  Objects
RoboNinja: Learning an Adaptive Cutting Policy for Multi-Material Objects
Zhenjia Xu
Zhou Xian
Xingyu Lin
Cheng Chi
Zhiao Huang
Chuang Gan
Shuran Song
79
29
0
22 Feb 2023
A Supervisory Learning Control Framework for Autonomous & Real-time Task
  Planning for an Underactuated Cooperative Robotic task
A Supervisory Learning Control Framework for Autonomous & Real-time Task Planning for an Underactuated Cooperative Robotic task
Sander De Witte
Tom Lefebvre
Thijs Van Hauwermeiren
Guillaume Crevecoeur
103
0
0
22 Feb 2023
Learning Agile Flights through Narrow Gaps with Varying Angles using
  Onboard Sensing
Learning Agile Flights through Narrow Gaps with Varying Angles using Onboard Sensing
Yuhan Xie
Minghao Lu
Rui Peng
Peng Lu
81
11
0
22 Feb 2023
Reinforcement Learning for Block Decomposition of CAD Models
Reinforcement Learning for Block Decomposition of CAD Models
Benjamin C. DiPrete
R. Garimella
Cristina Garcia-Cardona
Navamita Ray
66
1
0
21 Feb 2023
Adversarial Model for Offline Reinforcement Learning
Adversarial Model for Offline Reinforcement Learning
M. Bhardwaj
Tengyang Xie
Byron Boots
Nan Jiang
Ching-An Cheng
AAMLOffRL
104
29
0
21 Feb 2023
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management
Dhawal Gupta
Yinlam Chow
Aza Tulepbergenov
Mohammad Ghavamzadeh
Craig Boutilier
OffRL
67
3
0
21 Feb 2023
Learning to Play Text-based Adventure Games with Maximum Entropy
  Reinforcement Learning
Learning to Play Text-based Adventure Games with Maximum Entropy Reinforcement Learning
Weichen Li
R. Devidze
Sophie Fellenz
134
3
0
21 Feb 2023
Improving Deep Policy Gradients with Value Function Search
Improving Deep Policy Gradients with Value Function Search
Enrico Marchesini
Chris Amato
57
9
0
20 Feb 2023
Differentiable Arbitrating in Zero-sum Markov Games
Differentiable Arbitrating in Zero-sum Markov Games
Jing Wang
Meichen Song
Feng Gao
Boyi Liu
Zhaoran Wang
Yi Wu
118
2
0
20 Feb 2023
Demonstration-Guided Reinforcement Learning with Efficient Exploration
  for Task Automation of Surgical Robot
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
Tao Huang
Kai-xiang Chen
Bin Li
Yunhui Liu
Qingxu Dou
80
23
0
20 Feb 2023
Leveraging Prior Knowledge in Reinforcement Learning via Double-Sided
  Bounds on the Value Function
Leveraging Prior Knowledge in Reinforcement Learning via Double-Sided Bounds on the Value Function
Jacob Adamczyk
Stas Tiomkin
R. Kulkarni
OffRL
41
0
0
19 Feb 2023
Generalization in Visual Reinforcement Learning with the Reward Sequence
  Distribution
Generalization in Visual Reinforcement Learning with the Reward Sequence Distribution
Jie Wang
Rui Yang
Zijie Geng
Zhihao Shi
Mingxuan Ye
Qi Zhou
Shuiwang Ji
Bin Li
Yongdong Zhang
Feng Wu
87
6
0
19 Feb 2023
Stochastic Generative Flow Networks
Stochastic Generative Flow Networks
L. Pan
Dinghuai Zhang
Moksh Jain
Longbo Huang
Yoshua Bengio
BDL
129
34
0
19 Feb 2023
Efficient Exploration via Epistemic-Risk-Seeking Policy Optimization
Efficient Exploration via Epistemic-Risk-Seeking Policy Optimization
Brendan O'Donoghue
OffRL
108
7
0
18 Feb 2023
Reinforcement Learning in the Wild with Maximum Likelihood-based Model
  Transfer
Reinforcement Learning in the Wild with Maximum Likelihood-based Model Transfer
Hannes Eriksson
D. Basu
Tommy Tram
Mina Alibeigi
Christos Dimitrakakis
57
1
0
18 Feb 2023
Dual RL: Unification and New Methods for Reinforcement and Imitation
  Learning
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
Harshit S. Sikchi
Qinqing Zheng
Amy Zhang
S. Niekum
OffRL
103
29
0
16 Feb 2023
Meta-Reinforcement Learning via Exploratory Task Clustering
Meta-Reinforcement Learning via Exploratory Task Clustering
Zhendong Chu
Hongning Wang
OffRL
88
7
0
15 Feb 2023
When Demonstrations Meet Generative World Models: A Maximum Likelihood
  Framework for Offline Inverse Reinforcement Learning
When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning
Siliang Zeng
Chenliang Li
Alfredo García
Min-Fong Hong
OffRL
87
15
0
15 Feb 2023
Constrained Decision Transformer for Offline Safe Reinforcement Learning
Constrained Decision Transformer for Offline Safe Reinforcement Learning
Zuxin Liu
Zijian Guo
Yi-Fan Yao
Zhepeng Cen
Wenhao Yu
Tingnan Zhang
Ding Zhao
OffRL
96
52
0
14 Feb 2023
Learning a model is paramount for sample efficiency in reinforcement
  learning control of PDEs
Learning a model is paramount for sample efficiency in reinforcement learning control of PDEs
Stefan Werner
Sebastian Peitz
119
9
0
14 Feb 2023
GFlowNet-EM for learning compositional latent variable models
GFlowNet-EM for learning compositional latent variable models
J. E. Hu
Nikolay Malkin
Moksh Jain
Katie Everett
Alexandros Graikos
Yoshua Bengio
CoGe
103
41
0
13 Feb 2023
Automatic Noise Filtering with Dynamic Sparse Training in Deep
  Reinforcement Learning
Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning
Bram Grooten
Ghada Sokar
Shibhansh Dohare
Elena Mocanu
Matthew E. Taylor
Mykola Pechenizkiy
Decebal Constantin Mocanu
66
14
0
13 Feb 2023
Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning
Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning
Yunke Wang
Bo Du
Chang Xu
85
9
0
13 Feb 2023
Order Matters: Agent-by-agent Policy Optimization
Order Matters: Agent-by-agent Policy Optimization
Xihuai Wang
Zheng Tian
Bo Liu
Ying Wen
Jun Wang
Weinan Zhang
85
29
0
13 Feb 2023
Improving robot navigation in crowded environments using intrinsic rewards
Improving robot navigation in crowded environments using intrinsic rewards
Diego Martínez Baselga
L. Riazuelo
Luis Montano
120
14
0
13 Feb 2023
Robust Representation Learning by Clustering with Bisimulation Metrics
  for Visual Reinforcement Learning with Distractions
Robust Representation Learning by Clustering with Bisimulation Metrics for Visual Reinforcement Learning with Distractions
Qiyuan Liu
Qi Zhou
Rui Yang
Jie Wang
OffRLOOD
525
15
0
12 Feb 2023
MANSA: Learning Fast and Slow in Multi-Agent Systems
MANSA: Learning Fast and Slow in Multi-Agent Systems
D. Mguni
Hao Chen
Taher Jafferjee
Jianhong Wang
Long Fei
Xidong Feng
Stephen Marcus McAleer
Feifei Tong
Jun Wang
Yaodong Yang
72
2
0
12 Feb 2023
Distributional GFlowNets with Quantile Flows
Distributional GFlowNets with Quantile Flows
Dinghuai Zhang
L. Pan
Ricky T. Q. Chen
Aaron Courville
Yoshua Bengio
99
28
0
11 Feb 2023
Verifying Generalization in Deep Learning
Verifying Generalization in Deep Learning
Guy Amir
Osher Maayan
Tom Zelazny
Guy Katz
Michael Schapira
AAMLAI4CE
81
15
0
11 Feb 2023
Cross-domain Random Pre-training with Prototypes for Reinforcement Learning
Cross-domain Random Pre-training with Prototypes for Reinforcement Learning
Xin Liu
Yaran Chen
Haoran Li
Boyu Li
Dong Zhao
SSL
133
10
0
11 Feb 2023
Towards Minimax Optimality of Model-based Robust Reinforcement Learning
Towards Minimax Optimality of Model-based Robust Reinforcement Learning
Pierre Clavier
E. L. Pennec
Matthieu Geist
107
14
0
10 Feb 2023
A Survey on Causal Reinforcement Learning
A Survey on Causal Reinforcement Learning
Yan Zeng
Ruichu Cai
Gang Hua
Libo Huang
Zijian Li
CML
151
30
0
10 Feb 2023
Controllability-Aware Unsupervised Skill Discovery
Controllability-Aware Unsupervised Skill Discovery
Seohong Park
Kimin Lee
Youngwoon Lee
Pieter Abbeel
97
43
0
10 Feb 2023
Scalability Bottlenecks in Multi-Agent Reinforcement Learning Systems
Scalability Bottlenecks in Multi-Agent Reinforcement Learning Systems
Kailash Gogineni
Peng Wei
Tian-Shing Lan
Guru Venkataramani
82
9
0
10 Feb 2023
CLARE: Conservative Model-Based Reward Learning for Offline Inverse
  Reinforcement Learning
CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning
Sheng Yue
Guan-Bo Wang
Wei Shao
Zhaofeng Zhang
Sen Lin
Junkai Ren
Junshan Zhang
OffRL
103
20
0
09 Feb 2023
RayNet: A Simulation Platform for Developing Reinforcement
  Learning-Driven Network Protocols
RayNet: A Simulation Platform for Developing Reinforcement Learning-Driven Network Protocols
Luca Giacomoni
Basil Benny
G. Parisis
75
3
0
09 Feb 2023
Learning Interaction-aware Motion Prediction Model for Decision-making
  in Autonomous Driving
Learning Interaction-aware Motion Prediction Model for Decision-making in Autonomous Driving
Zhiyu Huang
Haochen Liu
Jingda Wu
Wenhui Huang
Chen Lv
79
18
0
08 Feb 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
Predictable MDP Abstraction for Unsupervised Model-Based RL
Seohong Park
Sergey Levine
64
9
0
08 Feb 2023
NeuronsGym: A Hybrid Framework and Benchmark for Robot Tasks with
  Sim2Real Policy Learning
NeuronsGym: A Hybrid Framework and Benchmark for Robot Tasks with Sim2Real Policy Learning
Haoran Li
Shasha Liu
Mingjun Ma
Guangzheng Hu
Yaran Chen
Dong Zhao
95
3
0
07 Feb 2023
Population-size-Aware Policy Optimization for Mean-Field Games
Population-size-Aware Policy Optimization for Mean-Field Games
Pengdeng Li
Xinrun Wang
Shuxin Li
Hau Chan
Bo An
69
2
0
07 Feb 2023
Utility-based Perturbed Gradient Descent: An Optimizer for Continual
  Learning
Utility-based Perturbed Gradient Descent: An Optimizer for Continual Learning
Mohamed Elsayed
A. R. Mahmood
CLL
91
6
0
07 Feb 2023
Robust Subtask Learning for Compositional Generalization
Robust Subtask Learning for Compositional Generalization
Kishor Jothimurugan
Steve Hsu
Osbert Bastani
Rajeev Alur
OffRL
81
5
0
06 Feb 2023
Efficient Online Reinforcement Learning with Offline Data
Efficient Online Reinforcement Learning with Offline Data
Philip J. Ball
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRLOnRL
162
184
0
06 Feb 2023
Previous
123...373839...818283
Next