Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
arXiv: 1801.01290

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 1,719 papers shown
Constrained Reinforcement Learning for Dynamic Material Handling
Chengpeng Hu
Ziming Wang
Jialin Liu
J. Wen
Bifei Mao
Xinghu Yao
24
0
0
23 May 2023
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Toshinori Kitamura
Tadashi Kozuno
Yunhao Tang
Nino Vieillard
Michal Valko
...
Olivier Pietquin
Matthieu Geist
Csaba Szepesvári
Wataru Kumagai
Yutaka Matsuo
OffRL
35
3
0
22 May 2023
Policy Representation via Diffusion Probability Model for Reinforcement Learning
Long Yang
Zhixiong Huang
Fenghao Lei
Yucun Zhong
Yiming Yang
Cong Fang
Shiting Wen
Binbin Zhou
Zhouchen Lin
DiffM
48
40
0
22 May 2023
Road Planning for Slums via Deep Reinforcement Learning
Y. Zheng
Hongyuan Su
Jingtao Ding
Depeng Jin
Yong Li
19
13
0
22 May 2023
Testing of Deep Reinforcement Learning Agents with Surrogate Models
Matteo Biagiola
Paolo Tonella
53
19
0
22 May 2023
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
Yecheng Jason Ma
K. Sivakumar
Jason Yan
Osbert Bastani
Dinesh Jayaraman
OffRL
MU
43
6
0
22 May 2023
Deep Reinforcement Learning-Based Control for Stomach Coverage Scanning of Wireless Capsule Endoscopy
Yameng Zhang
Long Bai
Li Liu
Hongliang Ren
Max Q.-H. Meng
34
9
0
18 May 2023
Black-Box Targeted Reward Poisoning Attack Against Online Deep Reinforcement Learning
Yinglun Xu
Gagandeep Singh
OffRL
AAML
39
3
0
18 May 2023
Wasserstein Gradient Flows for Optimizing Gaussian Mixture Policies
Hanna Ziesche
Leonel Rozo
31
5
0
17 May 2023
Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum
Jigang Kim
Daesol Cho
H. J. Kim
31
3
0
17 May 2023
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Denis Tarasov
Vladislav Kurenkov
Alexander Nikulin
Sergey Kolesnikov
OffRL
42
37
0
16 May 2023
Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
Desong Du
Shao-Fu Han
Naiming Qi
Haitham Bou-Ammar
Jun Wang
Wei Pan
44
15
0
16 May 2023
Prompt-Tuning Decision Transformer with Preference Ranking
Shengchao Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
43
14
0
16 May 2023
What Matters in Reinforcement Learning for Tractography
Antoine Théberge
Christian Desrosiers
Maxime Descoteaux
Pierre-Marc Jodoin
OffRL
29
2
0
15 May 2023
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Prakash Panangaden
S. Rezaei-Shoshtari
Rosie Zhao
David Meger
Doina Precup
42
2
0
09 May 2023
Autonomous Navigation for Robot-assisted Intraluminal and Endovascular Procedures: A Systematic Review
Ameya Pore
Zhen Li
Diego Dall'Alba
A. Hernansanz
Elena De Momi
A. Menciassi
Alicia Casals Gelpí
J. Dankelman
Paolo Fiorini
E. V. Poorten
29
29
0
06 May 2023
Bayesian Reinforcement Learning with Limited Cognitive Load
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
OffRL
42
8
0
05 May 2023
Multi-Task Multi-Behavior MAP-Elites
Timothée Anne
Jean-Baptiste Mouret
MoE
39
7
0
02 May 2023
Learning to Extrapolate: A Transductive Approach
Aviv Netanyahu
Abhishek Gupta
Max Simchowitz
Kai Zhang
Pulkit Agrawal
56
15
0
27 Apr 2023
Zero-shot Transfer Learning of Driving Policy via Socially Adversarial Traffic Flow
Dongkun Zhang
Jintao Xue
Yuxiang Cui
Yunkai Wang
Eryun Liu
Wei Jing
Junbo Chen
R. Xiong
Yue Wang
44
0
0
25 Apr 2023
Hierarchical State Abstraction Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Angsheng Li
Chunyang Liu
Lifang He
Philip S. Yu
36
19
0
24 Apr 2023
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Qiyang Li
Aviral Kumar
Ilya Kostrikov
Sergey Levine
OffRL
52
32
0
20 Apr 2023
Aiding reinforcement learning for set point control
Ruoqing Zhang
Per Mattsson
T. Wigren
21
3
0
20 Apr 2023
Evolving Constrained Reinforcement Learning Policy
Chengpeng Hu
Jiyuan Pei
Jialin Liu
Xinghu Yao
18
1
0
19 Apr 2023
Reclaimer: A Reinforcement Learning Approach to Dynamic Resource Allocation for Cloud Microservices
Quintin Fettes
Avinash Karanth
Razvan Bunescu
Brandon Beckwith
S. Subramoney
32
3
0
17 Apr 2023
Model Predictive Control with Self-supervised Representation Learning
Jonas A. Matthies
Muhammad Burhan Hafez
Mostafa Kotb
S. Wermter
SSL
18
0
0
14 Apr 2023
Stochastic Nonlinear Control via Finite-dimensional Spectral Dynamic Embedding
Tongzheng Ren
Zhaolin Ren
Haitong Ma
Na Li
Bo Dai
41
10
0
08 Apr 2023
Learning Robot Manipulation from Cross-Morphology Demonstration
G. Salhotra
Isabella Liu
Gaurav Sukhatme
LM&Ro
27
9
0
07 Apr 2023
CRISP: Curriculum inducing Primitive Informed Subgoal Prediction
Utsav Singh
Vinay P. Namboodiri
41
3
0
07 Apr 2023
Generative Adversarial Neuroevolution for Control Behaviour Imitation
Maximilien Le Clei
Pierre C. Bellec
21
0
0
03 Apr 2023
UniDexGrasp++: Improving Dexterous Grasping Policy Learning via Geometry-aware Curriculum and Iterative Generalist-Specialist Learning
Weikang Wan
Haoran Geng
Yun-Hai Liu
Zikang Shan
Yaodong Yang
Li Yi
He Wang
63
95
0
02 Apr 2023
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Anqi Li
Byron Boots
Ching-An Cheng
OffRL
52
16
0
30 Mar 2023
BC-IRL: Learning Generalizable Reward Functions from Demonstrations
Andrew Szot
Amy Zhang
Dhruv Batra
Z. Kira
Franziska Meier
OOD
OffRL
50
8
0
28 Mar 2023
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Haoran Xu
Li Jiang
Jianxiong Li
Zhuoran Yang
Zhaoran Wang
Victor Chan
Xianyuan Zhan
OffRL
41
73
0
28 Mar 2023
Multi-Task Reinforcement Learning in Continuous Control with Successor Feature-Based Concurrent Composition
Y. Liu
Aamir Ahmad
34
4
0
24 Mar 2023
Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Stephanie Milani
Anssi Kanervisto
Karolis Ramanauskas
Sander Schulhoff
Brandon Houghton
...
Vinicius G. Goecks
Nicholas R. Waytowich
David Watkins
J. Miller
Rohin Shah
37
16
0
23 Mar 2023
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
François Ged
M. H. Veiga
41
0
0
22 Mar 2023
SACPlanner: Real-World Collision Avoidance with a Soft Actor Critic Local Planner and Polar State Representations
Khaled Nakhleh
Minahil Raza
Mack Tang
M. Andrews
Rinu Boney
I. Hadžić
Jeongran Lee
Atefeh Mohajeri
Karina Palyutina
25
4
0
21 Mar 2023
Hybrid Systems Neural Control with Region-of-Attraction Planner
Yue Meng
Chuchu Fan
52
1
0
18 Mar 2023
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
Han Zheng
Xufang Luo
Pengfei Wei
Xuan Song
Dongsheng Li
Jing Jiang
OffRL
OnRL
18
21
0
14 Mar 2023
Reinforcement Learning-based Wavefront Sensorless Adaptive Optics Approaches for Satellite-to-Ground Laser Communication
Payam Parvizi
Runnan Zou
C. Bellinger
R. Cheriton
D. Spinello
22
2
0
13 Mar 2023
Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization
E. Derman
Yevgeniy Men
Matthieu Geist
Shie Mannor
50
1
0
12 Mar 2023
Inference on Optimal Dynamic Policies via Softmax Approximation
Qizhao Chen
Morgane Austern
Vasilis Syrgkanis
OffRL
48
1
0
08 Mar 2023
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Taisuke Kobayashi
58
3
0
08 Mar 2023
A Multiplicative Value Function for Safe and Efficient Reinforcement Learning
Nick Bührer
Zhejun Zhang
Alexander Liniger
Feng Yu
Luc Van Gool
33
1
0
07 Mar 2023
Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning
Pengqin Wang
Meixin Zhu
Shaojie Shen
OffRL
38
1
0
07 Mar 2023
Efficient Skill Acquisition for Complex Manipulation Tasks in Obstructed Environments
Jun Yamada
J. Collins
Ingmar Posner
55
8
0
06 Mar 2023
Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation
Wenyan Yang
A. Angleraud
R. Pieters
Joni Pajarinen
Joni-Kristian Kämäräinen
55
6
0
05 Mar 2023
Virtual Guidance as a Mid-level Representation for Navigation with Augmented Reality
Hsuan-Kung Yang
Tsung-Chih Chiang
Tingxin Liu
Chun-Wei Huang
Jou-Min Liu
Tsu-Ching Hsiao
Chun-Yi Lee
39
1
0
05 Mar 2023
Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control
Amarildo Likmeta
Matteo Sacco
Alberto Maria Metelli
Marcello Restelli
OffRL
31
3
0
04 Mar 2023