ResearchTrend.AI

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 4,130 papers shown
Title
RLIF: Interactive Imitation Learning as Reinforcement Learning
Jianlan Luo
Perry Dong
Yuexiang Zhai
Yi-An Ma
Sergey Levine
OffRL
138
18
0
21 Nov 2023
Multi-Objective Reinforcement Learning Based on Decomposition: A Taxonomy and Framework
Florian Felten
El-Ghazali Talbi
Grégoire Danoy
76
17
0
21 Nov 2023
Random Linear Projections Loss for Hyperplane-Based Optimization in Neural Networks
Shyam Venkatasubramanian
Ahmed Aloui
Vahid Tarokh
119
0
0
21 Nov 2023
Resilient Control of Networked Microgrids using Vertical Federated Reinforcement Learning: Designs and Real-Time Test-Bed Validations
Sayak Mukherjee
Ramij-Raja Hossain
Sheik M. Mohiuddin
Yuan Liu
Wei Du
Veronica Adetola
Rohit A Jinsiwale
Qiuhua Huang
Tianzhixi Yin
Ankit Singhal
71
4
0
21 Nov 2023
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Zhaolin Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
104
4
0
20 Nov 2023
Replay-enhanced Continual Reinforcement Learning
Tiantian Zhang
Kevin Zehua Shen
Zichuan Lin
Bo Yuan
Xueqian Wang
Xiu Li
Deheng Ye
CLL, OffRL
81
7
0
20 Nov 2023
Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts
Ahmed Hendawy
Jan Peters
Carlo D'Eramo
MoE
82
19
0
19 Nov 2023
3D Guidewire Shape Reconstruction from Monoplane Fluoroscopic Images
Tudor Jianu
Baoru Huang
Pierre Berthet-Rayne
S. Fichera
Anh Nguyen
67
1
0
19 Nov 2023
Decentralized Energy Marketplace via NFTs and AI-based Agents
Rasoul Nikbakht
Farhana Javed
Farhad Rezazadeh
N. Bartzoudis
J. Mangues-Bafalluy
40
1
0
17 Nov 2023
Imagination-Augmented Hierarchical Reinforcement Learning for Safe and Interactive Autonomous Driving in Urban Environments
Sang-Hyun Lee
Yoonjae Jung
Seung-Woo Seo
62
2
0
17 Nov 2023
Interpretable Reinforcement Learning for Robotics and Continuous Control
Rohan R. Paleja
Letian Chen
Yaru Niu
Andrew Silva
Zhaoxin Li
...
K. Chang
H. E. Tseng
Yan Wang
S. Nageshrao
Matthew C. Gombolay
82
7
0
16 Nov 2023
Augmenting Unsupervised Reinforcement Learning with Self-Reference
Andrew Zhao
Erle Zhu
Rui Lu
Matthieu Lin
Yong-Jin Liu
Gao Huang
SSL
87
1
0
16 Nov 2023
A Software-Hardware Co-Optimized Toolkit for Deep Reinforcement Learning on Heterogeneous Platforms
Yuan Meng
Michael Kinsner
Deshanand Singh
Mahesh Iyer
Viktor Prasanna
46
2
0
15 Nov 2023
Self-Supervised Curriculum Generation for Autonomous Reinforcement Learning without Task-Specific Knowledge
Sang-Hyun Lee
Seung-Woo Seo
ODL, CLL, SSL
77
3
0
15 Nov 2023
Supported Trust Region Optimization for Offline Reinforcement Learning
Yongyi Mao
Hongchang Zhang
Chong Chen
Yi Tian Xu
Xiangyang Ji
OffRL
114
17
0
15 Nov 2023
Fairness-Driven Optimization of RIS-Augmented 5G Networks for Seamless 3D UAV Connectivity Using DRL Algorithms
Yu Tian
Ahmed Alhammadi
Jiguang He
Aymen Fakhreddine
Faouzi Bader
80
0
0
14 Nov 2023
A Central Motor System Inspired Pre-training Reinforcement Learning for Robotic Control
Pei Zhang
Zhaobo Hua
Jinliang Ding
80
0
0
14 Nov 2023
Data-Efficient Task Generalization via Probabilistic Model-based Meta Reinforcement Learning
Arjun Bhardwaj
Jonas Rothfuss
Bhavya Sukhija
Yarden As
Marco Hutter
Stelian Coros
Andreas Krause
96
5
0
13 Nov 2023
Model-assisted Reinforcement Learning of a Quadrotor
Arshad Javeed
85
0
0
12 Nov 2023
An advantage based policy transfer algorithm for reinforcement learning with metrics of transferability
M. Alam
Parinaz Naghizadeh Ardabili
David Hoelzle
OffRL
57
0
0
12 Nov 2023
Clipped-Objective Policy Gradients for Pessimistic Policy Optimization
Jared Markowitz
Edward W. Staley
OffRL
77
2
0
10 Nov 2023
Real-time Control of Electric Autonomous Mobility-on-Demand Systems via Graph Reinforcement Learning
Aaryan Singhal
Daniele Gammelli
Justin Luke
Karthik Gopalakrishnan
Dominik Helmreich
Marco Pavone
69
2
0
09 Nov 2023
Differentiable Cloth Parameter Identification and State Estimation in Manipulation
Dongzhe Zheng
Siqiong Yao
Wenqiang Xu
Cewu Lu
79
6
0
09 Nov 2023
Signal Temporal Logic-Guided Apprenticeship Learning
Aniruddh Gopinath Puranic
Jyotirmoy V. Deshmukh
Stefanos Nikolaidis
103
2
0
09 Nov 2023
Force-Constrained Visual Policy: Safe Robot-Assisted Dressing via Multi-Modal Sensing
Zhanyi Sun
Yufei Wang
David Held
Zackory M. Erickson
84
4
0
07 Nov 2023
Hypothesis Network Planned Exploration for Rapid Meta-Reinforcement Learning Adaptation
Maxwell J. Jacobson
Yexiang Xue
88
0
0
07 Nov 2023
A Novel Variational Lower Bound for Inverse Reinforcement Learning
Yikang Gui
Prashant Doshi
62
0
0
07 Nov 2023
Context Shift Reduction for Offline Meta-Reinforcement Learning
Yunkai Gao
Rui Zhang
Jiaming Guo
Fan Wu
Qi Yi
...
Zidong Du
Xingui Hu
Qi Guo
Ling Li
Yunji Chen
OffRL
59
20
0
07 Nov 2023
Deep Bayesian Reinforcement Learning for Spacecraft Proximity Maneuvers and Docking
Desong Du
Naiming Qi
Yanfang Liu
Wei Pan
66
0
0
07 Nov 2023
SeRO: Self-Supervised Reinforcement Learning for Recovery from Out-of-Distribution Situations
Chan Kim
JaeKyung Cho
C. Bobda
Seung-Woo Seo
Seong-Woo Kim
65
3
0
07 Nov 2023
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy Optimization
Kun Lei
Zhengmao He
Chenhao Lu
Kaizhe Hu
Yang Gao
Huazhe Xu
OffRL, OnRL
134
13
0
06 Nov 2023
Imitation Bootstrapped Reinforcement Learning
Hengyuan Hu
Suvir Mirchandani
Dorsa Sadigh
134
27
0
03 Nov 2023
Domain Randomization via Entropy Maximization
Gabriele Tiboni
Pascal Klink
Jan Peters
Tatiana Tommasi
Carlo D'Eramo
Georgia Chalvatzaki
106
17
0
03 Nov 2023
Hierarchical Reinforcement Learning for Power Network Topology Control
Blazej Manczak
Jan Viebahn
H. V. Hoof
64
7
0
03 Nov 2023
Robust Adversarial Reinforcement Learning via Bounded Rationality Curricula
Aryaman Reddi
Maximilian Tölle
Jan Peters
Georgia Chalvatzaki
Carlo D'Eramo
76
7
0
03 Nov 2023
A Statistical Guarantee for Representation Transfer in Multitask Imitation Learning
Bryan Chan
Karime Pereida
James Bergstra
92
1
0
02 Nov 2023
RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation
Yufei Wang
Zhou Xian
Feng Chen
Tsun-Hsuan Wang
Yian Wang
Katerina Fragkiadaki
Zackory M. Erickson
David Held
Chuang Gan
LM&Ro
135
110
0
02 Nov 2023
Invariant Causal Imitation Learning for Generalizable Policies
Ioana Bica
Daniel Jarrett
Mihaela van der Schaar
CML, OffRL, OOD
129
35
0
02 Nov 2023
Time-series Generation by Contrastive Imitation
Daniel Jarrett
Ioana Bica
M. Schaar
AI4TS
86
24
0
02 Nov 2023
Diffusion Models for Reinforcement Learning: A Survey
Zhengbang Zhu
Hanye Zhao
Haoran He
Yichao Zhong
Shenyu Zhang
Haoquan Guo
Tingting Chen
Weinan Zhang
160
68
0
02 Nov 2023
Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning
Siming Lan
Rui Zhang
Qi Yi
Jiaming Guo
Shaohui Peng
...
Zidong Du
Xingui Hu
Xishan Zhang
Ling Li
Yunji Chen
90
9
0
02 Nov 2023
Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment
Annie S. Chen
Govind Chada
Laura M. Smith
Archit Sharma
Zipeng Fu
Sergey Levine
Chelsea Finn
100
8
0
02 Nov 2023
Efficient Symbolic Policy Learning with Differentiable Symbolic Expression
Jiaming Guo
Rui Zhang
Shaohui Peng
Qi Yi
Xingui Hu
...
Zidong Du
Xishan Zhang
Ling Li
Qi Guo
Yunji Chen
OffRL
74
7
0
02 Nov 2023
Selectively Sharing Experiences Improves Multi-Agent Reinforcement Learning
M. Gerstgrasser
Tom Danino
Sarah Keren
77
7
0
01 Nov 2023
A Multi-Agent Reinforcement Learning Framework for Evaluating the U.S. Ending the HIV Epidemic Plan
Dinesh Sharma
Ankit Shah
Chaitra Gopalappa
106
0
0
01 Nov 2023
Offline RL with Observation Histories: Analyzing and Improving Sample Complexity
Joey Hong
Anca Dragan
Sergey Levine
OffRL
66
5
0
31 Oct 2023
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
Ruizhe Shi
Yuyao Liu
Yanjie Ze
Simon S. Du
Huazhe Xu
OffRL, RALM
120
23
0
31 Oct 2023
Dropout Strategy in Reinforcement Learning: Limiting the Surrogate Objective Variance in Policy Optimization Methods
Zhengpeng Xie
Changdong Yu
Weizheng Qiao
103
1
0
31 Oct 2023
Sample-Efficient and Safe Deep Reinforcement Learning via Reset Deep Ensemble Agents
Woojun Kim
Yongjae Shin
Jongeui Park
Young-Jin Sung
OnRL
77
8
0
31 Oct 2023
Learning to Discover Skills through Guidance
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Sejik Park
Kyushik Min
Jaegul Choo
114
6
0
31 Oct 2023