ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.01290
  4. Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
v1v2 (latest)

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
ArXiv (abs)PDFHTML

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 4,130 papers shown
Title
Open RAN LSTM Traffic Prediction and Slice Management using Deep
  Reinforcement Learning
Open RAN LSTM Traffic Prediction and Slice Management using Deep Reinforcement Learning
Fatemeh Lotfi
Fatemeh Afghah
AI4TS
70
9
0
12 Jan 2024
Identifying Policy Gradient Subspaces
Identifying Policy Gradient Subspaces
Jan Schneider-Barnes
Pierre Schumacher
Simon Guist
Tianyu Cui
Daniel Haeufle
Bernhard Scholkopf
Le Chen
87
6
0
12 Jan 2024
Spatial-Aware Deep Reinforcement Learning for the Traveling Officer
  Problem
Spatial-Aware Deep Reinforcement Learning for the Traveling Officer Problem
Niklas Strauß
Matthias Schubert
52
0
0
11 Jan 2024
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
Yuanzhao Zhai
Yiying Li
Zijian Gao
Xudong Gong
Kele Xu
Dawei Feng
Bo Ding
Huaimin Wang
OffRL
76
2
0
11 Jan 2024
An experimental evaluation of Deep Reinforcement Learning algorithms for
  HVAC control
An experimental evaluation of Deep Reinforcement Learning algorithms for HVAC control
Antonio Manjavacas
Alejandro Campoy-Nieves
Javier Jiménez Raboso
Miguel Molina-Solana
Juan Gómez-Romero
AI4CE
68
10
0
11 Jan 2024
ReACT: Reinforcement Learning for Controller Parametrization using
  B-Spline Geometries
ReACT: Reinforcement Learning for Controller Parametrization using B-Spline Geometries
Thomas Rudolf
Daniel Flögel
Tobias Schürmann
Simon Süß
S. Schwab
Sören Hohmann
AI4CE
79
1
0
10 Jan 2024
Unsupervised Salient Patch Selection for Data-Efficient Reinforcement
  Learning
Unsupervised Salient Patch Selection for Data-Efficient Reinforcement Learning
Zhaohui Jiang
Paul Weng
OffRL
73
0
0
10 Jan 2024
Autonomous Navigation of Tractor-Trailer Vehicles through Roundabout
  Intersections
Autonomous Navigation of Tractor-Trailer Vehicles through Roundabout Intersections
Daniel Attard
Josef Bajada
46
2
0
10 Jan 2024
Fully Spiking Actor Network with Intra-layer Connections for
  Reinforcement Learning
Fully Spiking Actor Network with Intra-layer Connections for Reinforcement Learning
Ding Chen
Peixi Peng
Tiejun Huang
Yonghong Tian
96
7
0
09 Jan 2024
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
Gokul Swamy
Christoph Dann
Rahul Kidambi
Zhiwei Steven Wu
Alekh Agarwal
OffRL
134
112
0
08 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot
  Learning
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRLOnRL
122
12
0
06 Jan 2024
Artificial Intelligence for Operations Research: Revolutionizing the
  Operations Research Process
Artificial Intelligence for Operations Research: Revolutionizing the Operations Research Process
Zhenan Fan
Bissan Ghaddar
Xinglu Wang
Linzi Xing
Yong Zhang
Zirui Zhou
AI4CE
101
13
0
06 Jan 2024
Interpreting Adaptive Gradient Methods by Parameter Scaling for
  Learning-Rate-Free Optimization
Interpreting Adaptive Gradient Methods by Parameter Scaling for Learning-Rate-Free Optimization
Min-Kook Suh
Seung-Woo Seo
ODL
76
0
0
06 Jan 2024
HAIM-DRL: Enhanced Human-in-the-loop Reinforcement Learning for Safe and
  Efficient Autonomous Driving
HAIM-DRL: Enhanced Human-in-the-loop Reinforcement Learning for Safe and Efficient Autonomous Driving
Zilin Huang
Zihao Sheng
Chengyuan Ma
Sikai Chen
89
36
0
06 Jan 2024
Deep Reinforcement Learning for Local Path Following of an Autonomous
  Formula SAE Vehicle
Deep Reinforcement Learning for Local Path Following of an Autonomous Formula SAE Vehicle
Harvey Merton
Thomas Delamore
Karl Stol
Henry Williams
45
0
0
05 Jan 2024
XUAT-Copilot: Multi-Agent Collaborative System for Automated User
  Acceptance Testing with Large Language Model
XUAT-Copilot: Multi-Agent Collaborative System for Automated User Acceptance Testing with Large Language Model
Zhitao Wang
Wei Wang
Zirao Li
Long Wang
Can Yi
Xinjie Xu
Luyang Cao
Hanjing Su
Shouzhi Chen
Jun Zhou
ALMLLMAG
78
9
0
05 Jan 2024
Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems
Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems
Eduardo Sebastián
T. Duong
Nikolay Atanasov
Eduardo Montijano
C. Sagüés
154
3
0
30 Dec 2023
Exploring Deep Reinforcement Learning for Robust Target Tracking using
  Micro Aerial Vehicles
Exploring Deep Reinforcement Learning for Robust Target Tracking using Micro Aerial Vehicles
Alberto Dionigi
Mirko Leomanni
Alessandro Saviolo
Giuseppe Loianno
G. Costante
79
2
0
29 Dec 2023
Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam
  Intensity Control in Mu2e
Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e
Chenwei Xu
Jerry Yao-Chieh Hu
A. Narayanan
M. Thieme
V. Nagaslaev
...
Rui Shi
S. Memik
A. Shuping
Kyle Hazelwood
Han Liu
43
2
0
28 Dec 2023
Gradient-based Planning with World Models
Gradient-based Planning with World Models
V. JyothirS
Siddhartha Jalagam
Yann LeCun
Vlad Sobal
83
4
0
28 Dec 2023
Can Active Sampling Reduce Causal Confusion in Offline Reinforcement
  Learning?
Can Active Sampling Reduce Causal Confusion in Offline Reinforcement Learning?
Gunshi Gupta
Tim G. J. Rudner
R. McAllister
Adrien Gaidon
Y. Gal
OffRL
95
3
0
28 Dec 2023
Adaptive trajectory-constrained exploration strategy for deep
  reinforcement learning
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Guojian Wang
Faguo Wu
Xiao Zhang
Ning Guo
Zhiming Zheng
78
3
0
27 Dec 2023
Efficient Reinforcement Learning via Decoupling Exploration and
  Utilization
Efficient Reinforcement Learning via Decoupling Exploration and Utilization
Jingpu Yang
Helin Wang
Qirui Zhao
Zhecheng Shi
Zirui Song
Miao Fang
115
1
0
26 Dec 2023
Generalizable Task Representation Learning for Offline
  Meta-Reinforcement Learning with Data Limitations
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
Renzhe Zhou
Chenxiao Gao
Zongzhang Zhang
Yang Yu
OffRL
109
12
0
26 Dec 2023
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
Wenzhang Liu
Wenzhe Cai
Kun Jiang
Guangran Cheng
Yuanda Wang
Changyin Sun
Jingyu Cao
Lele Xu
Chaoxu Mu
Changyin Sun
57
6
0
25 Dec 2023
BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge
BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge
Lin Sun
Weijun Wang
Tingting Yuan
Liang Mi
Haipeng Dai
Yunxin Liu
Xiaoming Fu
76
4
0
25 Dec 2023
Explicit-Implicit Subgoal Planning for Long-Horizon Tasks with Sparse
  Reward
Explicit-Implicit Subgoal Planning for Long-Horizon Tasks with Sparse Reward
Fangyuan Wang
Anqing Duan
Peng Zhou
Shengzeng Huo
Guodong Guo
Chenguang Yang
D. Navarro-Alarcon
OffRLVLM
80
0
0
25 Dec 2023
Conservative Exploration for Policy Optimization via Off-Policy Policy
  Evaluation
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
59
0
0
24 Dec 2023
MaDi: Learning to Mask Distractions for Generalization in Visual Deep
  Reinforcement Learning
MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement Learning
Bram Grooten
Tristan Tomilin
Gautham Vasan
Matthew E. Taylor
A. R. Mahmood
Meng Fang
Mykola Pechenizkiy
Decebal Constantin Mocanu
88
10
0
23 Dec 2023
Human-Centric Resource Allocation for the Metaverse With Multiaccess
  Edge Computing
Human-Centric Resource Allocation for the Metaverse With Multiaccess Edge Computing
Zijian Long
Haiwei Dong
Abdulmotaleb El Saddik
108
19
0
23 Dec 2023
Distributional Reinforcement Learning-based Energy Arbitrage Strategies
  in Imbalance Settlement Mechanism
Distributional Reinforcement Learning-based Energy Arbitrage Strategies in Imbalance Settlement Mechanism
Seyed soroush Karimi madahi
Bert Claessens
Chris Develder
66
7
0
23 Dec 2023
An investigation of belief-free DRL and MCTS for inspection and
  maintenance planning
An investigation of belief-free DRL and MCTS for inspection and maintenance planning
Daniel Koutas
E. Bismut
Daniel Straub
57
2
0
22 Dec 2023
Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement
  Learning with Dynamic Depth Routing
Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement Learning with Dynamic Depth Routing
Jinmin He
Kai Li
Yifan Zang
Haobo Fu
Qiang Fu
Junliang Xing
Jian Cheng
MoE
92
5
0
22 Dec 2023
Multi-Agent Probabilistic Ensembles with Trajectory Sampling for
  Connected Autonomous Vehicles
Multi-Agent Probabilistic Ensembles with Trajectory Sampling for Connected Autonomous Vehicles
Ruoqi Wen
Jiahao Huang
Rongpeng Li
Guoru Ding
Zhifeng Zhao
81
1
0
21 Dec 2023
Open-Source Reinforcement Learning Environments Implemented in MuJoCo
  with Franka Manipulator
Open-Source Reinforcement Learning Environments Implemented in MuJoCo with Franka Manipulator
Zichun Xu
Yuntao Li
Xiaohang Yang
Zhiyuan Zhao
Zhuang Lei
Jingdong Zhao
100
2
0
21 Dec 2023
Safe Multi-Agent Reinforcement Learning for Formation Control without
  Individual Reference Targets
Safe Multi-Agent Reinforcement Learning for Formation Control without Individual Reference Targets
Murad Dawood
Sicong Pan
Nils Dengler
Siqi Zhou
Angela P. Schoellig
Maren Bennewitz
OffRL
155
3
0
20 Dec 2023
Model-Based Control with Sparse Neural Dynamics
Model-Based Control with Sparse Neural Dynamics
Ziang Liu
Genggeng Zhou
Jeff He
Tobia Marcucci
Fei-Fei Li
Jiajun Wu
Yunzhu Li
AI4CE
92
18
0
20 Dec 2023
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in
  Noisy Environments
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
Jinyi Liu
Zhi Wang
Yan Zheng
Jianye Hao
Chenjia Bai
Junjie Ye
Zhen Wang
Haiyin Piao
Yang Sun
128
8
0
19 Dec 2023
Mastering Stacking of Diverse Shapes with Large-Scale Iterative
  Reinforcement Learning on Real Robots
Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots
Thomas Lampe
A. Abdolmaleki
Sarah Bechtle
Sandy H. Huang
Jost Tobias Springenberg
...
Markus Wulfmeier
Jingwei Zhang
Francesco Nori
N. Heess
Martin Riedmiller
OffRL
84
9
0
18 Dec 2023
Solving the swing-up and balance task for the Acrobot and Pendubot with
  SAC
Solving the swing-up and balance task for the Acrobot and Pendubot with SAC
Chi Zhang
Akhil Sathuluri
Markus Zimmermann
57
4
0
18 Dec 2023
Exploring Gradient Explosion in Generative Adversarial Imitation
  Learning: A Probabilistic Perspective
Exploring Gradient Explosion in Generative Adversarial Imitation Learning: A Probabilistic Perspective
Wanying Wang
Yichen Zhu
Yirui Zhou
Yaxin Peng
Jian Tang
Zhiyuan Xu
Chaomin Shen
Yangchun Zhang
71
4
0
18 Dec 2023
Aligning Human Intent from Imperfect Demonstrations with
  Confidence-based Inverse soft-Q Learning
Aligning Human Intent from Imperfect Demonstrations with Confidence-based Inverse soft-Q Learning
Xizhou Bu
Wenjuan Li
Zhengxiong Liu
Zhiqiang Ma
Panfeng Huang
84
2
0
18 Dec 2023
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles
  Control: Recent Advancements and Future Prospects
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles Control: Recent Advancements and Future Prospects
Min Hua
Dong Chen
Xinda Qi
Kun Jiang
Z. Liu
Quan Zhou
Hongming Xu
79
10
0
18 Dec 2023
Episodic Return Decomposition by Difference of Implicitly Assigned
  Sub-Trajectory Reward
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward
Hao-Chu Lin
Hongqiu Wu
Jiaji Zhang
Yihao Sun
Junyin Ye
Yang Yu
73
2
0
17 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
84
10
0
15 Dec 2023
Peer Learning: Learning Complex Policies in Groups from Scratch via
  Action Recommendations
Peer Learning: Learning Complex Policies in Groups from Scratch via Action Recommendations
Cedric Derstroff
Mattia Cerrato
Jannis Brugger
Jan Peters
Stefan Kramer
39
0
0
15 Dec 2023
Small Dataset, Big Gains: Enhancing Reinforcement Learning by Offline
  Pre-Training with Model Based Augmentation
Small Dataset, Big Gains: Enhancing Reinforcement Learning by Offline Pre-Training with Model Based Augmentation
Girolamo Macaluso
Alessandro Sestini
Andrew D. Bagdanov
OffRLOnRL
45
3
0
15 Dec 2023
Communication-Efficient Soft Actor-Critic Policy Collaboration via
  Regulated Segment Mixture in Internet of Vehicles
Communication-Efficient Soft Actor-Critic Policy Collaboration via Regulated Segment Mixture in Internet of Vehicles
Xiaoxue Yu
Rongpeng Li
Chengchao Liang
Zhifeng Zhao
79
0
0
15 Dec 2023
HiER: Highlight Experience Replay for Boosting Off-Policy Reinforcement
  Learning Agents
HiER: Highlight Experience Replay for Boosting Off-Policy Reinforcement Learning Agents
Dániel Horváth
Jesús Bujalance Martín
Ferenc Gàbor Erdos
Z. Istenes
Fabien Moutarde
OffRL
64
1
0
14 Dec 2023
Global Rewards in Multi-Agent Deep Reinforcement Learning for Autonomous
  Mobility on Demand Systems
Global Rewards in Multi-Agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems
Heiko Hoppe
Tobias Enders
Quentin Cappart
Maximilian Schiffer
85
4
0
14 Dec 2023
Previous
123...232425...818283
Next