ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.01290
  4. Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
v1v2 (latest)

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
ArXiv (abs)PDFHTML

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 4,130 papers shown
Title
Exploration by Learning Diverse Skills through Successor State Measures
Exploration by Learning Diverse Skills through Successor State Measures
Paul-Antoine Le Tolguenec
Yann Besse
Florent Teichteil-Königsbuch
Dennis G. Wilson
Emmanuel Rachelson
78
0
0
14 Jun 2024
Bridging the Communication Gap: Artificial Agents Learning Sign Language
  through Imitation
Bridging the Communication Gap: Artificial Agents Learning Sign Language through Imitation
Federico Tavella
Aphrodite Galata
Angelo Cangelosi
52
1
0
14 Jun 2024
Deep Bayesian Active Learning for Preference Modeling in Large Language
  Models
Deep Bayesian Active Learning for Preference Modeling in Large Language Models
Luckeciano C. Melo
P. Tigas
Alessandro Abate
Yarin Gal
122
11
0
14 Jun 2024
Robust Model-Based Reinforcement Learning with an Adversarial Auxiliary
  Model
Robust Model-Based Reinforcement Learning with an Adversarial Auxiliary Model
Siemen Herremans
Ali Anwar
Siegfried Mercelis
66
2
0
14 Jun 2024
I Know How: Combining Prior Policies to Solve New Tasks
I Know How: Combining Prior Policies to Solve New Tasks
Malio Li
Elia Piccoli
Vincenzo Lomonaco
Davide Bacciu
CLL
56
0
0
14 Jun 2024
DAG-Plan: Generating Directed Acyclic Dependency Graphs for Dual-Arm Cooperative Planning
DAG-Plan: Generating Directed Acyclic Dependency Graphs for Dual-Arm Cooperative Planning
Zeyu Gao
Yao Mu
Jinye Qu
Mengkang Hu
Lingyue Guo
Ping Luo
Yanfeng Lu
Ping Luo
Shanghang Zhang
Yanfeng Lu
131
11
0
14 Jun 2024
AutomaChef: A Physics-informed Demonstration-guided Learning Framework
  for Granular Material Manipulation
AutomaChef: A Physics-informed Demonstration-guided Learning Framework for Granular Material Manipulation
Minglun Wei
Xintong Yang
Yu-Kun Lai
S. A. Tafrishi
Ze Ji
AI4CE
67
0
0
13 Jun 2024
DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for
  Offline Reinforcement Learning
DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning
Xuemin Hu
Shen Li
Yingfen Xu
Bo Tang
Long Chen
59
0
0
13 Jun 2024
CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep
  Reinforcement Learning Algorithms
CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep Reinforcement Learning Algorithms
Arda Sarp Yenicesu
Furkan B. Mutlu
Suleyman S. Kozat
Ozgur S. Oguz
29
1
0
13 Jun 2024
A Dual Approach to Imitation Learning from Observations with Offline
  Datasets
A Dual Approach to Imitation Learning from Observations with Offline Datasets
Harshit S. Sikchi
Caleb Chuck
Amy Zhang
S. Niekum
OffRL
101
4
0
13 Jun 2024
BaSeNet: A Learning-based Mobile Manipulator Base Pose Sequence Planning
  for Pickup Tasks
BaSeNet: A Learning-based Mobile Manipulator Base Pose Sequence Planning for Pickup Tasks
Lakshadeep Naik
Sinan Kalkan
S. Sørensen
Mikkel B. Kjærgaard
Norbert Kruger
82
1
0
12 Jun 2024
Optimizing Deep Reinforcement Learning for Adaptive Robotic Arm Control
Optimizing Deep Reinforcement Learning for Adaptive Robotic Arm Control
Jonaid Shianifar
Michael Schukat
Karl Mason
42
3
0
12 Jun 2024
Residual Learning and Context Encoding for Adaptive Offline-to-Online
  Reinforcement Learning
Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning
Mohammadreza Nakhaei
Aidan Scannell
Joni Pajarinen
OffRL
103
1
0
12 Jun 2024
The Max-Min Formulation of Multi-Objective Reinforcement Learning: From
  Theory to a Model-Free Algorithm
The Max-Min Formulation of Multi-Objective Reinforcement Learning: From Theory to a Model-Free Algorithm
Giseung Park
Woohyeon Byeon
Seongmin Kim
Elad Havakuk
Amir Leshem
Youngchul Sung
51
3
0
12 Jun 2024
Unifying Interpretability and Explainability for Alzheimer's Disease
  Progression Prediction
Unifying Interpretability and Explainability for Alzheimer's Disease Progression Prediction
Raja Farrukh Ali
Stephanie Milani
John Woods
Emmanuel Adenij
Ayesha Farooq
Clayton Mansel
Jeffrey Burns
William Hsu
77
0
0
11 Jun 2024
CDSA: Conservative Denoising Score-based Algorithm for Offline
  Reinforcement Learning
CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning
Zeyuan Liu
Kai Yang
Xiu Li
OffRL
111
0
0
11 Jun 2024
Hybrid Reinforcement Learning from Offline Observation Alone
Hybrid Reinforcement Learning from Offline Observation Alone
Yuda Song
J. Andrew Bagnell
Aarti Singh
OffRL
128
2
0
11 Jun 2024
Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by
  Model-based Reinforcement Learning
Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning
Xuezhi Niu
Kaige Tan
Lei Feng
64
0
0
11 Jun 2024
Semantic-Aware Spectrum Sharing in Internet of Vehicles Based on Deep Reinforcement Learning
Semantic-Aware Spectrum Sharing in Internet of Vehicles Based on Deep Reinforcement Learning
Wenjun Zhang
Qiong Wu
Pingyi Fan
Nan Cheng
Wen Chen
Jiangzhou Wang
Khaled B. Letaief
74
27
0
11 Jun 2024
Learning Continually by Spectral Regularization
Learning Continually by Spectral Regularization
Alex Lewandowski
Saurabh Kumar
Dale Schuurmans
András Gyorgy
Marlos C. Machado
CLL
94
6
0
10 Jun 2024
Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach
  For Adaptive Brain Stimulation
Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation
Michelle Pan
Mariah L. Schrum
Vivek Myers
Erdem Bıyık
Anca Dragan
43
0
0
10 Jun 2024
Adaptive Opponent Policy Detection in Multi-Agent MDPs: Real-Time
  Strategy Switch Identification Using Running Error Estimation
Adaptive Opponent Policy Detection in Multi-Agent MDPs: Real-Time Strategy Switch Identification Using Running Error Estimation
Mohidul Haque Mridul
Mohammad Foysal Khan
Redwan Ahmed Rizvee
Md. Mosaddek Khan
AAML
50
0
0
10 Jun 2024
Towards Real-World Efficiency: Domain Randomization in Reinforcement
  Learning for Pre-Capture of Free-Floating Moving Targets by Autonomous Robots
Towards Real-World Efficiency: Domain Randomization in Reinforcement Learning for Pre-Capture of Free-Floating Moving Targets by Autonomous Robots
Bahador Beigomi
Zheng H. Zhu
69
0
0
10 Jun 2024
Decoupling regularization from the action space
Decoupling regularization from the action space
Sobhan Mohammadpour
Emma Frejinger
Pierre-Luc Bacon
73
2
0
10 Jun 2024
Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity
Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity
Calarina Muslimani
Bram Grooten
Deepak Ranganatha Sastry Mamillapalli
Mykola Pechenizkiy
Decebal Constantin Mocanu
Matthew E. Taylor
105
0
0
10 Jun 2024
ICU-Sepsis: A Benchmark MDP Built from Real Medical Data
ICU-Sepsis: A Benchmark MDP Built from Real Medical Data
Kartik Choudhary
Dhawal Gupta
Philip S. Thomas
OODVLM
66
0
0
09 Jun 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
104
1
0
09 Jun 2024
Multi-attribute Auction-based Resource Allocation for Twins Migration in
  Vehicular Metaverses: A GPT-based DRL Approach
Multi-attribute Auction-based Resource Allocation for Twins Migration in Vehicular Metaverses: A GPT-based DRL Approach
Yongju Tong
Junlong Chen
Minrui Xu
Jiawen Kang
Zehui Xiong
Dusit Niyato
Chau Yuen
Zhu Han
61
3
0
08 Jun 2024
Reinforcement Learning for Intensity Control: An Application to
  Choice-Based Network Revenue Management
Reinforcement Learning for Intensity Control: An Application to Choice-Based Network Revenue Management
Huiling Meng
Ningyuan Chen
Xuefeng Gao
99
1
0
08 Jun 2024
Sim-to-Real Transfer of Deep Reinforcement Learning Agents for Online
  Coverage Path Planning
Sim-to-Real Transfer of Deep Reinforcement Learning Agents for Online Coverage Path Planning
Arvi Jonnarth
Ola Johansson
Michael Felsberg
OffRL
132
1
0
07 Jun 2024
Skill-aware Mutual Information Optimisation for Generalisation in
  Reinforcement Learning
Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning
Xuehui Yu
Mhairi Dunion
Xin Li
Stefano V. Albrecht
108
2
0
07 Jun 2024
Optimization of geological carbon storage operations with multimodal
  latent dynamic model and deep reinforcement learning
Optimization of geological carbon storage operations with multimodal latent dynamic model and deep reinforcement learning
Zhongzheng Wang
Yuntian Chen
Guodong Chen
Dongxiao Zhang
AI4CE
81
1
0
07 Jun 2024
Strategically Conservative Q-Learning
Strategically Conservative Q-Learning
Yutaka Shimizu
Joey Hong
Sergey Levine
Masayoshi Tomizuka
OffRLOnRL
95
0
0
06 Jun 2024
ATraDiff: Accelerating Online Reinforcement Learning with Imaginary
  Trajectories
ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories
Qianlan Yang
Yu-Xiong Wang
OnRL
102
1
0
06 Jun 2024
Simulating, Fast and Slow: Learning Policies for Black-Box Optimization
Simulating, Fast and Slow: Learning Policies for Black-Box Optimization
F. V. Massoli
Tim Bakker
Thomas M. Hehn
Tribhuvanesh Orekondy
Arash Behboodi
110
0
0
06 Jun 2024
Redundancy-aware Action Spaces for Robot Learning
Redundancy-aware Action Spaces for Robot Learning
Pietro Mazzaglia
Nicholas Backshall
Xiao Ma
Stephen James
80
2
0
06 Jun 2024
Bootstrapping Expectiles in Reinforcement Learning
Bootstrapping Expectiles in Reinforcement Learning
Pierre Clavier
Emmanuel Rachelson
E. L. Pennec
Matthieu Geist
OffRL
101
0
0
06 Jun 2024
AC4MPC: Actor-Critic Reinforcement Learning for Nonlinear Model
  Predictive Control
AC4MPC: Actor-Critic Reinforcement Learning for Nonlinear Model Predictive Control
Rudolf Reiter
Andrea Ghezzi
Katrin Baumgärtner
Jasper Hoffmann
Robert D. McAllister
Moritz Diehl
90
8
0
06 Jun 2024
Exploring Pessimism and Optimism Dynamics in Deep Reinforcement Learning
Exploring Pessimism and Optimism Dynamics in Deep Reinforcement Learning
Bahareh Tasdighi
Nicklas Werge
Yi-Shan Wu
M. Kandemir
42
0
0
06 Jun 2024
Excluding the Irrelevant: Focusing Reinforcement Learning through
  Continuous Action Masking
Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking
Roland Stolz
Hanna Krasowski
Jakob Thumm
Michael Eichelbeck
Philipp Gassert
Matthias Althoff
CLL
46
4
0
06 Jun 2024
Reflective Policy Optimization
Reflective Policy Optimization
Yaozhong Gan
Renye Yan
Zhe Wu
Junliang Xing
84
1
0
06 Jun 2024
Object Manipulation in Marine Environments using Reinforcement Learning
Object Manipulation in Marine Environments using Reinforcement Learning
Ahmed Nader
Muhayy ud Din
Mughni Irfan
Irfan Hussain
111
0
0
05 Jun 2024
DEER: A Delay-Resilient Framework for Reinforcement Learning with
  Variable Delays
DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays
Bo Xia
Yilun Kong
Yongzhe Chang
Bo Yuan
Zhiheng Li
Xueqian Wang
Bin Liang
OffRL
114
3
0
05 Jun 2024
"Give Me an Example Like This": Episodic Active Reinforcement Learning
  from Demonstrations
"Give Me an Example Like This": Episodic Active Reinforcement Learning from Demonstrations
Muhan Hou
Koen V. Hindriks
A. E. Eiben
Kim Baraka
OffRL
93
3
0
05 Jun 2024
Representation Learning For Efficient Deep Multi-Agent Reinforcement
  Learning
Representation Learning For Efficient Deep Multi-Agent Reinforcement Learning
Dom Huh
Prasant Mohapatra
96
1
0
05 Jun 2024
Dynamically Expanding Capacity of Autonomous Driving with Near-Miss
  Focused Training Framework
Dynamically Expanding Capacity of Autonomous Driving with Near-Miss Focused Training Framework
Ziyuan Yang
Zhaoyang Li
Jianming Hu
Yi Zhang
61
0
0
05 Jun 2024
Multi-Agent Reinforcement Learning Meets Leaf Sequencing in Radiotherapy
Multi-Agent Reinforcement Learning Meets Leaf Sequencing in Radiotherapy
Riqiang Gao
Florin-Cristian Ghesu
Simon Arberet
Shahab Basiri
Esa Kuusela
Martin Kraus
Dorin Comaniciu
A. Kamen
AI4CE
50
1
0
03 Jun 2024
Learning the Target Network in Function Space
Learning the Target Network in Function Space
Kavosh Asadi
Yao Liu
Shoham Sabach
Ming Yin
Rasool Fakoor
126
0
0
03 Jun 2024
When to Sense and Control? A Time-adaptive Approach for Continuous-Time
  RL
When to Sense and Control? A Time-adaptive Approach for Continuous-Time RL
Lenart Treven
Bhavya Sukhija
Yarden As
Florian Dorfler
Andreas Krause
117
1
0
03 Jun 2024
Advancing DRL Agents in Commercial Fighting Games: Training,
  Integration, and Agent-Human Alignment
Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment
Chen Zhang
Qiang He
Zhou Yuan
Elvis S. Liu
Hong Wang
Jian Zhao
Yang-Feng Wang
118
2
0
03 Jun 2024
Previous
123...151617...818283
Next