ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.01290
  4. Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
ArXivPDFHTML

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 1,658 papers shown
Title
Generalize by Touching: Tactile Ensemble Skill Transfer for Robotic
  Furniture Assembly
Generalize by Touching: Tactile Ensemble Skill Transfer for Robotic Furniture Assembly
Hao-ming Lin
Radu Corcodel
Ding Zhao
45
7
0
26 Apr 2024
RUMOR: Reinforcement learning for Understanding a Model of the Real World for Navigation in Dynamic Environments
RUMOR: Reinforcement learning for Understanding a Model of the Real World for Navigation in Dynamic Environments
Diego Martínez Baselga
L. Riazuelo
Luis Montano
92
1
0
25 Apr 2024
A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
Bram De Cooman
Johan A. K. Suykens
43
0
0
25 Apr 2024
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
Lang Qin
Ziming Wang
Runhao Jiang
Rui Yan
Huajin Tang
43
1
0
24 Apr 2024
DPO: A Differential and Pointwise Control Approach to Reinforcement Learning
DPO: A Differential and Pointwise Control Approach to Reinforcement Learning
Minh Nguyen
Chandrajit Bajaj
27
0
0
24 Apr 2024
MultiSTOP: Solving Functional Equations with Reinforcement Learning
MultiSTOP: Solving Functional Equations with Reinforcement Learning
Alessandro Trenta
Davide Bacciu
Andrea Cossu
Pietro Ferrero
13
0
0
23 Apr 2024
Rank2Reward: Learning Shaped Reward Functions from Passive Video
Rank2Reward: Learning Shaped Reward Functions from Passive Video
Daniel Yang
Davin Tjia
Jacob Berg
Dima Damen
Pulkit Agrawal
Abhishek Gupta
OffRL
42
5
0
23 Apr 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for
  Mobile Edge Computing, its Applications, and Future Research Trajectories
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
42
6
0
22 Apr 2024
Explicit Lipschitz Value Estimation Enhances Policy Robustness Against
  Perturbation
Explicit Lipschitz Value Estimation Enhances Policy Robustness Against Perturbation
Xulin Chen
Ruipeng Liu
Garret E. Katz
49
0
0
22 Apr 2024
Empowering Embodied Visual Tracking with Visual Foundation Models and
  Offline RL
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL
Fangwei Zhong
Kui Wu
Hai Ci
Churan Wang
Hao Chen
OffRL
41
2
0
15 Apr 2024
Effective Reinforcement Learning Based on Structural Information
  Principles
Effective Reinforcement Learning Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Dingli Su
Angsheng Li
45
0
0
15 Apr 2024
Collaborative Ground-Space Communications via Evolutionary
  Multi-objective Deep Reinforcement Learning
Collaborative Ground-Space Communications via Evolutionary Multi-objective Deep Reinforcement Learning
Jiahui Li
Geng Sun
Qingqing Wu
Dusit Niyato
Jiawen Kang
Abbas Jamalipour
Victor C. M. Leung
16
20
0
11 Apr 2024
AdaDemo: Data-Efficient Demonstration Expansion for Generalist Robotic
  Agent
AdaDemo: Data-Efficient Demonstration Expansion for Generalist Robotic Agent
Tongzhou Mu
Yijie Guo
Jie Xu
Ankit Goyal
Hao Su
Dieter Fox
Animesh Garg
LM&Ro
57
0
0
11 Apr 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey
  and Unifying Perspective
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
53
6
0
09 Apr 2024
Computing Transition Pathways for the Study of Rare Events Using Deep
  Reinforcement Learning
Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning
Bo Lin
Yangzheng Zhong
Weiqing Ren
30
0
0
08 Apr 2024
Skill Transfer and Discovery for Sim-to-Real Learning: A
  Representation-Based Viewpoint
Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint
Haitong Ma
Zhaolin Ren
Bo Dai
Na Li
45
1
0
07 Apr 2024
Distributionally Robust Policy and Lyapunov-Certificate Learning
Distributionally Robust Policy and Lyapunov-Certificate Learning
Kehan Long
Jorge Cortés
Nikolay Atanasov
49
3
0
03 Apr 2024
Active Exploration in Bayesian Model-based Reinforcement Learning for
  Robot Manipulation
Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation
Carlos Plou
Ana C. Murillo
Ruben Martinez-Cantin
OffRL
45
0
0
02 Apr 2024
Learning to Control Camera Exposure via Reinforcement Learning
Learning to Control Camera Exposure via Reinforcement Learning
Kyunghyun Lee
Ukcheol Shin
Byeong-uk Lee
28
2
0
02 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active
  Online Exploration
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
46
0
0
31 Mar 2024
Exploiting Symmetry in Dynamics for Model-Based Reinforcement Learning
  with Asymmetric Rewards
Exploiting Symmetry in Dynamics for Model-Based Reinforcement Learning with Asymmetric Rewards
Yasin Sonmez
Neelay Junnarkar
Murat Arcak
42
1
0
27 Mar 2024
Exploring CausalWorld: Enhancing robotic manipulation via knowledge
  transfer and curriculum learning
Exploring CausalWorld: Enhancing robotic manipulation via knowledge transfer and curriculum learning
Xinrui Wang
Yan Jin
45
1
0
25 Mar 2024
Convergence of a model-free entropy-regularized inverse reinforcement learning algorithm
Convergence of a model-free entropy-regularized inverse reinforcement learning algorithm
Titouan Renard
Andreas Schlaginhaufen
Tingting Ni
Maryam Kamgarpour
64
1
0
25 Mar 2024
A Twin Delayed Deep Deterministic Policy Gradient Algorithm for
  Autonomous Ground Vehicle Navigation via Digital Twin Perception Awareness
A Twin Delayed Deep Deterministic Policy Gradient Algorithm for Autonomous Ground Vehicle Navigation via Digital Twin Perception Awareness
K. Olayemi
Mien Van
Seán F. McLoone
Yuzhu Sun
Jack Close
Minh-Nhat Nguyen
Stephen McIlvanna
60
3
0
22 Mar 2024
Bridging the Gap between Discrete Agent Strategies in Game Theory and
  Continuous Motion Planning in Dynamic Environments
Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments
Hongrui Zheng
Zhijun Zhuang
Stephanie Wu
Shuo Yang
Rahul Mangharam
35
1
0
17 Mar 2024
Diffusion-Reinforcement Learning Hierarchical Motion Planning in Multi-agent Adversarial Games
Diffusion-Reinforcement Learning Hierarchical Motion Planning in Multi-agent Adversarial Games
Zixuan Wu
Sean Ye
Manisha Natarajan
Matthew C. Gombolay
83
6
0
16 Mar 2024
ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models
ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models
Runyu Ma
Jelle Luijkx
Zlatan Ajanović
Jens Kober
LM&Ro
LRM
45
8
0
14 Mar 2024
Constrained Optimal Fuel Consumption of HEV: A Constrained Reinforcement
  Learning Approach
Constrained Optimal Fuel Consumption of HEV: A Constrained Reinforcement Learning Approach
Shuchang Yan
19
1
0
12 Mar 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
47
3
0
09 Mar 2024
Reset & Distill: A Recipe for Overcoming Negative Transfer in Continual
  Reinforcement Learning
Reset & Distill: A Recipe for Overcoming Negative Transfer in Continual Reinforcement Learning
Hongjoon Ahn
Jinu Hyeon
Youngmin Oh
Bosun Hwang
Taesup Moon
CLL
OnRL
42
2
0
08 Mar 2024
Learning Speed Adaptation for Flight in Clutter
Learning Speed Adaptation for Flight in Clutter
Guangyu Zhao
Tianyue Wu
Yeke Chen
Fei Gao
51
7
0
07 Mar 2024
Koopman-Assisted Reinforcement Learning
Koopman-Assisted Reinforcement Learning
Preston Rozwood
Edward Mehrez
Ludger Paehler
Wen Sun
Steven L. Brunton
45
7
0
04 Mar 2024
Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial
  Observability with a Soft Wrist
Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist
Hai Nguyen
Tadashi Kozuno
C. C. Beltran-Hernandez
Masashi Hamaya
54
7
0
28 Feb 2024
Imitation-regularized Optimal Transport on Networks: Provable Robustness and Application to Logistics Planning
Imitation-regularized Optimal Transport on Networks: Provable Robustness and Application to Logistics Planning
Koshi Oishi
Yota Hashizume
Tomohiko Jimbo
Hirotaka Kaji
Kenji Kashima
OOD
48
2
0
28 Feb 2024
Think2Drive: Efficient Reinforcement Learning by Thinking in Latent
  World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2)
Think2Drive: Efficient Reinforcement Learning by Thinking in Latent World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2)
Qifeng Li
Xiaosong Jia
Shaobo Wang
Junchi Yan
48
28
0
26 Feb 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy
  Regularization
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji
Yongyuan Liang
Yan Zeng
Yu-Juan Luo
Guowei Xu
Jiawei Guo
Ruijie Zheng
Furong Huang
Gang Hua
Huazhe Xu
CML
55
11
0
22 Feb 2024
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human
  Racing Gameplay
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay
Catherine Weaver
Chen Tang
Ce Hao
Kenta Kawamoto
Masayoshi Tomizuka
Wei Zhan
OffRL
37
0
0
22 Feb 2024
Learning control strategy in soft robotics through a set of
  configuration spaces
Learning control strategy in soft robotics through a set of configuration spaces
Etienne Ménager
Christian Duriez
45
0
0
21 Feb 2024
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
Anya Sims
Cong Lu
Yee Whye Teh
OffRL
41
3
0
19 Feb 2024
Multi Task Inverse Reinforcement Learning for Common Sense Reward
Multi Task Inverse Reinforcement Learning for Common Sense Reward
Neta Glazer
Aviv Navon
Aviv Shamsian
Ethan Fetaya
32
0
0
17 Feb 2024
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Xinyu Zhang
Wenjie Qiu
Yi-Chen Li
Lei Yuan
Chengxing Jia
Zongzhang Zhang
Yang Yu
OffRL
47
1
0
17 Feb 2024
Dataset Clustering for Improved Offline Policy Learning
Dataset Clustering for Improved Offline Policy Learning
Qiang Wang
Yixin Deng
Francisco Roldan Sanchez
Keru Wang
Kevin McGuinness
Noel E. O'Connor
Stephen J. Redmond
OffRL
34
2
0
14 Feb 2024
Hierarchical Transformers are Efficient Meta-Reinforcement Learners
Hierarchical Transformers are Efficient Meta-Reinforcement Learners
Gresa Shala
André Biedenkapp
Josif Grabocka
OffRL
48
4
0
09 Feb 2024
Scaling Artificial Intelligence for Digital Wargaming in Support of
  Decision-Making
Scaling Artificial Intelligence for Digital Wargaming in Support of Decision-Making
Scotty Black
Christian J. Darken
19
2
0
08 Feb 2024
DiffTORI: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation Learning
DiffTORI: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation Learning
Weikang Wan
Ziyu Wang
Zackory M. Erickson
David Held
David Held
39
4
0
08 Feb 2024
Transductive Reward Inference on Graph
Transductive Reward Inference on Graph
B. Qu
Xiaofeng Cao
Qing Guo
Yi Chang
Ivor W. Tsang
Chengqi Zhang
OffRL
45
0
0
06 Feb 2024
Accelerating Inverse Reinforcement Learning with Expert Bootstrapping
Accelerating Inverse Reinforcement Learning with Expert Bootstrapping
David Wu
Sanjiban Choudhury
28
0
0
04 Feb 2024
Integrating DeepRL with Robust Low-Level Control in Robotic Manipulators
  for Non-Repetitive Reaching Tasks
Integrating DeepRL with Robust Low-Level Control in Robotic Manipulators for Non-Repetitive Reaching Tasks
Mehdi Heydari Shahna
Seyed Adel Alizadeh Kolagar
Jouni Mattila
29
4
0
04 Feb 2024
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
Lanqing Li
Hai Zhang
Xinyu Zhang
Shatong Zhu
Junqiao Zhao
Junqiao Zhao
Pheng-Ann Heng
OffRL
54
7
0
04 Feb 2024
Inverse Reinforcement Learning by Estimating Expertise of Demonstrators
Inverse Reinforcement Learning by Estimating Expertise of Demonstrators
M. Beliaev
Ramtin Pedarsani
46
2
0
02 Feb 2024
Previous
123...678...323334
Next