ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.01290
  4. Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
v1v2 (latest)

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
ArXiv (abs)PDFHTML

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 4,128 papers shown
Title
Robotic Paper Wrapping by Learning Force Control
Robotic Paper Wrapping by Learning Force Control
Hiroki Hanai
Takuya Kiyokawa
Weiwei Wan
Kensuke Harada
115
0
0
19 Mar 2025
Reinforcement Learning for Robust Athletic Intelligence: Lessons from the 2nd ÁI Olympics with RealAIGym' Competition
Reinforcement Learning for Robust Athletic Intelligence: Lessons from the 2nd ÁI Olympics with RealAIGym' Competition
Felix Wiebe
Niccolò Turcato
Alberto Dalla Libera
Jean Seong Bjorn Choe
Bumkyu Choi
...
Dennis Mronga
Boris Belousov
Jan Peters
Frank Kirchner
Shivesh Kumar
74
5
0
19 Mar 2025
Predicting Multi-Agent Specialization via Task Parallelizability
Predicting Multi-Agent Specialization via Task Parallelizability
Elizabeth Mieczkowski
Ruaridh Mon-Williams
Neil R. Bramley
Christopher G. Lucas
Natalia Vélez
Thomas Griffiths
100
1
0
19 Mar 2025
CTSAC: Curriculum-Based Transformer Soft Actor-Critic for Goal-Oriented Robot Exploration
CTSAC: Curriculum-Based Transformer Soft Actor-Critic for Goal-Oriented Robot Exploration
Chunyu Yang
Shengben Bi
Yihui Xu
Xin Zhang
99
0
0
18 Mar 2025
VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences
VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences
Anukriti Singh
Amisha Bhaskar
Peihong Yu
Souradip Chakraborty
Ruthwik Dasyam
Amrit Singh Bedi
Pratap Tokekar
118
0
0
18 Mar 2025
Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs
Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs
Wei-Ting Hung
Shao-Hua Sun
Ping-Chun Hsieh
80
0
0
17 Mar 2025
Robot Policy Transfer with Online Demonstrations: An Active Reinforcement Learning Approach
Robot Policy Transfer with Online Demonstrations: An Active Reinforcement Learning Approach
Muhan Hou
Koen V. Hindriks
A. E. Eiben
Kim Baraka
OffRL
101
0
0
17 Mar 2025
EmoBipedNav: Emotion-aware Social Navigation for Bipedal Robots with Deep Reinforcement Learning
EmoBipedNav: Emotion-aware Social Navigation for Bipedal Robots with Deep Reinforcement Learning
Wei Zhu
Abirath Raju
Abdulaziz Shamsah
Anqi Wu
S. Hutchinson
Ye Zhao
137
0
0
16 Mar 2025
Evaluation-Time Policy Switching for Offline Reinforcement Learning
Evaluation-Time Policy Switching for Offline Reinforcement Learning
Natinael Solomon Neggatu
Jeremie Houssineau
Giovanni Montana
OffRLOnRL
114
0
0
15 Mar 2025
Dynamic Obstacle Avoidance with Bounded Rationality Adversarial Reinforcement Learning
Jose-Luis Holgado-Alvarez
Aryaman Reddi
Carlo DÉramo
93
0
0
14 Mar 2025
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Moritz A. Zanger
Pascal R. van der Vaart
Wendelin Bohmer
M. Spaan
UQCVBDL
518
2
0
14 Mar 2025
Adaptive Torque Control of Exoskeletons under Spasticity Conditions via Reinforcement Learning
Andrés Chavarrías
David Rodriguez-Cianca
Pablo Lanillos
64
0
0
14 Mar 2025
Safe exploration in reproducing kernel Hilbert spaces
Abdullah Tokmak
Kiran G. Krishnan
Thomas B. Schon
Dominik Baumann
83
0
0
13 Mar 2025
LUMOS: Language-Conditioned Imitation Learning with World Models
Iman Nematollahi
Branton DeMoss
Akshay L Chandra
Nick Hawes
Wolfram Burgard
Ingmar Posner
OffRL
71
1
0
13 Mar 2025
Optimisation of the Accelerator Control by Reinforcement Learning: A Simulation-Based Approach
Anwar Ibrahim
D. Derkach
Alexey Petrenko
Fedor Ratnikov
Maxim Kaledin
146
0
0
12 Mar 2025
Gait in Eight: Efficient On-Robot Learning for Omnidirectional Quadruped Locomotion
Nico Bohlinger
Jonathan Kinzel
Daniel Palenicek
Lukasz Antczak
Jan Peters
83
2
0
11 Mar 2025
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
Zhenghai Xue
Lang Feng
Jiacheng Xu
Kang Kang
Xiang Wen
Jingyi Wang
Shuicheng Yan
OffRL
91
0
0
10 Mar 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
Siyuan Mu
Sen Lin
MoE
500
5
0
10 Mar 2025
On the Fly Adaptation of Behavior Tree-Based Policies through Reinforcement Learning
M. Iannotta
J. A. Stork
Erik Schaffernicht
Todor Stoyanov
79
0
0
08 Mar 2025
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
Samuel Garcin
Trevor A. McInroe
Pablo Samuel Castro
Prakash Panangaden
Christopher G. Lucas
David Abel
Stefano V. Albrecht
116
0
0
08 Mar 2025
Mastering Continual Reinforcement Learning through Fine-Grained Sparse Network Allocation and Dormant Neuron Exploration
Chengqi Zheng
Haiyan Yin
Jianda Chen
Terence Ng
Yew-Soon Ong
Ivor Tsang
CLL
440
0
0
07 Mar 2025
Soft Policy Optimization: Online Off-Policy RL for Sequence Models
Taco Cohen
David W. Zhang
Kunhao Zheng
Yunhao Tang
Rémi Munos
Gabriel Synnaeve
OffRL
117
1
0
07 Mar 2025
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
Hyungkyu Kang
Min-hwan Oh
OffRL
124
0
0
07 Mar 2025
Multi-Task Reinforcement Learning Enables Parameter Scaling
Reginald McLean
Evangelos Chataroulas
Jordan Terry
Isaac Woungang
Nariman Farsad
Pablo Samuel Castro
LRM
159
1
0
07 Mar 2025
Refined Policy Distillation: From VLA Generalists to RL Experts
Tobias Jülg
Wolfram Burgard
Florian Walter
OffRL
71
1
0
06 Mar 2025
Can We Optimize Deep RL Policy Weights as Trajectory Modeling?
Hongyao Tang
OffRL
191
0
0
06 Mar 2025
AOLO: Analysis and Optimization For Low-Carbon Oriented Wireless Large Language Model Services
Xiaoqi Wang
Hongyang Du
Yuehong Gao
Dong In Kim
98
0
0
06 Mar 2025
Causality-Based Reinforcement Learning Method for Multi-Stage Robotic Tasks
Jiechao Deng
Ning Tan
92
0
0
05 Mar 2025
Embodied Escaping: End-to-End Reinforcement Learning for Robot Navigation in Narrow Environment
Han Zheng
Jing Zhang
Mingyang Jiang
Peiyuan Liu
Danni Liu
Tong Qin
Ming Yang
340
0
0
05 Mar 2025
Active Robot Curriculum Learning from Online Human Demonstrations
Muhan Hou
Koen V. Hindriks
A. E. Eiben
Kim Baraka
121
0
0
04 Mar 2025
A comparison of visual representations for real-world reinforcement learning in the context of vacuum gripping
Nico Sutter
Valentin N. Hartmann
Stelian Coros
OffRL
98
0
0
04 Mar 2025
Closing the Intent-to-Behavior Gap via Fulfillment Priority Logic
Closing the Intent-to-Behavior Gap via Fulfillment Priority Logic
B. Mabsout
Abdelrahman AbdelGawad
R. Mancuso
135
1
0
04 Mar 2025
A2Perf: Real-World Autonomous Agents Benchmark
Ikechukwu Uchendu
Jason J. Jabbour
Korneel Van den Berghe
Joel Runevic
Matthew P. Stewart
...
S. Guadarrama
Jie Tan
Jordan K. Terry
Aleksandra Faust
Vijay Janapa Reddi
91
0
0
04 Mar 2025
NavG: Risk-Aware Navigation in Crowded Environments Based on Reinforcement Learning with Guidance Points
Qianyi Zhang
Wentao Luo
Boyi Liu
Ziyang Zhang
Yaoyuan Wang
Jing Liu
84
0
0
03 Mar 2025
Differentiable Information Enhanced Model-Based Reinforcement Learning
Xiaoyuan Zhang
Xinyan Cai
Bo Liu
Weidong Huang
Song-Chun Zhu
Siyuan Qi
Y. Yang
99
0
0
03 Mar 2025
DPR: Diffusion Preference-based Reward for Offline Reinforcement Learning
DPR: Diffusion Preference-based Reward for Offline Reinforcement Learning
Teng Pang
Bingzheng Wang
Guoqiang Wu
Yilong Yin
OffRL
141
0
0
03 Mar 2025
Enhancing Deep Reinforcement Learning-based Robot Navigation Generalization through Scenario Augmentation
Shanze Wang
Mingao Tan
Zhiyong Yang
Xinyu Wang
Xiaoyu Shen
Hailong Huang
Wei Zhang
111
0
0
03 Mar 2025
Beyond Visibility Limits: A DRL-Based Navigation Strategy for Unexpected Obstacles
Mingao Tan
Shanze Wang
Biao Huang
Zhiyong Yang
Ruoxin Chen
Xiaoyu Shen
Wei Zhang
111
0
0
03 Mar 2025
Diffusion Stabilizer Policy for Automated Surgical Robot Manipulations
Chonlam Ho
Jianshu Hu
Haoran Wang
Qi Dou
Yutong Ban
MedIm
142
1
0
03 Mar 2025
Eau De $Q$-Network: Adaptive Distillation of Neural Networks in Deep Reinforcement Learning
Eau De QQQ-Network: Adaptive Distillation of Neural Networks in Deep Reinforcement Learning
Théo Vincent
Tim Lukas Faust
Yogesh Tripathi
Jan Peters
Carlo DÉramo
78
0
0
03 Mar 2025
On Generalization Across Environments In Multi-Objective Reinforcement Learning
Jayden Teoh
Pradeep Varakantham
Peter Vamplew
OffRL
116
1
0
02 Mar 2025
Runtime Learning of Quadruped Robots in Wild Environments
Yihao Cai
Y. Mao
L. Sha
H. Cao
Marco Caccamo
91
1
0
02 Mar 2025
Behavior Preference Regression for Offline Reinforcement Learning
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
72
0
0
02 Mar 2025
Passivity-Centric Safe Reinforcement Learning for Contact-Rich Robotic Tasks
Passivity-Centric Safe Reinforcement Learning for Contact-Rich Robotic Tasks
Heng Zhang
Gokhan Solak
Sebastian Hjorth
Arash Ajoudani
OffRL
63
1
0
01 Mar 2025
Discrete Codebook World Models for Continuous Control
Aidan Scannell
Mohammadreza Nakhaei
Kalle Kujanpää
Yi Zhao
Kevin Sebastian Luck
Dieter Büchler
Joni Pajarinen
OffRL
95
2
0
01 Mar 2025
BodyGen: Advancing Towards Efficient Embodiment Co-Design
Haofei Lu
Zhe Wu
Junliang Xing
Jianshu Li
Ruoyu Li
Zhe Li
Yuanchun Shi
76
2
0
01 Mar 2025
Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control
Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control
T. Lee
Donghwan Lee
155
0
0
28 Feb 2025
Safety Representations for Safer Policy Learning
Safety Representations for Safer Policy Learning
Kaustubh Mani
Vincent Mai
Charlie Gauthier
Annie Chen
Samer Nashed
Liam Paull
62
0
0
27 Feb 2025
Equivariant Reinforcement Learning Frameworks for Quadrotor Low-Level Control
Equivariant Reinforcement Learning Frameworks for Quadrotor Low-Level Control
Beomyeol Yu
Taeyoung Lee
141
0
0
27 Feb 2025
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Shangding Gu
Laixi Shi
Muning Wen
Ming Jin
Eric Mazumdar
Yuejie Chi
Adam Wierman
C. Spanos
OODOffRL
90
2
0
27 Feb 2025
Previous
123...567...818283
Next