ResearchTrend.AI
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 4,130 papers shown
Title
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Shengyi Huang
Quentin Gallouedec
Florian Felten
Antonin Raffin
Rousslan Fernand Julien Dossa
...
Alexander Nikulin
Xiao Hu
Tianlin Liu
Jongwook Choi
Brent Yi
OffRL
91
10
0
05 Feb 2024
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
Maciej Wolczyk
Bartłomiej Cupiał
M. Ostaszewski
Michal Bortkiewicz
Michał Zając
Razvan Pascanu
Łukasz Kuciński
Piotr Miłoś
CLL
160
18
0
05 Feb 2024
Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning
Abdelhakim Benechehab
Albert Thomas
Balázs Kégl
OffRL
74
2
0
05 Feb 2024
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences
Nikhil Kumar Singh
Indranil Saha
OffRL
35
0
0
05 Feb 2024
Understanding What Affects Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence
Jiafei Lyu
Le Wan
Xiu Li
Zongqing Lu
CML, OffRL
98
2
0
05 Feb 2024
The Virtues of Pessimism in Inverse Reinforcement Learning
David Wu
Gokul Swamy
J. Andrew Bagnell
Zhiwei Steven Wu
Sanjiban Choudhury
84
0
0
04 Feb 2024
Accelerating Inverse Reinforcement Learning with Expert Bootstrapping
David Wu
Sanjiban Choudhury
55
0
0
04 Feb 2024
Integrating DeepRL with Robust Low-Level Control in Robotic Manipulators for Non-Repetitive Reaching Tasks
Mehdi Heydari Shahna
Seyed Adel Alizadeh Kolagar
Jouni Mattila
87
7
0
04 Feb 2024
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching
Guanghe Li
Yixiang Shan
Zhengbang Zhu
Ting Long
Weinan Zhang
OffRL
102
17
0
04 Feb 2024
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
Lanqing Li
Hai Zhang
Xinyu Zhang
Shatong Zhu
Junqiao Zhao
Pheng-Ann Heng
OffRL
111
11
0
04 Feb 2024
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback
Yifu Yuan
Jianye Hao
Yi-An Ma
Zibin Dong
Hebin Liang
Jinyi Liu
Zhixin Feng
Kai-Wen Zhao
Yan Zheng
OffRL, ALM
79
16
0
04 Feb 2024
Evolution Guided Generative Flow Networks
Zarif Ikram
Ling Pan
Dianbo Liu
160
1
0
03 Feb 2024
Inverse Reinforcement Learning by Estimating Expertise of Demonstrators
M. Beliaev
Ramtin Pedarsani
116
4
0
02 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
84
10
0
02 Feb 2024
Fundamental Properties of Causal Entropy and Information Gain
F. N. F. Q. Simoes
Mehdi Dastani
T. V. Ommen
CML
154
3
0
02 Feb 2024
Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning
Chia-Cheng Chiang
Li-Cheng Lan
Wei-Fang Sun
Chien Feng
Cho-Jui Hsieh
Chun-Yi Lee
92
0
0
01 Feb 2024
Learning and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-Agent Reinforcement Learning
Benjamin Patrick Evans
Sumitra Ganesh
48
4
0
01 Feb 2024
Nash Soft Actor-Critic LEO Satellite Handover Management Algorithm for Flying Vehicles
Jinxuan Chen
Mustafa Özger
C. Cavdar
28
4
0
31 Jan 2024
Learning to Stop Cut Generation for Efficient Mixed-Integer Linear Programming
Haotian Ling
Zhihai Wang
Jie Wang
75
6
0
31 Jan 2024
M2CURL: Sample-Efficient Multimodal Reinforcement Learning via Self-Supervised Representation Learning for Robotic Manipulation
Fotios Lygerakis
Vedant Dave
Elmar Rueckert
SSL
89
4
0
30 Jan 2024
Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator
Ryoma Furuyama
Daiki Kuyoshi
Satoshi Yamane
49
0
0
30 Jan 2024
Zero-Shot Reinforcement Learning via Function Encoders
Tyler Ingebrand
Amy Zhang
Ufuk Topcu
OffRL
133
5
0
30 Jan 2024
Attention-based Reinforcement Learning for Combinatorial Optimization: Application to Job Shop Scheduling Problem
Jaejin Lee
Seho Kee
Mani Janakiram
George Runger
OffRL
58
3
0
29 Jan 2024
Context-Former: Stitching via Latent Conditioned Sequence Modeling
Ziqi Zhang
Jingzehua Xu
Jinxin Liu
Zifeng Zhuang
Donglin Wang
Miao Liu
Shuai Zhang
OffRL
104
4
0
29 Jan 2024
Attentive Convolutional Deep Reinforcement Learning for Optimizing Solar-Storage Systems in Real-Time Electricity Markets
Jinhao Li
Changlong Wang
Hao Wang
18
3
0
29 Jan 2024
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
Jianlan Luo
Zheyuan Hu
Charles Xu
You Liang Tan
Jacob Berg
Archit Sharma
S. Schaal
Chelsea Finn
Abhishek Gupta
Sergey Levine
OffRL, OnRL
192
49
0
29 Jan 2024
R×R: Rapid eXploration for Reinforcement Learning via Sampling-based Reset Distributions and Imitation Pre-training
Gagan Khandate
Tristan L. Saidi
Siqi Shang
Eric T. Chang
Yang Liu
Seth Matthew Dennis
Johnson Adams
M. Ciocarlie
135
4
0
27 Jan 2024
Off-Policy Primal-Dual Safe Reinforcement Learning
Zifan Wu
Bo Tang
Qian Lin
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
119
4
0
26 Jan 2024
Regularized Q-Learning with Linear Function Approximation
Jiachen Xi
Alfredo Garcia
P. Momcilovic
122
2
0
26 Jan 2024
Scilab-RL: A software framework for efficient reinforcement learning and cognitive modeling research
Jan Dohmen
Frank Röder
Manfred Eppe
OffRL
38
0
0
25 Jan 2024
Machine learning for industrial sensing and control: A survey and practical perspective
Nathan P. Lawrence
S. Damarla
Jong Woo Kim
Aditya Tulsyan
Faraz Amjad
Kai Wang
Benoît Chachuat
Jong Min Lee
Biao Huang
R. Bhushan Gopaluni
AI4CE
79
23
0
24 Jan 2024
The Definitive Guide to Policy Gradients in Deep Reinforcement Learning: Theory, Algorithms and Implementations
Matthias Lehmann
84
0
0
24 Jan 2024
Discovering Mathematical Formulas from Data via GPT-guided Monte Carlo Tree Search
Yanjie Li
Weijun Li
Lina Yu
Min Wu
Jingyi Liu
Wenqiang Li
Meilan Hao
Shu Wei
Yusong Deng
106
9
0
24 Jan 2024
DittoGym: Learning to Control Soft Shape-Shifting Robots
Suning Huang
Boyuan Chen
Huazhe Xu
Vincent Sitzmann
114
3
0
24 Jan 2024
Locality Sensitive Sparse Encoding for Learning World Models Online
Zi-Yan Liu
Chao Du
Wee Sun Lee
Min Lin
KELM, CLL, OffRL
92
11
0
23 Jan 2024
A Safe Reinforcement Learning Algorithm for Supervisory Control of Power Plants
Yixuan Sun
Sami Khairy
Richard B. Vilim
Rui Hu
Akshay J. Dave
101
4
0
23 Jan 2024
Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning
Zizhao Wang
Caroline Wang
Xuesu Xiao
Yuke Zhu
Peter Stone
OffRL
61
5
0
23 Jan 2024
DALex: Lexicase-like Selection via Diverse Aggregation
Andrew Ni
Lijie Ding
Lee Spector
103
6
0
23 Jan 2024
Adaptive Motion Planning for Multi-fingered Functional Grasp via Force Feedback
Dongying Tian
Xiangbo Lin
Yi Sun
105
3
0
22 Jan 2024
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid Algorithms
Pengyi Li
Jianye Hao
Hongyao Tang
Xian Fu
Yan Zheng
Ke Tang
134
13
0
22 Jan 2024
Efficient and Generalized end-to-end Autonomous Driving System with Latent Deep Reinforcement Learning and Demonstrations
Zuojin Tang
Xiaoyu Chen
YongQiang Li
Jianyu Chen
135
3
0
22 Jan 2024
Open the Black Box: Step-based Policy Updates for Temporally-Correlated Episodic Reinforcement Learning
Ge Li
Hongyi Zhou
Dominik Roth
Serge Thilges
Fabian Otto
Rudolf Lioutikov
Gerhard Neumann
OffRL
92
7
0
21 Jan 2024
Visual Imitation Learning with Calibrated Contrastive Representation
Yunke Wang
Linwei Tao
Bo Du
Yutian Lin
Chang Xu
70
0
0
21 Jan 2024
Asynchronous Parallel Reinforcement Learning for Optimizing Propulsive Performance in Fin Ray Control
Xin-Yang Liu
Dariush Bodaghi
Q. Xue
Xudong Zheng
Jian-Xun Wang
115
0
0
21 Jan 2024
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents
Siyuan Qi
Shuo Chen
Yexin Li
Xiangyu Kong
Junqi Wang
...
Zhaowei Zhang
Nian Liu
Wei Wang
Yaodong Yang
Song-Chun Zhu
AI4CE, LRM
121
23
0
19 Jan 2024
FREED++: Improving RL Agents for Fragment-Based Molecule Generation by Thorough Reproduction
Alexander Telepov
Artem Tsypin
Kuzma Khrabrov
Sergey Yakukhnov
Pavel Strashnov
...
Egor Rumiantsev
Daniel Ezhov
Manvel Avetisian
Olga Popova
Artur Kadurin
74
5
0
18 Jan 2024
Exploration and Anti-Exploration with Distributional Random Network Distillation
Kai Yang
Jian Tao
Jiafei Lyu
Xiu Li
112
17
0
18 Jan 2024
Deployable Reinforcement Learning with Variable Control Rate
Dong Wang
Giovanni Beltrame
83
5
0
17 Jan 2024
Autonomous Catheterization with Open-source Simulator and Expert Trajectory
Tudor Jianu
Baoru Huang
Tuan V. Vo
M. Vu
Jingxuan Kang
Hoan Nguyen
O. Omisore
Pierre Berthet-Rayne
S. Fichera
Anh Nguyen
72
7
0
17 Jan 2024
Towards Off-Policy Reinforcement Learning for Ranking Policies with Human Feedback
Teng Xiao
Suhang Wang
OffRL
78
8
0
17 Jan 2024