ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.01290
  4. Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
ArXivPDFHTML

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 1,719 papers shown
Title
On The Transferability of Deep-Q Networks
On The Transferability of Deep-Q Networks
M. Sabatelli
Pierre Geurts
42
2
0
06 Oct 2021
Adaptive control of a mechatronic system using constrained residual
  reinforcement learning
Adaptive control of a mechatronic system using constrained residual reinforcement learning
Tom Staessens
Tom Lefebvre
Guillaume Crevecoeur
22
16
0
06 Oct 2021
Pretraining & Reinforcement Learning: Sharpening the Axe Before Cutting
  the Tree
Pretraining & Reinforcement Learning: Sharpening the Axe Before Cutting the Tree
Saurav Kadavath
Samuel Paradis
Brian Yao
VLM
CLIP
OffRL
OnRL
22
1
0
06 Oct 2021
Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective
Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective
Ting-Han Fan
Peter J. Ramadge
CML
FAtt
OffRL
21
2
0
06 Oct 2021
Influencing Towards Stable Multi-Agent Interactions
Influencing Towards Stable Multi-Agent Interactions
Woodrow Z. Wang
Andy Shih
Annie Xie
Dorsa Sadigh
51
34
0
05 Oct 2021
Dropout Q-Functions for Doubly Efficient Reinforcement Learning
Dropout Q-Functions for Doubly Efficient Reinforcement Learning
Takuya Hiraoka
Takahisa Imagawa
Taisei Hashimoto
Takashi Onishi
Yoshimasa Tsuruoka
25
106
0
05 Oct 2021
Continuous-Time Fitted Value Iteration for Robust Policies
Continuous-Time Fitted Value Iteration for Robust Policies
M. Lutter
Boris Belousov
Shie Mannor
Dieter Fox
Animesh Garg
Jan Peters
15
9
0
05 Oct 2021
Combining Physics and Deep Learning to learn Continuous-Time Dynamics
  Models
Combining Physics and Deep Learning to learn Continuous-Time Dynamics Models
M. Lutter
Jan Peters
PINN
AI4CE
45
39
0
05 Oct 2021
Hierarchical Primitive Composition: Simultaneous Activation of Skills
  with Inconsistent Action Dimensions in Multiple Hierarchies
Hierarchical Primitive Composition: Simultaneous Activation of Skills with Inconsistent Action Dimensions in Multiple Hierarchies
Jeong-Hoon Lee
Jongeun Choi
47
8
0
05 Oct 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified
  Q-Ensemble
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
105
265
0
04 Oct 2021
Large Batch Experience Replay
Large Batch Experience Replay
Thibault Lahire
Matthieu Geist
Emmanuel Rachelson
OffRL
56
13
0
04 Oct 2021
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule
  Generation
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Generation
Soojung Yang
Doyeong Hwang
Seul Lee
Seongok Ryu
Sung Ju Hwang
42
68
0
04 Oct 2021
Seeking Visual Discomfort: Curiosity-driven Representations for
  Reinforcement Learning
Seeking Visual Discomfort: Curiosity-driven Representations for Reinforcement Learning
Elie Aljalbout
Maximilian Ulmer
Rudolph Triebel
24
2
0
02 Oct 2021
Divergence-Regularized Multi-Agent Actor-Critic
Divergence-Regularized Multi-Agent Actor-Critic
Kefan Su
Zongqing Lu
46
25
0
01 Oct 2021
Powerpropagation: A sparsity inducing weight reparameterisation
Powerpropagation: A sparsity inducing weight reparameterisation
Jonathan Richard Schwarz
Siddhant M. Jayakumar
Razvan Pascanu
P. Latham
Yee Whye Teh
98
54
0
01 Oct 2021
Neural Network Verification in Control
Neural Network Verification in Control
M. Everett
AAML
37
16
0
30 Sep 2021
Solving the Real Robot Challenge using Deep Reinforcement Learning
Solving the Real Robot Challenge using Deep Reinforcement Learning
Robert McCarthy
Francisco Roldan Sanchez
Qiang Wang
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
S. Redmond
53
11
0
30 Sep 2021
Unified Data Collection for Visual-Inertial Calibration via Deep
  Reinforcement Learning
Unified Data Collection for Visual-Inertial Calibration via Deep Reinforcement Learning
Yu Ao
Le Chen
Florian Tschopp
Michel Breyer
Andrei Cramariuc
Roland Siegwart
20
4
0
30 Sep 2021
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates
Romain Laroche
Rémi Tachet des Combes
46
8
0
29 Sep 2021
On the Estimation Bias in Double Q-Learning
On the Estimation Bias in Double Q-Learning
Zhizhou Ren
Guangxiang Zhu
Haotian Hu
Beining Han
Jian-Hai Chen
Chongjie Zhang
27
17
0
29 Sep 2021
Learning Dynamics Models for Model Predictive Agents
Learning Dynamics Models for Model Predictive Agents
M. Lutter
Leonard Hasenclever
Arunkumar Byravan
Gabriel Dulac-Arnold
Piotr Trochim
N. Heess
J. Merel
Yuval Tassa
AI4CE
57
26
0
29 Sep 2021
Lyapunov-stable neural-network control
Lyapunov-stable neural-network control
Hongkai Dai
Benoit Landry
Lujie Yang
Marco Pavone
Russ Tedrake
28
120
0
29 Sep 2021
A First-Occupancy Representation for Reinforcement Learning
A First-Occupancy Representation for Reinforcement Learning
Theodore H. Moskovitz
S. Wilson
M. Sahani
41
15
0
28 Sep 2021
Making Curiosity Explicit in Vision-based RL
Making Curiosity Explicit in Vision-based RL
Elie Aljalbout
Maximilian Ulmer
Rudolph Triebel
OffRL
34
2
0
28 Sep 2021
Deep Reinforcement Learning with Adjustments
Deep Reinforcement Learning with Adjustments
H. Khorasgani
Haiyan Wang
Chetan Gupta
Susumu Serita
23
2
0
28 Sep 2021
Semi-Autonomous Teleoperation via Learning Non-Prehensile Manipulation
  Skills
Semi-Autonomous Teleoperation via Learning Non-Prehensile Manipulation Skills
Sangbeom Park
Yoonbyung Chai
Sunghyun Park
Jeongeun Park
Kyungjae Lee
Sungjoon Choi
SSL
36
5
0
27 Sep 2021
MetaDrive: Composing Diverse Driving Scenarios for Generalizable
  Reinforcement Learning
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning
Quanyi Li
Zhenghao Peng
Lan Feng
Qihang Zhang
Zhenghai Xue
Bolei Zhou
48
233
0
26 Sep 2021
Prioritized Experience-based Reinforcement Learning with Human Guidance
  for Autonomous Driving
Prioritized Experience-based Reinforcement Learning with Human Guidance for Autonomous Driving
Jingda Wu
Zhiyu Huang
Wenhui Huang
Chen Lv
57
74
0
26 Sep 2021
Self-Enhancing Multi-filter Sequence-to-Sequence Model
Self-Enhancing Multi-filter Sequence-to-Sequence Model
Yunhao Yang
Zhaokun Xue
Andrew Whinston
45
1
0
25 Sep 2021
Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning
  Algorithms
Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms
Liyuan Zheng
Tanner Fiez
Zane Alumbaugh
Benjamin J. Chasnov
Lillian J. Ratliff
OffRL
37
39
0
25 Sep 2021
The $f$-Divergence Reinforcement Learning Framework
The fff-Divergence Reinforcement Learning Framework
Chen Gong
Qiang He
Yunpeng Bai
Zhouyi Yang
Xiaoyu Chen
Xinwen Hou
Xianjie Zhang
Yu Liu
Guoliang Fan
42
3
0
24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with
  On-Policy Experience
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
24
30
0
24 Sep 2021
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
J. Kuba
Ruiqing Chen
Munning Wen
Ying Wen
Fanglei Sun
Jun Wang
Yaodong Yang
67
231
0
23 Sep 2021
PredictionNet: Real-Time Joint Probabilistic Traffic Prediction for
  Planning, Control, and Simulation
PredictionNet: Real-Time Joint Probabilistic Traffic Prediction for Planning, Control, and Simulation
A. Kamenev
Lirui Wang
Ollin Boer Bohan
Ishwar Kulkarni
Bilal Kartal
Artem Molchanov
Stan Birchfield
David Nistér
Nikolai Smolyanskiy
58
40
0
23 Sep 2021
A Workflow for Offline Model-Free Robotic Reinforcement Learning
A Workflow for Offline Model-Free Robotic Reinforcement Learning
Aviral Kumar
Anika Singh
Stephen Tian
Chelsea Finn
Sergey Levine
OffRL
145
85
0
22 Sep 2021
LDC-VAE: A Latent Distribution Consistency Approach to Variational
  AutoEncoders
LDC-VAE: A Latent Distribution Consistency Approach to Variational AutoEncoders
Xiaoyu Chen
Chen Gong
Qiang He
Xinwen Hou
Yu Liu
40
1
0
22 Sep 2021
MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep
  Reinforcement Learning
MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep Reinforcement Learning
Qiang He
Yuxun Qu
Chen Gong
Xinwen Hou
OffRL
22
10
0
22 Sep 2021
Context-Specific Representation Abstraction for Deep Option Learning
Context-Specific Representation Abstraction for Deep Option Learning
Marwa Abdulhai
Dong-Ki Kim
Matthew D Riemer
Miao Liu
Gerald Tesauro
Jonathan P. How
OffRL
53
9
0
20 Sep 2021
Generalization in Mean Field Games by Learning Master Policies
Generalization in Mean Field Games by Learning Master Policies
Sarah Perrin
Mathieu Laurière
Julien Pérolat
Romuald Élie
Matthieu Geist
Olivier Pietquin
AI4CE
94
35
0
20 Sep 2021
Dual Behavior Regularized Reinforcement Learning
Dual Behavior Regularized Reinforcement Learning
Chapman Siu
Jason M. Traish
R. Xu
OffRL
23
1
0
19 Sep 2021
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Michael Wan
Jian-wei Peng
Tanmay Gangwani
36
7
0
18 Sep 2021
Density-based Curriculum for Multi-goal Reinforcement Learning with
  Sparse Rewards
Density-based Curriculum for Multi-goal Reinforcement Learning with Sparse Rewards
Deyu Yang
Hanbo Zhang
Xuguang Lan
Jishiyu Ding
OffRL
35
2
0
18 Sep 2021
Soft Actor-Critic With Integer Actions
Soft Actor-Critic With Integer Actions
Ting-Han Fan
Yubo Wang
35
12
0
17 Sep 2021
Landmark Policy Optimization for Object Navigation Task
Landmark Policy Optimization for Object Navigation Task
A. Staroverov
Aleksandr I. Panov
52
0
0
17 Sep 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
42
77
0
16 Sep 2021
Efficient Differentiable Simulation of Articulated Bodies
Efficient Differentiable Simulation of Articulated Bodies
Yi-Ling Qiao
Junbang Liang
V. Koltun
Ming Lin
AI4CE
40
56
0
16 Sep 2021
Balancing detectability and performance of attacks on the control
  channel of Markov Decision Processes
Balancing detectability and performance of attacks on the control channel of Markov Decision Processes
Alessio Russo
Alexandre Proutiere
AAML
43
6
0
15 Sep 2021
Infusing model predictive control into meta-reinforcement learning for
  mobile robots in dynamic environments
Infusing model predictive control into meta-reinforcement learning for mobile robots in dynamic environments
Jaeuk Shin
A. Hakobyan
Mingyu Park
Yeoneung Kim
Gihun Kim
Insoon Yang
42
10
0
15 Sep 2021
Learning Based Adaptive Force Control of Robotic Manipulation Based on
  Real-Time Object Stiffness Detection
Learning Based Adaptive Force Control of Robotic Manipulation Based on Real-Time Object Stiffness Detection
Zhaoxing Deng
Xutian Deng
Miao Li
OOD
15
2
0
14 Sep 2021
Reinforcement Learning with Evolutionary Trajectory Generator: A General
  Approach for Quadrupedal Locomotion
Reinforcement Learning with Evolutionary Trajectory Generator: A General Approach for Quadrupedal Locomotion
Hao-bin Shi
Bo Zhou
Hongsheng Zeng
Fan Wang
Yueqiang Dong
Jiangyong Li
Kang Wang
Hao Tian
Max Meng
50
50
0
14 Sep 2021
Previous
123...232425...333435
Next