ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.01290
  4. Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
ArXivPDFHTML

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 4,044 papers shown
Title
Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning
Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning
Yuji Kanagawa
Tomoyuki Kaneko
24
13
0
17 Apr 2019
End-to-End Robotic Reinforcement Learning without Reward Engineering
End-to-End Robotic Reinforcement Learning without Reward Engineering
Avi Singh
Larry Yang
Kristian Hartikainen
Chelsea Finn
Sergey Levine
SSL
OffRL
46
267
0
16 Apr 2019
Learning Probabilistic Multi-Modal Actor Models for Vision-Based Robotic
  Grasping
Learning Probabilistic Multi-Modal Actor Models for Vision-Based Robotic Grasping
Mengyuan Yan
A. Li
Mrinal Kalakrishnan
P. Pastor
15
18
0
15 Apr 2019
A Hitchhiker's Guide to Statistical Comparisons of Reinforcement
  Learning Algorithms
A Hitchhiker's Guide to Statistical Comparisons of Reinforcement Learning Algorithms
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
11
64
0
15 Apr 2019
Only Relevant Information Matters: Filtering Out Noisy Samples to Boost
  RL
Only Relevant Information Matters: Filtering Out Noisy Samples to Boost RL
Yannis Flet-Berliac
Philippe Preux
22
2
0
08 Apr 2019
Guided Meta-Policy Search
Guided Meta-Policy Search
Russell Mendonca
Abhishek Gupta
Rosen Kralev
Pieter Abbeel
Sergey Levine
Chelsea Finn
19
57
0
01 Apr 2019
How to pick the domain randomization parameters for sim-to-real transfer
  of reinforcement learning policies?
How to pick the domain randomization parameters for sim-to-real transfer of reinforcement learning policies?
Q. Vuong
Sharad Vikram
H. Su
Sicun Gao
Henrik I. Christensen
OOD
16
48
0
28 Mar 2019
Generalized Off-Policy Actor-Critic
Generalized Off-Policy Actor-Critic
Shangtong Zhang
Wendelin Bohmer
Shimon Whiteson
OffRL
CML
30
43
0
27 Mar 2019
AlphaX: eXploring Neural Architectures with Deep Neural Networks and
  Monte Carlo Tree Search
AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search
Linnan Wang
Yiyang Zhao
Yuu Jinnai
Yuandong Tian
Rodrigo Fonseca
BDL
25
95
0
26 Mar 2019
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Riley Simmons-Edler
Ben Eisner
E. Mitchell
Sebastian Seung
Daniel D. Lee
49
28
0
25 Mar 2019
On the use of Deep Autoencoders for Efficient Embedded Reinforcement
  Learning
On the use of Deep Autoencoders for Efficient Embedded Reinforcement Learning
Bharat Prakash
Mark Horton
Nicholas R. Waytowich
W. Hairston
Tim Oates
T. Mohsenin
11
19
0
25 Mar 2019
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic
  Context Variables
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables
Kate Rakelly
Aurick Zhou
Deirdre Quillen
Chelsea Finn
Sergey Levine
OffRL
51
650
0
19 Mar 2019
Truly Proximal Policy Optimization
Truly Proximal Policy Optimization
Yuhui Wang
Hao He
Chao Wen
Xiaoyang Tan
26
123
0
19 Mar 2019
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Dhruva Tirumala
Hyeonwoo Noh
Alexandre Galashov
Leonard Hasenclever
Arun Ahuja
Greg Wayne
Razvan Pascanu
Yee Whye Teh
N. Heess
OffRL
19
45
0
18 Mar 2019
Policy Distillation and Value Matching in Multiagent Reinforcement
  Learning
Policy Distillation and Value Matching in Multiagent Reinforcement Learning
Samir Wadhwania
Dong-Ki Kim
Shayegan Omidshafiei
Jonathan P. How
14
25
0
15 Mar 2019
Deep Reinforcement Learning with Feedback-based Exploration
Deep Reinforcement Learning with Feedback-based Exploration
Jan Scholten
Daan Wout
C. Celemin
Jens Kober
38
4
0
14 Mar 2019
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy
  Critics
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
31
17
0
11 Mar 2019
Sim-to-Real Transfer for Biped Locomotion
Sim-to-Real Transfer for Biped Locomotion
Wenhao Yu
Visak C. V. Kumar
Greg Turk
Chenxi Liu
17
115
0
04 Mar 2019
A Regularized Approach to Sparse Optimal Policy in Reinforcement
  Learning
A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning
Xiang Li
Wenhao Yang
Zhihua Zhang
11
2
0
02 Mar 2019
Catalyst.RL: A Distributed Framework for Reproducible RL Research
Catalyst.RL: A Distributed Framework for Reproducible RL Research
Sergey Kolesnikov
Oleksii Hrinchuk
OffRL
25
8
0
28 Feb 2019
Diagnosing Bottlenecks in Deep Q-learning Algorithms
Diagnosing Bottlenecks in Deep Q-learning Algorithms
Justin Fu
Aviral Kumar
Matthew Soh
Sergey Levine
OffRL
24
142
0
26 Feb 2019
Distributionally Robust Reinforcement Learning
Distributionally Robust Reinforcement Learning
E. Smirnova
Elvis Dohmatob
Jérémie Mary
OffRL
29
59
0
23 Feb 2019
Investigating Generalisation in Continuous Deep Reinforcement Learning
Investigating Generalisation in Continuous Deep Reinforcement Learning
Chenyang Zhao
Olivier Sigaud
F. Stulp
Timothy M. Hospedales
OffRL
27
48
0
19 Feb 2019
CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater
  Sample Efficiency and Simplicity
CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity
Aditya Bhatt
Daniel Palenicek
Boris Belousov
Max Argus
Artemij Amiranashvili
Thomas Brox
Jan Peters
67
46
0
14 Feb 2019
Off-Policy Actor-Critic in an Ensemble: Achieving Maximum General
  Entropy and Effective Environment Exploration in Deep Reinforcement Learning
Off-Policy Actor-Critic in an Ensemble: Achieving Maximum General Entropy and Effective Environment Exploration in Deep Reinforcement Learning
Gang Chen
Yiming Peng
24
8
0
14 Feb 2019
Simultaneously Learning Vision and Feature-based Control Policies for
  Real-world Ball-in-a-Cup
Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup
Devin Schwab
Tobias Springenberg
M. Martins
Thomas Lampe
Michael Neunert
A. Abdolmaleki
Tim Hertweck
Roland Hafner
F. Nori
Martin Riedmiller
21
22
0
13 Feb 2019
Artificial Intelligence for Prosthetics - challenge solutions
Artificial Intelligence for Prosthetics - challenge solutions
L. Kidzinski
Carmichael F. Ong
Sharada Mohanty
Jennifer Hicks
Sean F. Carroll
...
E. Tumer
J. Watson
M. Salathé
Sergey Levine
Scott L. Delp
20
40
0
07 Feb 2019
Tsallis Reinforcement Learning: A Unified Framework for Maximum Entropy
  Reinforcement Learning
Tsallis Reinforcement Learning: A Unified Framework for Maximum Entropy Reinforcement Learning
Kyungjae Lee
Sungyub Kim
Sungbin Lim
Sungjoon Choi
Songhwai Oh
52
28
0
31 Jan 2019
A Theory of Regularized Markov Decision Processes
A Theory of Regularized Markov Decision Processes
Matthieu Geist
B. Scherrer
Olivier Pietquin
47
316
0
31 Jan 2019
InfoBot: Transfer and Exploration via the Information Bottleneck
InfoBot: Transfer and Exploration via the Information Bottleneck
Anirudh Goyal
Riashat Islam
Daniel Strouse
Zafarali Ahmed
M. Botvinick
Hugo Larochelle
Yoshua Bengio
Sergey Levine
OffRL
16
166
0
30 Jan 2019
Discretizing Continuous Action Space for On-Policy Optimization
Discretizing Continuous Action Space for On-Policy Optimization
Yunhao Tang
Shipra Agrawal
OffRL
26
119
0
29 Jan 2019
Trust Region-Guided Proximal Policy Optimization
Trust Region-Guided Proximal Policy Optimization
Yuhui Wang
Hao He
Xiaoyang Tan
Yaozhong Gan
OffRL
26
55
0
29 Jan 2019
Self-organization of action hierarchy and compositionality by
  reinforcement learning with recurrent neural networks
Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks
Dongqi Han
Kenji Doya
Jun Tani
AI4CE
27
20
0
29 Jan 2019
Modelling Bounded Rationality in Multi-Agent Interactions by Generalized
  Recursive Reasoning
Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning
Ying Wen
Yaodong Yang
Rui Luo
Jun Wang
LRM
37
52
0
26 Jan 2019
Credit Assignment Techniques in Stochastic Computation Graphs
Credit Assignment Techniques in Stochastic Computation Graphs
T. Weber
N. Heess
Lars Buesing
David Silver
21
45
0
07 Jan 2019
Hierarchical Reinforcement Learning via Advantage-Weighted Information
  Maximization
Hierarchical Reinforcement Learning via Advantage-Weighted Information Maximization
Takayuki Osa
Voot Tangkaratt
Masashi Sugiyama
OffRL
22
27
0
05 Jan 2019
Adversarial Learning of a Sampler Based on an Unnormalized Distribution
Adversarial Learning of a Sampler Based on an Unnormalized Distribution
Chunyuan Li
Ke Bai
Jianqiao Li
Guoyin Wang
Changyou Chen
Lawrence Carin
65
10
0
03 Jan 2019
Learning to Walk via Deep Reinforcement Learning
Learning to Walk via Deep Reinforcement Learning
Tuomas Haarnoja
Sehoon Ha
Aurick Zhou
Jie Tan
George Tucker
Sergey Levine
54
434
0
26 Dec 2018
TD-Regularized Actor-Critic Methods
TD-Regularized Actor-Critic Methods
Simone Parisi
Voot Tangkaratt
Jan Peters
Mohammad Emtiyaz Khan
OffRL
32
31
0
19 Dec 2018
Soft Actor-Critic Algorithms and Applications
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
83
2,381
0
13 Dec 2018
Residual Reinforcement Learning for Robot Control
Residual Reinforcement Learning for Robot Control
T. Johannink
Shikhar Bahl
Ashvin Nair
Jianlan Luo
Avinash Kumar
M. Loskyll
J. A. Ojea
Eugen Solowjow
Sergey Levine
OffRL
35
410
0
07 Dec 2018
Provably Efficient Maximum Entropy Exploration
Provably Efficient Maximum Entropy Exploration
Elad Hazan
Sham Kakade
Karan Singh
A. V. Soest
36
295
0
06 Dec 2018
Relative Entropy Regularized Policy Iteration
Relative Entropy Regularized Policy Iteration
A. Abdolmaleki
Jost Tobias Springenberg
Jonas Degrave
Steven Bohez
Yuval Tassa
Dan Belov
N. Heess
Martin Riedmiller
27
72
0
05 Dec 2018
Composing Entropic Policies using Divergence Correction
Composing Entropic Policies using Divergence Correction
Jonathan J. Hunt
André Barreto
Timothy Lillicrap
N. Heess
24
2
0
05 Dec 2018
Exploration versus exploitation in reinforcement learning: a stochastic
  control approach
Exploration versus exploitation in reinforcement learning: a stochastic control approach
Haoran Wang
T. Zariphopoulou
X. Zhou
6
49
0
04 Dec 2018
Generative Adversarial Self-Imitation Learning
Generative Adversarial Self-Imitation Learning
Yijie Guo
Junhyuk Oh
Satinder Singh
Honglak Lee
GAN
35
58
0
03 Dec 2018
VIREL: A Variational Inference Framework for Reinforcement Learning
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
38
54
0
03 Nov 2018
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
J. Gauci
Edoardo Conti
Yitao Liang
Kittipat Virochsiri
Yuchen He
Zachary Kaden
Vivek Narayanan
Xiaohui Ye
Zhengxing Chen
Scott Fujimoto
30
139
0
01 Nov 2018
Relative Importance Sampling For Off-Policy Actor-Critic in Deep
  Reinforcement Learning
Relative Importance Sampling For Off-Policy Actor-Critic in Deep Reinforcement Learning
Mahammad Humayoo
Xueqi Cheng
BDL
OffRL
24
5
0
30 Oct 2018
Model-Based Active Exploration
Model-Based Active Exploration
Pranav Shyam
Wojciech Ja'skowski
Faustino J. Gomez
35
179
0
29 Oct 2018
Previous
123...798081
Next