ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.01290
  4. Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
v1v2 (latest)

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
ArXiv (abs)PDFHTML

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

28 / 4,128 papers shown
Title
VFunc: a Deep Generative Model for Functions
VFunc: a Deep Generative Model for Functions
Philip Bachman
Riashat Islam
Alessandro Sordoni
Zafarali Ahmed
VLMBDL
68
8
0
11 Jul 2018
Algorithmic Framework for Model-based Deep Reinforcement Learning with
  Theoretical Guarantees
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
Yuping Luo
Huazhe Xu
Yuanzhi Li
Yuandong Tian
Trevor Darrell
Tengyu Ma
OffRL
136
227
0
10 Jul 2018
Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value
  Expansion
Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion
Jacob Buckman
Danijar Hafner
George Tucker
E. Brevdo
Honglak Lee
97
333
0
04 Jul 2018
Unsupervised Meta-Learning for Reinforcement Learning
Unsupervised Meta-Learning for Reinforcement Learning
Abhishek Gupta
Benjamin Eysenbach
Chelsea Finn
Sergey Levine
SSLOffRL
138
107
0
12 Jun 2018
Implicit Policy for Reinforcement Learning
Implicit Policy for Reinforcement Learning
Yunhao Tang
Shipra Agrawal
64
14
0
10 Jun 2018
Deep Reinforcement Learning in a Handful of Trials using Probabilistic
  Dynamics Models
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models
Kurtland Chua
Roberto Calandra
R. McAllister
Sergey Levine
BDL
241
1,290
0
30 May 2018
Supervised Policy Update for Deep Reinforcement Learning
Supervised Policy Update for Deep Reinforcement Learning
Q. Vuong
Yiming Zhang
George Andriopoulos
79
20
0
29 May 2018
A0C: Alpha Zero in Continuous Action Space
A0C: Alpha Zero in Continuous Action Space
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
95
49
0
24 May 2018
Data-Efficient Hierarchical Reinforcement Learning
Data-Efficient Hierarchical Reinforcement Learning
Ofir Nachum
S. Gu
Honglak Lee
Sergey Levine
OffRL
102
815
0
21 May 2018
Evolution-Guided Policy Gradient in Reinforcement Learning
Evolution-Guided Policy Gradient in Reinforcement Learning
Shauharda Khadka
Kagan Tumer
132
232
0
21 May 2018
Constrained Policy Improvement for Safe and Efficient Reinforcement
  Learning
Constrained Policy Improvement for Safe and Efficient Reinforcement Learning
Elad Sarafian
Aviv Tamar
Sarit Kraus
OffRL
60
11
0
20 May 2018
Reinforcement Learning and Control as Probabilistic Inference: Tutorial
  and Review
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Sergey Levine
AI4CEBDL
114
677
0
02 May 2018
From Credit Assignment to Entropy Regularization: Two New Algorithms for
  Neural Sequence Prediction
From Credit Assignment to Entropy Regularization: Two New Algorithms for Neural Sequence Prediction
Zihang Dai
Qizhe Xie
Eduard H. Hovy
46
6
0
29 Apr 2018
Latent Space Policies for Hierarchical Reinforcement Learning
Latent Space Policies for Hierarchical Reinforcement Learning
Tuomas Haarnoja
Kristian Hartikainen
Pieter Abbeel
Sergey Levine
BDL
89
193
0
09 Apr 2018
Recall Traces: Backtracking Models for Efficient Reinforcement Learning
Recall Traces: Backtracking Models for Efficient Reinforcement Learning
Anirudh Goyal
Philemon Brakel
W. Fedus
Soumye Singhal
Timothy Lillicrap
Sergey Levine
Hugo Larochelle
Yoshua Bengio
OffRL
102
68
0
02 Apr 2018
Entropy based Independent Learning in Anonymous Multi-Agent Settings
Entropy based Independent Learning in Anonymous Multi-Agent Settings
Tanvi Verma
Pradeep Varakantham
H. Lau
63
7
0
27 Mar 2018
Simple random search provides a competitive approach to reinforcement
  learning
Simple random search provides a competitive approach to reinforcement learning
Horia Mania
Aurelia Guy
Benjamin Recht
93
317
0
19 Mar 2018
Composable Deep Reinforcement Learning for Robotic Manipulation
Composable Deep Reinforcement Learning for Robotic Manipulation
Tuomas Haarnoja
Vitchyr H. Pong
Aurick Zhou
Murtaza Dalal
Pieter Abbeel
Sergey Levine
152
234
0
19 Mar 2018
Imitation Learning with Concurrent Actions in 3D Games
Imitation Learning with Concurrent Actions in 3D Games
Jack Harmer
Linus Gisslén
Jorge del Val
Henrik Holst
Joakim Bergdahl
Tom Olsson
K. Sjöö
Magnus Nordin
75
46
0
14 Mar 2018
Policy Search in Continuous Action Domains: an Overview
Policy Search in Continuous Action Domains: an Overview
Olivier Sigaud
F. Stulp
131
73
0
13 Mar 2018
Addressing Function Approximation Error in Actor-Critic Methods
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
389
5,256
0
26 Feb 2018
Diversity is All You Need: Learning Skills without a Reward Function
Diversity is All You Need: Learning Skills without a Reward Function
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
269
1,089
0
16 Feb 2018
GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement
  Learning Algorithms
GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
138
159
0
14 Feb 2018
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
Zhang-Wei Hong
Tzu-Yun Shann
Shih-Yang Su
Yi-Hsiang Chang
Chun-Yi Lee
103
124
0
13 Feb 2018
Expected Policy Gradients for Reinforcement Learning
Expected Policy Gradients for Reinforcement Learning
K. Ciosek
Shimon Whiteson
125
53
0
10 Jan 2018
SBEED: Convergent Reinforcement Learning with Nonlinear Function
  Approximation
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Bo Dai
Albert Eaton Shaw
Lihong Li
Lin Xiao
Niao He
Zhen Liu
Jianshu Chen
Le Song
102
25
0
29 Dec 2017
Regret Minimization for Partially Observable Deep Reinforcement Learning
Regret Minimization for Partially Observable Deep Reinforcement Learning
Peter H. Jin
Kurt Keutzer
Sergey Levine
96
51
0
31 Oct 2017
An Information-Theoretic Optimality Principle for Deep Reinforcement
  Learning
An Information-Theoretic Optimality Principle for Deep Reinforcement Learning
Felix Leibfried
Jordi Grau-Moya
Haitham Bou-Ammar
116
24
0
06 Aug 2017
Previous
123...818283