Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
v1
v2 (latest)
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 4,130 papers shown
Title
Crowdfunding Dynamics Tracking: A Reinforcement Learning Approach
Jun Wang
Haifeng Zhang
Qi Liu
Zhen Pan
Hanqing Tao
39
6
0
27 Dec 2019
SoundSpaces: Audio-Visual Navigation in 3D Environments
Changan Chen
Unnat Jain
Carl Schissler
S. V. A. Garí
Ziad Al-Halah
V. Ithapu
Philip Robinson
Kristen Grauman
110
26
0
24 Dec 2019
Discrete and Continuous Action Representation for Practical RL in Video Games
Olivier Delalleau
Maxim Peter
Eloi Alonso
Adrien Logut
88
53
0
23 Dec 2019
Towards Practical Multi-Object Manipulation using Relational Reinforcement Learning
R. Li
Allan Jabri
Trevor Darrell
Pulkit Agrawal
OffRL
88
112
0
23 Dec 2019
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
138
193
0
23 Dec 2019
Variational Recurrent Models for Solving Partially Observable Control Tasks
Dongqi Han
Kenji Doya
Jun Tani
DRL
OffRL
72
63
0
23 Dec 2019
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
77
34
0
23 Dec 2019
Soft Q Network
Jingbin Liu
Shuai Liu
Xinyang Gu
OffRL
51
2
0
20 Dec 2019
Coordination in Adversarial Sequential Team Games via Multi-Agent Deep Reinforcement Learning
A. Celli
Marco Ciccone
Raffaele Bongo
N. Gatti
63
12
0
16 Dec 2019
To Follow or not to Follow: Selective Imitation Learning from Observations
Youngwoon Lee
E. Hu
Zhengyu Yang
Joseph J. Lim
67
15
0
16 Dec 2019
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Shuai Lu
Shuai Han
Wenbo Zhou
Junwei Zhang
72
26
0
13 Dec 2019
Zero-shot generalization using cascaded system-representations
A. Malik
OffRL
24
2
0
11 Dec 2019
Marginalized State Distribution Entropy Regularization in Policy Optimization
Riashat Islam
Zafarali Ahmed
Doina Precup
59
17
0
11 Dec 2019
Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning
Riashat Islam
Raihan Seraj
Samin Yeasar Arnob
Doina Precup
OffRL
87
3
0
11 Dec 2019
Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Riashat Islam
Raihan Seraj
Pierre-Luc Bacon
Doina Precup
58
8
0
11 Dec 2019
Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients
Brenden K. Petersen
Mikel Landajuela
T. Nathan Mundhenk
Claudio Santiago
Soo K. Kim
Joanne T. Kim
70
320
0
10 Dec 2019
Measuring the Reliability of Reinforcement Learning Algorithms
Stephanie C. Y. Chan
Sam Fishman
John F. Canny
Anoop Korattikara Balan
S. Guadarrama
74
84
0
10 Dec 2019
Efficient and Robust Reinforcement Learning with Uncertainty-based Value Expansion
Bo Zhou
Hongsheng Zeng
Fan Wang
Yunxiang Li
Hao Tian
59
18
0
10 Dec 2019
Combining Q-Learning and Search with Amortized Value Estimates
Jessica B. Hamrick
V. Bapst
Alvaro Sanchez-Gonzalez
Tobias Pfaff
T. Weber
Lars Buesing
Peter W. Battaglia
OffRL
87
48
0
05 Dec 2019
Inter-Level Cooperation in Hierarchical Reinforcement Learning
Abdul Rahman Kreidieh
Yiling You
Nathan Lichtlé
Samyak Parajuli
Rayyan Nasr
Alexandre M. Bayen
116
14
0
05 Dec 2019
AlgaeDICE: Policy Gradient from Arbitrary Experience
Ofir Nachum
Bo Dai
Ilya Kostrikov
Yinlam Chow
Lihong Li
Dale Schuurmans
OffRL
168
245
0
04 Dec 2019
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
241
1,378
0
03 Dec 2019
Human-Robot Collaboration via Deep Reinforcement Learning of Real-World Interactions
Jonas Tjomsland
A. Shafti
Aldo A. Faisal
50
6
0
02 Dec 2019
Flow Rate Control in Smart District Heating Systems Using Deep Reinforcement Learning
Tinghao Zhang
Jing Luo
Ping Chen
Jie Liu
AI4CE
47
5
0
01 Dec 2019
Distributed Soft Actor-Critic with Multivariate Reward Representation and Knowledge Distillation
Dmitry Akimov
29
10
0
29 Nov 2019
Simulation-based reinforcement learning for real-world autonomous driving
B. Osinski
Adam Jakubowski
Piotr Milos
Pawel Ziecina
Christopher Galias
S. Homoceanu
Henryk Michalewski
115
122
0
29 Nov 2019
Deep Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization
Qi Zhou
Houqiang Li
Jie Wang
75
17
0
28 Nov 2019
Multi-Vehicle Mixed-Reality Reinforcement Learning for Autonomous Multi-Lane Driving
Rupert Mitchell
Jenny Fletcher
Jacopo Panerati
Amanda Prorok
89
17
0
26 Nov 2019
The problem with DDPG: understanding failures in deterministic environments with sparse rewards
Guillaume Matheron
Nicolas Perrin
Olivier Sigaud
52
67
0
26 Nov 2019
Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints
Jingliang Duan
Zhengyu Liu
Shengbo Eben Li
Qi Sun
Zhenzhong Jia
B. Cheng
77
65
0
26 Nov 2019
Behavior Regularized Offline Reinforcement Learning
Yifan Wu
George Tucker
Ofir Nachum
OffRL
166
692
0
26 Nov 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Jianchao Tan
Zhuoran Yang
Tamer Basar
236
1,233
0
24 Nov 2019
Merging Deterministic Policy Gradient Estimations with Varied Bias-Variance Tradeoff for Effective Deep Reinforcement Learning
Gang Chen
75
4
0
24 Nov 2019
Which Channel to Ask My Question? Personalized Customer Service Request Stream Routing using Deep Reinforcement Learning
Zining Liu
Chong Long
Xiaolu Lu
Zehong Hu
Jie Zhang
Yafang Wang
45
9
0
24 Nov 2019
State Alignment-based Imitation Learning
Fangchen Liu
Z. Ling
Tongzhou Mu
Hao Su
79
93
0
21 Nov 2019
Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks
Vibhavari Dasagi
Robert Lee
Jake Bruce
Jurgen Leitner
OffRL
56
2
0
20 Nov 2019
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online
Yangchen Pan
Kirby Banman
Martha White
51
0
0
19 Nov 2019
Implicit Generative Modeling for Efficient Exploration
Neale Ratzlaff
Qinxun Bai
Fuxin Li
Wenyuan Xu
79
12
0
19 Nov 2019
IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks
Youngwoon Lee
E. Hu
Zhengyu Yang
Alexander Yin
Joseph J. Lim
105
124
0
17 Nov 2019
Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift
Riashat Islam
Komal K. Teru
Deepak Sharma
Joelle Pineau
OffRL
88
8
0
16 Nov 2019
Improved Exploration through Latent Trajectory Optimization in Deep Deterministic Policy Gradient
K. Luck
Mel Vecerík
Simon Stepputtis
H. B. Amor
Jonathan Scholz
39
11
0
15 Nov 2019
Data-efficient Co-Adaptation of Morphology and Behaviour with Deep Reinforcement Learning
K. Luck
H. B. Amor
Roberto Calandra
AI4CE
73
53
0
15 Nov 2019
Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using Proximal Policy Optimization
Eivind Bøhn
E. M. Coates
Signe Moe
T. Johansen
70
130
0
13 Nov 2019
Combinatorial Optimization by Graph Pointer Networks and Hierarchical Reinforcement Learning
Qiang Ma
Suwen Ge
Danyang He
D. Thaker
Iddo Drori
72
191
0
12 Nov 2019
Real-Time Reinforcement Learning
Simon Ramstedt
C. Pal
AI4CE
96
63
0
11 Nov 2019
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation
Shangtong Zhang
Bo Liu
Hengshuai Yao
Shimon Whiteson
OffRL
143
8
0
11 Nov 2019
Multi-Path Policy Optimization
L. Pan
Qingpeng Cai
Longbo Huang
73
2
0
11 Nov 2019
Context-aware Active Multi-Step Reinforcement Learning
Gang Chen
Dingcheng Li
Ran Xu
31
0
0
11 Nov 2019
H
∞
H_\infty
H
∞
Model-free Reinforcement Learning with Robust Stability Guarantee
Minghao Han
Yuan Tian
Lixian Zhang
Jun Wang
Wei Pan
70
26
0
07 Nov 2019
A Divergence Minimization Perspective on Imitation Learning Methods
Seyed Kamyar Seyed Ghasemipour
R. Zemel
S. Gu
91
251
0
06 Nov 2019
Previous
1
2
3
...
77
78
79
...
81
82
83
Next