Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 1,645 papers shown
Title
DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning
Bharathan Balaji
S. Mallya
Sahika Genc
Saurabh Gupta
Leo Dirac
...
Yunzhe Tao
Brian Townsend
E. Calleja
Sunil Muralidhara
Dhanasekar Karuppasamy
26
56
0
05 Nov 2019
Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation
Risto Vuorio
Shao-Hua Sun
Hexiang Hu
Joseph J. Lim
35
219
0
30 Oct 2019
Learning to Manipulate Deformable Objects without Demonstrations
Yilin Wu
Wilson Yan
Thanard Kurutach
Lerrel Pinto
Pieter Abbeel
OffRL
31
199
0
29 Oct 2019
Better Exploration with Optimistic Actor-Critic
K. Ciosek
Q. Vuong
R. Loftin
Katja Hofmann
29
149
0
28 Oct 2019
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen
Zijian Zhou
Zhengren Wang
Che Wang
Yanqiu Wu
Keith Ross
OffRL
35
121
0
27 Oct 2019
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Tianhe Yu
Deirdre Quillen
Zhanpeng He
Ryan Julian
Avnish Narayan
Hayden Shively
Adithya Bellathur
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
106
1,132
0
24 Oct 2019
A New Framework for Multi-Agent Reinforcement Learning -- Centralized Training and Exploration with Decentralized Execution via Policy Distillation
Gang Chen
22
40
0
21 Oct 2019
Teacher algorithms for curriculum learning of Deep RL in continuously parameterized environments
Rémy Portelas
Cédric Colas
Katja Hofmann
Pierre-Yves Oudeyer
32
144
0
16 Oct 2019
Soft Actor-Critic for Discrete Action Settings
Petros Christodoulou
OffRL
104
292
0
16 Oct 2019
Regularizing Model-Based Planning with Energy-Based Models
Rinu Boney
Arno Solin
Alexander Ilin
22
18
0
12 Oct 2019
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments
Vinicius G. Goecks
Gregory M. Gremillion
Vernon J. Lawhern
J. Valasek
Nicholas R. Waytowich
OffRL
22
31
0
09 Oct 2019
Receding Horizon Curiosity
M. Schultheis
Boris Belousov
Hany Abdulsamad
Jan Peters
25
15
0
08 Oct 2019
Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Xue Bin Peng
Aviral Kumar
Grace Zhang
Sergey Levine
OffRL
54
541
0
01 Oct 2019
Multi-Agent Actor-Critic with Hierarchical Graph Attention Network
Heechang Ryu
Hayong Shin
Jinkyoo Park
25
115
0
27 Sep 2019
RLBench: The Robot Learning Benchmark & Learning Environment
Stephen James
Z. Ma
David Rovick Arrojo
Andrew J. Davison
SSL
VLM
OffRL
59
532
0
26 Sep 2019
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
H. F. Song
A. Abdolmaleki
Jost Tobias Springenberg
Aidan Clark
Hubert Soyer
...
Dhruva Tirumala
N. Heess
Dan Belov
Martin Riedmiller
M. Botvinick
37
121
0
26 Sep 2019
Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt
Matteo Hessel
Karen Simonyan
OffRL
27
68
0
25 Sep 2019
Visualizing Movement Control Optimization Landscapes
Perttu Hämäläinen
Juuso Toikka
Amin Babadi
Karen Liu
27
7
0
17 Sep 2019
Model Based Planning with Energy Based Models
Yilun Du
Toru Lin
Igor Mordatch
22
37
0
15 Sep 2019
VILD: Variational Imitation Learning with Diverse-quality Demonstrations
Voot Tangkaratt
Bo Han
Mohammad Emtiyaz Khan
Masashi Sugiyama
25
20
0
15 Sep 2019
Countering the Effects of Lead Bias in News Summarization via Multi-Stage Training and Auxiliary Losses
Matt Grenander
Yue Dong
Jackie C.K. Cheung
Annie Louis
27
35
0
08 Sep 2019
rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch
Adam Stooke
Pieter Abbeel
OffRL
24
96
0
03 Sep 2019
Dynamics-aware Embeddings
William F. Whitney
Rajat Agarwal
Kyunghyun Cho
Abhinav Gupta
SSL
25
53
0
25 Aug 2019
Inverse Rational Control with Partially Observable Continuous Nonlinear Dynamics
Saurabh Daptardar
Paul Schrater
Xaq Pitkow
17
38
0
13 Aug 2019
A Review of Cooperative Multi-Agent Deep Reinforcement Learning
Afshin Oroojlooyjadid
Davood Hajinezhad
56
413
0
11 Aug 2019
Making Sense of Vision and Touch: Learning Multimodal Representations for Contact-Rich Tasks
Michelle A. Lee
Yuke Zhu
Peter Zachares
Matthew Tan
K. Srinivasan
Silvio Savarese
Fei-Fei Li
Animesh Garg
Jeannette Bohg
SSL
23
208
0
28 Jul 2019
A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment
Felix Leibfried
Sergio Pascual-Diaz
Jordi Grau-Moya
25
27
0
26 Jul 2019
Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges
Lei Lei
Yue Tan
Kan Zheng
Shiwen Liu
K. Zheng
Xuemin Shen
Shen
OffRL
26
202
0
22 Jul 2019
Characterizing Attacks on Deep Reinforcement Learning
Xinlei Pan
Chaowei Xiao
Warren He
Shuang Yang
Jian Peng
...
Jinfeng Yi
Zijiang Yang
Mingyan D. Liu
Yue Liu
D. Song
AAML
22
69
0
21 Jul 2019
Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery
Kristian Hartikainen
Xinyang Geng
Tuomas Haarnoja
Sergey Levine
SSL
43
74
0
18 Jul 2019
Deep Active Inference as Variational Policy Gradients
Beren Millidge
BDL
32
103
0
08 Jul 2019
Data Efficient Reinforcement Learning for Legged Robots
Yuxiang Yang
Ken Caluwaerts
Atil Iscen
Tingnan Zhang
Jie Tan
Vikas Sindhwani
33
139
0
08 Jul 2019
On-Policy Robot Imitation Learning from a Converging Supervisor
Ashwin Balakrishna
Brijen Thananjeyan
Jonathan Lee
Felix Li
Arsh Zahed
Joseph E. Gonzalez
Ken Goldberg
30
17
0
08 Jul 2019
Variational Inference MPC for Bayesian Model-based Reinforcement Learning
Masashi Okada
T. Taniguchi
43
73
0
08 Jul 2019
A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms
Oliver Kroemer
S. Niekum
George Konidaris
41
356
0
06 Jul 2019
Modified Actor-Critics
Erinc Merdivan
S. Hanke
M. Geist
24
2
0
02 Jul 2019
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
Alex X. Lee
Anusha Nagabandi
Pieter Abbeel
Sergey Levine
OffRL
BDL
36
372
0
01 Jul 2019
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy
Boyi Liu
Qi Cai
Zhuoran Yang
Zhaoran Wang
30
108
0
25 Jun 2019
Hierarchical Soft Actor-Critic: Adversarial Exploration via Mutual Information Optimization
Ari Azarafrooz
John Brock
11
3
0
17 Jun 2019
Is the Policy Gradient a Gradient?
Chris Nota
Philip S. Thomas
8
57
0
17 Jun 2019
Learning-Driven Exploration for Reinforcement Learning
Muhammad Usama
D. Chang
29
10
0
17 Jun 2019
Goal-conditioned Imitation Learning
Yiming Ding
Carlos Florensa
Mariano Phielipp
Pieter Abbeel
34
219
0
13 Jun 2019
Efficient Exploration via State Marginal Matching
Lisa Lee
Benjamin Eysenbach
Emilio Parisotto
Eric Xing
Sergey Levine
Ruslan Salakhutdinov
35
242
0
12 Jun 2019
Interactive Differentiable Simulation
Eric Heiden
David Millard
Hejia Zhang
Gaurav Sukhatme
OOD
AI4CE
PINN
8
50
0
26 May 2019
Composing Task-Agnostic Policies with Deep Reinforcement Learning
A. H. Qureshi
Jacob J. Johnson
Yuzhe Qin
Taylor Henderson
Byron Boots
Michael C. Yip
OffRL
22
30
0
25 May 2019
Adaptive Symmetric Reward Noising for Reinforcement Learning
R. Vivanti
Talya D. Sohlberg-Baris
Shlomo Cohen
Orna Cohen
AAML
21
1
0
24 May 2019
Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima
Qi Cai
Zhuoran Yang
Jason D. Lee
Zhaoran Wang
42
29
0
24 May 2019
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning
Rui Zhao
Xudong Sun
Volker Tresp
29
80
0
21 May 2019
A Regularized Opponent Model with Maximum Entropy Objective
Zheng Tian
Ying Wen
Zhichen Gong
Faiz Punakkath
Shihao Zou
Jun Wang
30
31
0
17 May 2019
Meta reinforcement learning as task inference
Jan Humplik
Alexandre Galashov
Leonard Hasenclever
Pedro A. Ortega
Yee Whye Teh
N. Heess
OffRL
41
127
0
15 May 2019
Previous
1
2
3
...
31
32
33
Next