1801.01290
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 1,645 papers shown
Title
Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework
Amber Srivastava
S. Salapaka
22
11
0
17 Jun 2020
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL
OnRL
46
592
0
16 Jun 2020
Model Embedding Model-Based Reinforcement Learning
Xiao Tan
Chao Qu
Junwu Xiong
James Y. Zhang
OffRL
24
0
0
16 Jun 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
38
82
0
15 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
18
18
0
14 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
33
24
0
12 Jun 2020
Meta-Reinforcement Learning Robust to Distributional Shift via Model Identification and Experience Relabeling
Russell Mendonca
Xinyang Geng
Chelsea Finn
Sergey Levine
OOD
OffRL
32
41
0
12 Jun 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
Marcin Andrychowicz
Anton Raichuk
Piotr Stańczyk
Manu Orsini
Sertan Girgin
...
M. Geist
Olivier Pietquin
Marcin Michalski
Sylvain Gelly
Olivier Bachem
OffRL
31
214
0
10 Jun 2020
Variational Model-based Policy Optimization
Yinlam Chow
Brandon Cui
Moonkyung Ryu
Mohammad Ghavamzadeh
OffRL
25
12
0
09 Jun 2020
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization without Compounding Errors
Chi Zhang
S. Kuppannagari
Viktor Prasanna
22
4
0
08 Jun 2020
Primal Wasserstein Imitation Learning
Robert Dadashi
Léonard Hussenot
M. Geist
Olivier Pietquin
26
124
0
08 Jun 2020
Deep Reinforcement Learning for Human-Like Driving Policies in Collision Avoidance Tasks of Self-Driving Cars
Ran Emuna
A. Borowsky
Armin Biess
42
22
0
07 Jun 2020
Skill Discovery of Coordination in Multi-agent Reinforcement Learning
Shuncheng He
Jianzhun Shao
Xiangyang Ji
26
7
0
07 Jun 2020
State Action Separable Reinforcement Learning
Ziyao Zhang
Liang Ma
K. Leung
Konstantinos Poularakis
Mudhakar Srivatsa
31
2
0
05 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
65
225
0
01 Jun 2020
Sim2Real for Peg-Hole Insertion with Eye-in-Hand Camera
Damian Bogunowicz
A. Rybnikov
Komal Vendidandi
Fedor Chervinskii
25
7
0
29 May 2020
Gradient Monitored Reinforcement Learning
Mohammed Sharafath Abdul Hameed
Gavneet Singh Chadha
Andreas Schwung
S. Ding
33
10
0
25 May 2020
LEAF: Latent Exploration Along the Frontier
Homanga Bharadhwaj
Animesh Garg
Florian Shkurti
29
1
0
21 May 2020
Guided Uncertainty-Aware Policy Optimization: Combining Learning and Model-Based Strategies for Sample-Efficient Policy Learning
Michelle A. Lee
Carlos Florensa
Jonathan Tremblay
Nathan D. Ratliff
Animesh Garg
Fabio Ramos
Dieter Fox
23
60
0
21 May 2020
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
24
13
0
21 May 2020
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks
Haotian Liu
Wenchuan Wu
OffRL
19
95
0
20 May 2020
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
25
83
0
20 May 2020
Model-Augmented Actor-Critic: Backpropagating through Paths
I. Clavera
Yao Fu
Pieter Abbeel
44
87
0
16 May 2020
A Distributional View on Multi-Objective Policy Optimization
A. Abdolmaleki
Sandy H. Huang
Leonard Hasenclever
Michael Neunert
H. F. Song
Martina Zambelli
M. Martins
N. Heess
R. Hadsell
Martin Riedmiller
26
74
0
15 May 2020
On the Global Convergence Rates of Softmax Policy Gradient Methods
Jincheng Mei
Chenjun Xiao
Csaba Szepesvári
Dale Schuurmans
47
276
0
13 May 2020
Smooth Exploration for Robotic Reinforcement Learning
Antonin Raffin
Jens Kober
F. Stulp
32
57
0
12 May 2020
Delay-Aware Model-Based Reinforcement Learning for Continuous Control
Baiming Chen
Mengdi Xu
Liang-Sheng Li
Ding Zhao
OffRL
42
63
0
11 May 2020
Plan2Vec: Unsupervised Representation Learning by Latent Plans
Ge Yang
Amy Zhang
Ari S. Morcos
Joelle Pineau
Pieter Abbeel
Roberto Calandra
SSL
OffRL
28
27
0
07 May 2020
Reinforcement Learning with Augmented Data
Michael Laskin
Kimin Lee
Adam Stooke
Lerrel Pinto
Pieter Abbeel
A. Srinivas
OffRL
20
648
0
30 Apr 2020
A Perspective on Deep Learning for Molecular Modeling and Simulations
Jun Zhang
Yao-Kun Lei
Zhen Zhang
Junhan Chang
Maodong Li
Xu Han
Lijiang Yang
Yue Yang
Y. Gao
AI4CE
42
8
0
25 Apr 2020
Self-Paced Deep Reinforcement Learning
Pascal Klink
Carlo D'Eramo
Jan Peters
Joni Pajarinen
ODL
45
54
0
24 Apr 2020
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Shangtong Zhang
Bo Liu
Shimon Whiteson
29
38
0
22 Apr 2020
A Comprehensive Overview and Survey of Recent Advances in Meta-Learning
Huimin Peng
VLM
OffRL
31
35
0
17 Apr 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
135
1,318
0
15 Apr 2020
Reinforcement Learning Approach to Vibration Compensation for Dynamic Feed Drive Systems
Ralf Gulde
Marc Tuscher
A. Csiszar
O. Riedel
A. Verl
AI4CE
11
1
0
14 Apr 2020
Certifiable Robustness to Adversarial State Uncertainty in Deep Reinforcement Learning
Michael Everett
Bjorn Lutjens
Jonathan P. How
AAML
23
41
0
11 Apr 2020
Meta-Learning in Neural Networks: A Survey
Timothy M. Hospedales
Antreas Antoniou
P. Micaelli
Amos Storkey
OOD
106
1,939
0
11 Apr 2020
Weakly-Supervised Reinforcement Learning for Controllable Behavior
Lisa Lee
Benjamin Eysenbach
Ruslan Salakhutdinov
S. Gu
Chelsea Finn
SSL
24
26
0
06 Apr 2020
Action Space Shaping in Deep Reinforcement Learning
Anssi Kanervisto
Christian Scheller
Ville Hautamaki
32
80
0
02 Apr 2020
On the Feedback Law in Stochastic Optimal Nonlinear Control
M. Mohamed
S. Chakravorty
R. Goyal
Ran A. Wang
14
5
0
01 Apr 2020
Leverage the Average: an Analysis of KL Regularization in RL
Nino Vieillard
Tadashi Kozuno
B. Scherrer
Olivier Pietquin
Rémi Munos
M. Geist
27
43
0
31 Mar 2020
Regularizing Class-wise Predictions via Self-knowledge Distillation
Sukmin Yun
Jongjin Park
Kimin Lee
Jinwoo Shin
29
276
0
31 Mar 2020
Model-Reference Reinforcement Learning Control of Autonomous Surface Vehicles with Uncertainties
Qingrui Zhang
Wei Pan
V. Reppa
14
21
0
30 Mar 2020
Multi-Task Reinforcement Learning with Soft Modularization
Ruihan Yang
Huazhe Xu
Yi Wu
Xiaolong Wang
27
177
0
30 Mar 2020
Learning Dense Visual Correspondences in Simulation to Smooth and Fold Real Fabrics
Aditya Ganapathi
Priya Sundaresan
Brijen Thananjeyan
Ashwin Balakrishna
Daniel Seita
...
Joseph E. Gonzalez
Nawid Jamali
K. Yamane
Soshi Iba
Ken Goldberg
19
27
0
28 Mar 2020
Towards Safer Self-Driving Through Great PAIN (Physically Adversarial Intelligent Networks)
Piyush B. Gupta
Demetris Coleman
J. Siegel
AAML
29
16
0
24 Mar 2020
Learning to Fly via Deep Model-Based Reinforcement Learning
Philip Becker-Ehmck
Maximilian Karl
Jan Peters
Patrick van der Smagt
SSL
44
37
0
19 Mar 2020
SAPIEN: A SimulAted Part-based Interactive ENvironment
Fanbo Xiang
Yuzhe Qin
Kaichun Mo
Yikuan Xia
Hao Zhu
...
He Wang
Li Yi
Angel X. Chang
Leonidas J. Guibas
Hao Su
223
488
0
19 Mar 2020
Self-Supervised Discovering of Interpretable Features for Reinforcement Learning
Wenjie Shi
Gao Huang
Shiji Song
Zhuoyuan Wang
Tingyu Lin
Cheng Wu
SSL
28
18
0
16 Mar 2020
Active Perception and Representation for Robotic Manipulation
Youssef Y. Zaky
Gaurav Paruthi
B. Tripp
James Bergstra
28
16
0
15 Mar 2020