Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 1,676 papers shown
Title
On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
Yiming Zhang
Keith Ross
OffRL
43
41
0
14 Jun 2021
Which Mutual-Information Representation Learning Objectives are Sufficient for Control?
Kate Rakelly
Abhishek Gupta
Carlos Florensa
Sergey Levine
SSL
28
39
0
14 Jun 2021
Temporal Predictive Coding For Model-Based Planning In Latent Space
Tung D. Nguyen
Rui Shu
Tu Pham
Hung Bui
Stefano Ermon
OffRL
36
57
0
14 Jun 2021
Characterizing the Gap Between Actor-Critic and Policy Gradient
Junfeng Wen
Saurabh Kumar
Ramki Gummadi
Dale Schuurmans
34
15
0
13 Jun 2021
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto
S. Gu
OffRL
60
790
0
12 Jun 2021
Offline Reinforcement Learning as Anti-Exploration
Shideh Rezaeifar
Robert Dadashi
Nino Vieillard
Léonard Hussenot
Olivier Bachem
Olivier Pietquin
M. Geist
OffRL
54
51
0
11 Jun 2021
Taylor Expansion of Discount Factors
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
34
5
0
11 Jun 2021
Bayesian Bellman Operators
M. Fellows
Kristian Hartikainen
Shimon Whiteson
OffRL
42
15
0
09 Jun 2021
Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation
Emmanuel Bengio
Moksh Jain
Maksym Korablyov
Doina Precup
Yoshua Bengio
49
312
0
08 Jun 2021
Learning Markov State Abstractions for Deep Reinforcement Learning
Cameron Allen
Neev Parikh
Omer Gottesman
George Konidaris
BDL
OffRL
42
36
0
08 Jun 2021
Exploration and preference satisfaction trade-off in reward-free learning
Noor Sajid
P. Tigas
Alexey Zakharov
Zafeirios Fountas
Karl J. Friston
27
20
0
08 Jun 2021
XIRL: Cross-embodiment Inverse Reinforcement Learning
Kevin Zakka
Andy Zeng
Peter R. Florence
Jonathan Tompson
Jeannette Bohg
Debidatta Dwibedi
SSL
45
119
0
07 Jun 2021
Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation
Evgenii Nikishin
Romina Abachi
Rishabh Agarwal
Pierre-Luc Bacon
OffRL
54
35
0
06 Jun 2021
SoftDICE for Imitation Learning: Rethinking Off-policy Distribution Matching
Min Sun
Anuj Mahajan
Katja Hofmann
Shimon Whiteson
OffRL
26
12
0
06 Jun 2021
Same State, Different Task: Continual Reinforcement Learning without Interference
Samuel Kessler
Jack Parker-Holder
Philip J. Ball
S. Zohren
Stephen J. Roberts
CLL
OffRL
21
46
0
05 Jun 2021
Heuristic-Guided Reinforcement Learning
Ching-An Cheng
Andrey Kolobov
Adith Swaminathan
OffRL
40
61
0
05 Jun 2021
MICo: Improved representations via sampling-based state similarity for Markov decision processes
Pablo Samuel Castro
Tyler Kastner
Prakash Panangaden
Mark Rowland
48
37
0
03 Jun 2021
On the Convergence Rate of Off-Policy Policy Optimization Methods with Density-Ratio Correction
Jiawei Huang
Nan Jiang
21
5
0
02 Jun 2021
An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning
Changnan Xiao
Haosen Shi
Jiajun Fan
Shihong Deng
26
5
0
01 Jun 2021
What Matters for Adversarial Imitation Learning?
Manu Orsini
Anton Raichuk
Léonard Hussenot
Damien Vincent
Robert Dadashi
Sertan Girgin
M. Geist
Olivier Bachem
Olivier Pietquin
Marcin Andrychowicz
55
77
0
01 Jun 2021
Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment
Tianze Zhou
Fubiao Zhang
Kun Shao
Kai Li
Wenhan Huang
...
Hangyu Mao
Bin Wang
Dong Li
Wulong Liu
Jianye Hao
37
16
0
01 Jun 2021
Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence
Wenhao Zhan
Shicong Cen
Baihe Huang
Yuxin Chen
Jason D. Lee
Yuejie Chi
32
76
0
24 May 2021
Continual World: A Robotic Benchmark For Continual Reinforcement Learning
Maciej Wołczyk
Michal Zajkac
Razvan Pascanu
Lukasz Kuciñski
Piotr Milo's
CLL
OffRL
21
89
0
23 May 2021
Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety
Haitong Ma
Yang Guan
Shegnbo Eben Li
Xiangteng Zhang
Sifa Zheng
Jianyu Chen
43
37
0
22 May 2021
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Yue Wu
Shuangfei Zhai
Nitish Srivastava
J. Susskind
Jian Zhang
Ruslan Salakhutdinov
Hanlin Goh
EDL
OffRL
OnRL
23
184
0
17 May 2021
Composable Energy Policies for Reactive Motion Generation and Reinforcement Learning
Julen Urain
Anqi Li
Puze Liu
Carlo DÉramo
Jan Peters
38
26
0
11 May 2021
Reinforcement learning of rare diffusive dynamics
Avishek Das
Dominic C. Rose
J. P. Garrahan
David T. Limmer
24
27
0
10 May 2021
Hierarchical Reinforcement Learning for Air-to-Air Combat
Adrian P. Pope
J. Ide
Daria Mićović
Henry Diaz
D. Rosenbluth
Lee Ritholtz
Jason C. Twedt
Thayne T. Walker
K. Alcedo
D. Javorsek
25
72
0
03 May 2021
Learning to drive from a world on rails
Di Chen
V. Koltun
Philipp Krahenbuhl
98
116
0
03 May 2021
Discovering Diverse Athletic Jumping Strategies
Zhiqi Yin
Zeshi Yang
M. van de Panne
KangKang Yin
53
46
0
02 May 2021
Development of a Soft Actor Critic Deep Reinforcement Learning Approach for Harnessing Energy Flexibility in a Large Office Building
Anjukan Kathirgamanathan
E. Mangina
D. Finn
AI4CE
24
38
0
25 Apr 2021
Safe Chance Constrained Reinforcement Learning for Batch Process Control
M. Mowbray
Panagiotis Petsagkourakis
Ehecatl Antonio del Rio Chanona
Dongda Zhang
OffRL
37
34
0
23 Apr 2021
Robust Biped Locomotion Using Deep Reinforcement Learning on Top of an Analytical Control Approach
Mohammadreza Kasaei
Miguel Abreu
N. Lau
Artur Pereira
Luis Paulo Reis
33
22
0
21 Apr 2021
Scalable Synthesis of Verified Controllers in Deep Reinforcement Learning
Zikang Xiong
Suresh Jagannathan
34
6
0
20 Apr 2021
MBRL-Lib: A Modular Library for Model-based Reinforcement Learning
Luis Pineda
Brandon Amos
Amy Zhang
Nathan Lambert
Roberto Calandra
OffRL
35
46
0
20 Apr 2021
Deep Reinforcement Learning in a Monetary Model
Mingli Chen
Andreas Joseph
Michael Kumhof
Xinlei Pan
Xuan Zhou
11
23
0
19 Apr 2021
Auto-Tuned Sim-to-Real Transfer
Yuqing Du
Olivia Watkins
Trevor Darrell
Pieter Abbeel
Deepak Pathak
27
69
0
15 Apr 2021
Human-in-the-Loop Deep Reinforcement Learning with Application to Autonomous Driving
Jingda Wu
Zhiyu Huang
Chao Huang
Zhongxu Hu
Peng Hang
Yang Xing
Chen Lv
42
41
0
15 Apr 2021
Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement Learning
Yuan Pu
Shaochen Wang
Rui Yang
Xin Yao
Bin Li
24
19
0
14 Apr 2021
GEM: Group Enhanced Model for Learning Dynamical Control Systems
Philippe Hansen-Estruch
Wenling Shang
Lerrel Pinto
Pieter Abbeel
Stas Tiomkin
AI4CE
38
2
0
07 Apr 2021
Benchmarks for Deep Off-Policy Evaluation
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELM
OffRL
35
100
0
30 Mar 2021
Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with Deep Reinforcement Learning
A. S. Morgan
Daljeet Nandha
Georgia Chalvatzaki
Carlo DÉramo
A. Dollar
Jan Peters
48
43
0
25 Mar 2021
Self-Imitation Learning by Planning
Junhyuk Oh
Yijie Guo
Satinder Singh
SSL
35
85
0
25 Mar 2021
CLAMGen: Closed-Loop Arm Motion Generation via Multi-view Vision-Based RL
Iretiayo Akinola
Zizhao Wang
Peter K. Allen
42
2
0
24 Mar 2021
Deep Reinforcement Learning for Mapless Navigation of a Hybrid Aerial Underwater Vehicle with Medium Transition
Ricardo B. Grando
J. C. Jesus
V. A. Kich
A. H. Kolling
Nicolas P. Bortoluzzi
P. Pinheiro
A. A. Neto
Paulo L. J. Drews-Jr
27
39
0
23 Mar 2021
Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
41
50
0
23 Mar 2021
Learning to Robustly Negotiate Bi-Directional Lane Usage in High-Conflict Driving Scenarios
Christoph Killing
Adam R. Villaflor
John M. Dolan
38
5
0
22 Mar 2021
Efficient Deep Reinforcement Learning with Imitative Expert Priors for Autonomous Driving
Zhiyu Huang
Jingda Wu
Chen Lv
24
133
0
19 Mar 2021
A Self-adaptive SAC-PID Control Approach based on Reinforcement Learning for Mobile Robots
Xinyi Yu
Yu Fan
Siyu Xu
L. Ou
32
32
0
19 Mar 2021
Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning
Sebastian Curi
Ilija Bogunovic
Andreas Krause
44
17
0
18 Mar 2021
Previous
1
2
3
...
25
26
27
...
32
33
34
Next