Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 1,658 papers shown
Title
Softmax Deep Double Deterministic Policy Gradients
Ling Pan
Qingpeng Cai
Longbo Huang
72
86
0
19 Oct 2020
Robot Navigation in Constrained Pedestrian Environments using Reinforcement Learning
Claudia Pérez-DÁrpino
Can Liu
P. Goebel
Roberto Martín-Martín
Silvio Savarese
39
65
0
16 Oct 2020
Learning Dexterous Manipulation from Suboptimal Experts
Rae Jeong
Jost Tobias Springenberg
Jackie Kay
Daniel Zheng
Yuxiang Zhou
Alexandre Galashov
N. Heess
F. Nori
OffRL
18
36
0
16 Oct 2020
Masked Contrastive Representation Learning for Reinforcement Learning
Jinhua Zhu
Yingce Xia
Lijun Wu
Jiajun Deng
Wen-gang Zhou
Tao Qin
Houqiang Li
SSL
OffRL
34
55
0
15 Oct 2020
Human-centric Dialog Training via Offline Reinforcement Learning
Natasha Jaques
J. Shen
Asma Ghandeharioun
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
40
93
0
12 Oct 2020
EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models
Cédric Colas
B. Hejblum
S. Rouillon
R. Thiébaut
Pierre-Yves Oudeyer
Clément Moulin-Frier
M. Prague
24
22
0
09 Oct 2020
CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning
Ossama Ahmed
Frederik Trauble
Anirudh Goyal
Alexander Neitz
Yoshua Bengio
Bernhard Schölkopf
M. Wuthrich
Stefan Bauer
CML
42
120
0
08 Oct 2020
Learning Intrinsic Symbolic Rewards in Reinforcement Learning
Hassam Sheikh
Shauharda Khadka
Santiago Miret
Somdeb Majumdar
OffRL
29
7
0
08 Oct 2020
Reinforcement Learning for Many-Body Ground-State Preparation Inspired by Counterdiabatic Driving
Jiahao Yao
Lin Lin
Marin Bukov
BDL
AI4CE
40
61
0
07 Oct 2020
Reinforcement Learning with Random Delays
Simon Ramstedt
Yann Bouteiller
Giovanni Beltrame
C. Pal
Jonathan Binas
135
59
0
06 Oct 2020
FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning
Honghao Wei
Lei Ying
25
7
0
04 Oct 2020
FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization
Lanqing Li
Rui Yang
Dijun Luo
OffRL
33
10
0
02 Oct 2020
Learning to swim in potential flow
Yusheng Jiao
Feng Ling
Sina Heydari
N. Heess
J. Merel
Eva Kanso
13
40
0
30 Sep 2020
Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning
Haotian Fu
Hongyao Tang
Jianye Hao
Chong Chen
Xidong Feng
Dong Li
Wulong Liu
OffRL
37
51
0
29 Sep 2020
A Centralised Soft Actor Critic Deep Reinforcement Learning Approach to District Demand Side Management through CityLearn
Anjukan Kathirgamanathan
Kacper Twardowski
E. Mangina
D. Finn
8
21
0
22 Sep 2020
Dynamic Horizon Value Estimation for Model-based Reinforcement Learning
Junjie Wang
Qichao Zhang
Dongbin Zhao
Mengchen Zhao
Jianye Hao
OffRL
32
5
0
21 Sep 2020
Lyapunov-Based Reinforcement Learning for Decentralized Multi-Agent Control
Qingrui Zhang
Hao Dong
Wei Pan
29
6
0
20 Sep 2020
Deep Reinforcement Learning for Closed-Loop Blood Glucose Control
Ian Fox
Joyce M. Lee
R. Pop-Busui
Jenna Wiens
BDL
OffRL
30
50
0
18 Sep 2020
GRAC: Self-Guided and Self-Regularized Actor-Critic
Lin Shao
Yifan You
Mengyuan Yan
Qingyun Sun
Jeannette Bohg
29
23
0
18 Sep 2020
Elastica: A compliant mechanics environment for soft robotic control
Noel M. Naughton
Jiarui Sun
Arman Tekinalp
Girish Chowdhary
M. Gazzola
14
86
0
17 Sep 2020
Multimodal Safety-Critical Scenarios Generation for Decision-Making Algorithms Evaluation
Wenhao Ding
Baiming Chen
Yue Liu
Kim Ji Eun
Ding Zhao
AAML
16
100
0
16 Sep 2020
Meta-AAD: Active Anomaly Detection with Deep Reinforcement Learning
Daochen Zha
Kwei-Herng Lai
Mingyang Wan
X. Hu
27
52
0
16 Sep 2020
Decoupling Representation Learning from Reinforcement Learning
Adam Stooke
Kimin Lee
Pieter Abbeel
Michael Laskin
SSL
DRL
288
341
0
14 Sep 2020
Multi-Task Learning with Deep Neural Networks: A Survey
M. Crawshaw
CVBM
58
610
0
10 Sep 2020
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning
Sheng-Chun Kao
Geonhwa Jeong
T. Krishna
28
95
0
04 Sep 2020
Nonholonomic Yaw Control of an Underactuated Flying Robot with Model-based Reinforcement Learning
Nathan Lambert
Craig B. Schindler
Daniel S. Drew
K. Pister
22
6
0
02 Sep 2020
Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics
Yanchao Sun
Da Huo
Furong Huang
AAML
OffRL
OnRL
37
49
0
02 Sep 2020
Adaptive and Multiple Time-scale Eligibility Traces for Online Deep Reinforcement Learning
Taisuke Kobayashi
OffRL
18
7
0
23 Aug 2020
Super-Human Performance in Gran Turismo Sport Using Deep Reinforcement Learning
Florian Fuchs
Yunlong Song
Elia Kaufmann
Davide Scaramuzza
Peter Dürr
26
124
0
18 Aug 2020
Linear Disentangled Representations and Unsupervised Action Estimation
Matthew Painter
Jonathon S. Hare
Adam Prugel-Bennett
CoGe
DRL
39
20
0
18 Aug 2020
Offline Meta-Reinforcement Learning with Advantage Weighting
E. Mitchell
Rafael Rafailov
Xue Bin Peng
Sergey Levine
Chelsea Finn
OffRL
38
104
0
13 Aug 2020
TriFinger: An Open-Source Robot for Learning Dexterity
Manuel Wüthrich
Felix Widmaier
F. Grimminger
J. Akpo
S. Joshi
...
Julian Viereck
M. Naveau
Ludovic Righetti
Bernhard Schölkopf
Stefan Bauer
29
72
0
08 Aug 2020
SafePILCO: a software tool for safe and data-efficient policy synthesis
Kyriakos Polymenakos
Nikitas Rontsis
Alessandro Abate
Stephen J. Roberts
32
6
0
07 Aug 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
26
42
0
02 Aug 2020
Towards Deep Robot Learning with Optimizer applicable to Non-stationary Problems
Taisuke Kobayashi
ODL
20
9
0
31 Jul 2020
Queueing Network Controls via Deep Reinforcement Learning
J. Dai
Mark O. Gluzman
OffRL
32
50
0
31 Jul 2020
Understanding the Stability of Deep Control Policies for Biped Locomotion
Hwangpil Park
R. Yu
Yoonsang Lee
Kyungho Lee
Jehee Lee
22
9
0
30 Jul 2020
Modular Transfer Learning with Transition Mismatch Compensation for Excessive Disturbance Rejection
Tianming Wang
Wenjie Lu
H. Yu
Dikai Liu
43
1
0
29 Jul 2020
Learning Object-conditioned Exploration using Distributed Soft Actor Critic
Ayzaan Wahid
Austin Stone
Kevin Chen
Brian Ichter
Alexander Toshev
27
22
0
29 Jul 2020
Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies
Shengpu Tang
Aditya Modi
Michael Sjoding
Jenna Wiens
OffRL
22
25
0
24 Jul 2020
Maximum Mutation Reinforcement Learning for Scalable Control
Karush Suri
Xiaolong Shi
Konstantinos N. Plataniotis
Y. Lawryshyn
25
4
0
24 Jul 2020
Off-Policy Multi-Agent Decomposed Policy Gradients
Yihan Wang
Beining Han
Tonghan Wang
Heng Dong
Chongjie Zhang
35
175
0
24 Jul 2020
Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search
Yuan Tian
Qin Wang
Zhiwu Huang
Wen Li
Dengxin Dai
Minghao Yang
Jun Wang
Olga Fink
OffRL
24
60
0
17 Jul 2020
Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity
Kai Zhang
Sham Kakade
Tamer Bacsar
Lin F. Yang
52
120
0
15 Jul 2020
Efficient Empowerment Estimation for Unsupervised Stabilization
Ruihan Zhao
Kevin Lu
Pieter Abbeel
Stas Tiomkin
32
8
0
14 Jul 2020
Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning
Shauharda Khadka
Estelle Aflalo
Mattias Marder
Avrech Ben-David
Santiago Miret
Shie Mannor
Tamir Hazan
Hanlin Tang
Somdeb Majumdar
GNN
32
11
0
14 Jul 2020
Control as Hybrid Inference
Alexander Tschantz
Beren Millidge
A. Seth
Christopher L. Buckley
21
9
0
11 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
25
199
0
09 Jul 2020
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate
Mirco Mutti
Lorenzo Pratissoli
Marcello Restelli
11
19
0
09 Jul 2020
Natural Emergence of Heterogeneous Strategies in Artificially Intelligent Competitive Teams
A. Deka
Katia Sycara
AAML
31
32
0
06 Jul 2020
Previous
1
2
3
...
28
29
30
...
32
33
34
Next