arXiv:1801.01290
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 1,676 papers shown
Title
Integrated Decision and Control: Towards Interpretable and Computationally Efficient Driving Intelligence
Yang Guan
Yangang Ren
Qi Sun
Shengbo Eben Li
Haitong Ma
Jingliang Duan
Yifan Dai
B. Cheng
18
66
0
18 Mar 2021
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
Clément Romac
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
29
21
0
17 Mar 2021
Regularized Behavior Value Estimation
Çağlar Gülçehre
Sergio Gomez Colmenarejo
Ziyun Wang
Jakub Sygnowski
T. Paine
Konrad Zolna
Yutian Chen
Matthew W. Hoffman
Razvan Pascanu
Nando de Freitas
OffRL
31
37
0
17 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
23
17
0
15 Mar 2021
A Whole Brain Probabilistic Generative Model: Toward Realizing Cognitive Architectures for Developmental Robots
T. Taniguchi
Hiroshi Yamakawa
Takayuki Nagai
Kenji Doya
M. Sakagami
Masahiro Suzuki
Tomoaki Nakamura
Akira Taniguchi
33
23
0
15 Mar 2021
Gym-ANM: Reinforcement Learning Environments for Active Network Management Tasks in Electricity Distribution Systems
Robin Henry
D. Ernst
27
34
0
14 Mar 2021
Modelling Behavioural Diversity for Learning in Open-Ended Games
Nicolas Perez Nieves
Yaodong Yang
Oliver Slumbers
D. Mguni
Ying Wen
Jun Wang
27
67
0
14 Mar 2021
Robust High-speed Running for Quadruped Robots via Deep Reinforcement Learning
Guillaume Bellegarda
Yiyu Chen
Zhuochen Liu
Quan Nguyen
39
44
0
11 Mar 2021
S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning
Samarth Sinha
Ajay Mandlekar
Animesh Garg
OffRL
26
107
0
10 Mar 2021
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Benjamin Eysenbach
Sergey Levine
OOD
50
176
0
10 Mar 2021
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning
Dilip Arumugam
Peter Henderson
Pierre-Luc Bacon
24
17
0
10 Mar 2021
Latent Imagination Facilitates Zero-Shot Transfer in Autonomous Racing
Axel Brunnbauer
Luigi Berducci
Andreas Brandstätter
Mathias Lechner
Ramin Hasani
Daniela Rus
Radu Grosu
LM&Ro
40
38
0
08 Mar 2021
Behavior From the Void: Unsupervised Active Pre-Training
Hao Liu
Pieter Abbeel
VLM
SSL
46
195
0
08 Mar 2021
Self-Supervised Online Reward Shaping in Sparse-Reward Environments
F. Memarian
Wonjoon Goo
Rudolf Lioutikov
S. Niekum
Ufuk Topcu
OffRL
39
48
0
08 Mar 2021
Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap
Gokul Swamy
Sanjiban Choudhury
J. Andrew Bagnell
Steven Wu
22
73
0
04 Mar 2021
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings
Lili Chen
Kimin Lee
A. Srinivas
Pieter Abbeel
OffRL
24
11
0
04 Mar 2021
Inverse Reinforcement Learning with Explicit Policy Estimates
Navyata Sanghvi
Shinnosuke Usami
Mohit Sharma
J. Groeger
Kris Kitani
CML
31
6
0
04 Mar 2021
Offline Reinforcement Learning with Pseudometric Learning
Robert Dadashi
Shideh Rezaeifar
Nino Vieillard
Léonard Hussenot
Olivier Pietquin
M. Geist
OffRL
39
40
0
02 Mar 2021
Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph
Xin Ye
Yezhou Yang
32
23
0
01 Mar 2021
Continuous control of an underground loader using deep reinforcement learning
Sofi Backman
Daniel M. Lindmark
K. Bodin
Martin Servin
Joakim Mörk
Håkan Löfgren
46
33
0
01 Mar 2021
Off-Policy Imitation Learning from Observations
Zhuangdi Zhu
Kaixiang Lin
Bo Dai
Jiayu Zhou
OffRL
29
86
0
25 Feb 2021
Task-Agnostic Morphology Evolution
D. Hejna
Pieter Abbeel
Lerrel Pinto
35
26
0
25 Feb 2021
Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret
Asaf B. Cassel
Tomer Koren
OffRL
36
17
0
25 Feb 2021
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Tengyu Xu
Zhuoran Yang
Zhaoran Wang
Yingbin Liang
OffRL
49
24
0
23 Feb 2021
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning
Xianyuan Zhan
Haoran Xu
Yueying Zhang
Xiangyu Zhu
Honglei Yin
Yu Zheng
OffRL
AI4CE
45
68
0
23 Feb 2021
Program Synthesis Guided Reinforcement Learning for Partially Observed Environments
Yichen Yang
J. Inala
Osbert Bastani
Yewen Pu
Armando Solar-Lezama
Martin Rinard
42
12
0
22 Feb 2021
Reinforcement Learning of the Prediction Horizon in Model Predictive Control
Eivind Bøhn
S. Gros
Signe Moe
T. Johansen
28
32
0
22 Feb 2021
Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization
Jyun-Li Lin
Wei-Ting Hung
Shangtong Yang
Ping-Chun Hsieh
Xi Liu
40
14
0
22 Feb 2021
Return-Based Contrastive Representation Learning for Reinforcement Learning
Guoqing Liu
Chuheng Zhang
Li Zhao
Tao Qin
Jinhua Zhu
Jian Li
Nenghai Yu
Tie-Yan Liu
SSL
OffRL
21
47
0
22 Feb 2021
Dealing with Non-Stationarity in MARL via Trust-Region Decomposition
Wenhao Li
Xiangfeng Wang
Bo Jin
Junjie Sheng
H. Zha
36
7
0
21 Feb 2021
CKNet: A Convolutional Neural Network Based on Koopman Operator for Modeling Latent Dynamics from Pixels
Yongqian Xiao
Xin Xu
Yifei Shi
22
9
0
19 Feb 2021
Model-Invariant State Abstractions for Model-Based Reinforcement Learning
Manan Tomar
Amy Zhang
Roberto Calandra
Matthew E. Taylor
Joelle Pineau
27
24
0
19 Feb 2021
Separated Proportional-Integral Lagrangian for Chance Constrained Reinforcement Learning
Baiyu Peng
Yao Mu
Jingliang Duan
Yang Guan
Shengbo Eben Li
Jianyu Chen
55
19
0
17 Feb 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
225
419
0
16 Feb 2021
Training Larger Networks for Deep Reinforcement Learning
Keita Ota
Devesh K. Jha
Asako Kanezaki
OffRL
37
39
0
16 Feb 2021
Online Apprenticeship Learning
Lior Shani
Tom Zahavy
Shie Mannor
OffRL
31
25
0
13 Feb 2021
Generalizing Decision Making for Automated Driving with an Invariant Environment Representation using Deep Reinforcement Learning
Karl Kurzer
Philip Schorner
Alexander Albers
Hauke Thomsen
Karam Daaboul
Johann Marius Zöllner
25
10
0
12 Feb 2021
Multi-Task Reinforcement Learning with Context-based Representations
Shagun Sodhani
Amy Zhang
Joelle Pineau
37
182
0
11 Feb 2021
Reverb: A Framework For Experience Replay
Albin Cassirer
Gabriel Barth-Maron
E. Brevdo
Sabela Ramos
Toby Boyd
Thibault Sottiaux
M. Kroiss
VLM
OffRL
32
38
0
09 Feb 2021
Tactical Optimism and Pessimism for Deep Reinforcement Learning
Theodore H. Moskovitz
Jack Parker-Holder
Aldo Pacchiano
Michael Arbel
Michael I. Jordan
27
55
0
07 Feb 2021
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned
Julian Ibarz
Jie Tan
Chelsea Finn
Mrinal Kalakrishnan
P. Pastor
Sergey Levine
OffRL
20
520
0
04 Feb 2021
Proactive and AoI-aware Failure Recovery for Stateful NFV-enabled Zero-Touch 6G Networks: Model-Free DRL Approach
Amirhossein Shaghaghi
Abolfazl Zakeri
Nader Mokari
M. Javan
M. Behdadfar
Eduard Axel Jorswieck
14
20
0
02 Feb 2021
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
Rongjun Qin
Songyi Gao
Xingyuan Zhang
Zhen Xu
Shengkai Huang
Zewen Li
Weinan Zhang
Yang Yu
OffRL
140
80
0
01 Feb 2021
Learning Skills to Navigate without a Master: A Sequential Multi-Policy Reinforcement Learning Algorithm
Ambedkar Dukkipati
Rajarshi Banerjee
Ranga Shaarad Ayyagari
Dhaval Parmar Udaybhai
29
6
0
30 Jan 2021
GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning
Juhyoung Lee
Sangyeob Kim
Sangjin Kim
Wooyoung Jo
H. Yoo
OffRL
29
9
0
24 Jan 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
William F. Whitney
Michael Bloesch
Jost Tobias Springenberg
A. Abdolmaleki
Kyunghyun Cho
Martin Riedmiller
OffRL
29
13
0
23 Jan 2021
Meta-Reinforcement Learning for Adaptive Motor Control in Changing Robot Dynamics and Environments
Timothée Anne
Jack Wilkinson
Zhibin Li
31
1
0
19 Jan 2021
Hierarchical Reinforcement Learning By Discovering Intrinsic Options
Jesse Zhang
Haonan Yu
Wenyuan Xu
BDL
135
82
0
16 Jan 2021
Video Summarization Using Deep Neural Networks: A Survey
Evlampios Apostolidis
E. Adamantidou
Alexandros I. Metsai
Vasileios Mezaris
Ioannis Patras
AI4TS
72
203
0
15 Jan 2021
Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning
Rishabh Agarwal
Marlos C. Machado
Pablo Samuel Castro
Marc G. Bellemare
OffRL
55
164
0
13 Jan 2021