1802.09477
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Lipschitzness Is All You Need To Tame Off-policy Generative Adversarial Imitation Learning
Lionel Blondé
Pablo Strasser
Alexandros Kalousis
90
22
0
28 Jun 2020
DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
39
4
0
26 Jun 2020
SOAC: The Soft Option Actor-Critic Architecture
Chenghao Li
Xiaoteng Ma
Chongjie Zhang
Jun Yang
L. Xia
Qianchuan Zhao
43
6
0
25 Jun 2020
Some approaches used to overcome overestimation in Deep Reinforcement Learning algorithms
Rafael Stekolshchik
OffRL
8
2
0
25 Jun 2020
Preventing Value Function Collapse in Ensemble Q-Learning by Maximizing Representation Diversity
Hassam Sheikh
Ladislau Bölöni
13
0
0
24 Jun 2020
Experience Replay with Likelihood-free Importance Weights
Samarth Sinha
Jiaming Song
Animesh Garg
Stefano Ermon
OffRL
102
58
0
23 Jun 2020
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng Meng
R. Gorbet
Dana Kulic
OffRL
76
27
0
23 Jun 2020
Towards Tractable Optimism in Model-Based Reinforcement Learning
Aldo Pacchiano
Philip J. Ball
Jack Parker-Holder
K. Choromanski
Stephen J. Roberts
OffRL
55
12
0
21 Jun 2020
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning
Tianren Zhang
Shangqi Guo
Tian Tan
Xiaolin Hu
Feng Chen
103
87
0
20 Jun 2020
Band-limited Soft Actor Critic Model
Miguel Campo
Zhengxing Chen
Luke Kung
Kittipat Virochsiri
Jianyu Wang
32
6
0
19 Jun 2020
A Reinforcement Learning Approach for Transient Control of Liquid Rocket Engines
Günther Waxenegger-Wilfing
Kai Dresia
J. Deeken
M. Oschwald
45
17
0
19 Jun 2020
WD3: Taming the Estimation Bias in Deep Reinforcement Learning
Qiang He
Xinwen Hou
OffRL
72
29
0
18 Jun 2020
Neural Ordinary Differential Equation Control of Dynamics on Graphs
Thomas Asikis
Lucas Böttcher
Nino Antulov-Fantulin
76
44
0
17 Jun 2020
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL
OnRL
146
619
0
16 Jun 2020
Model Embedding Model-Based Reinforcement Learning
Xiao Tan
Chao Qu
Junwu Xiong
James Y. Zhang
OffRL
32
0
0
16 Jun 2020
Parameter-Based Value Functions
Francesco Faccio
Louis Kirsch
Jürgen Schmidhuber
OffRL
93
26
0
16 Jun 2020
RL-CycleGAN: Reinforcement Learning Aware Simulation-To-Real
Kanishka Rao
Chris Harris
A. Irpan
Sergey Levine
Julian Ibarz
Mohi Khansari
130
191
0
16 Jun 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
130
85
0
15 Jun 2020
Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization
Thomas Pierrot
Valentin Macé
Félix Chalumeau
Arthur Flajolet
Geoffrey Cideron
Karim Beguir
Antoine Cully
Olivier Sigaud
Nicolas Perrin-Gilbert
110
62
0
15 Jun 2020
Optimistic Distributionally Robust Policy Optimization
Jun Song
Chaoyue Zhao
44
12
0
14 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
75
19
0
14 Jun 2020
Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
Yunhao Tang
K. Choromanski
OffRL
38
14
0
13 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
108
24
0
12 Jun 2020
Zeroth-order Deterministic Policy Gradient
Harshat Kumar
Dionysios S. Kalogerias
George J. Pappas
Alejandro Ribeiro
OffRL
33
14
0
12 Jun 2020
Decorrelated Double Q-learning
Gang Chen
33
2
0
12 Jun 2020
From proprioception to long-horizon planning in novel environments: A hierarchical RL model
Nishad Gothoskar
Miguel Lázaro-Gredilla
Dileep George
33
0
0
11 Jun 2020
Zeroth-Order Supervised Policy Improvement
Hao Sun
Ziping Xu
Yuhang Song
Meng Fang
Jiechao Xiong
Bo Dai
Bolei Zhou
OffRL
59
9
0
11 Jun 2020
AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation
Jae Hyun Lim
Aaron Courville
C. Pal
Chin-Wei Huang
DRL
71
23
0
09 Jun 2020
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
150
1,838
0
08 Jun 2020
Primal Wasserstein Imitation Learning
Robert Dadashi
Léonard Hussenot
Matthieu Geist
Olivier Pietquin
102
129
0
08 Jun 2020
Refined Continuous Control of DDPG Actors via Parametrised Activation
M. Hossny
Julie Iskander
Mohammed Attia
Khaled Saleh
39
7
0
04 Jun 2020
A Maximum Mutual Information Framework for Multi-Agent Reinforcement Learning
Woojun Kim
Whiyoung Jung
Myungsik Cho
Y. Sung
50
13
0
04 Jun 2020
Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration
Seungyul Han
Y. Sung
45
26
0
02 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
143
226
0
01 Jun 2020
Distributed Voltage Regulation of Active Distribution System Based on Enhanced Multi-agent Deep Reinforcement Learning
Di Cao
Junbo Zhao
Weihao Hu
F. Ding
Qi Huang
Zhe Chen
38
11
0
31 May 2020
MM-KTD: Multiple Model Kalman Temporal Differences for Reinforcement Learning
Parvin Malekzadeh
Mohammad Salimibeni
Arash Mohammadi
A. Assa
Konstantinos N. Plataniotis
OffRL
37
12
0
30 May 2020
Sim2Real for Peg-Hole Insertion with Eye-in-Hand Camera
Damian Bogunowicz
A. Rybnikov
Komal Vendidandi
Fedor Chervinskii
133
7
0
29 May 2020
MOPO: Model-based Offline Policy Optimization
Tianhe Yu
G. Thomas
Lantao Yu
Stefano Ermon
James Zou
Sergey Levine
Chelsea Finn
Tengyu Ma
OffRL
98
775
0
27 May 2020
A reinforcement learning approach to rare trajectory sampling
Dominic C. Rose
Jamie F. Mair
J. P. Garrahan
78
52
0
26 May 2020
Gradient Monitored Reinforcement Learning
Mohammed Sharafath Abdul Hameed
Gavneet Singh Chadha
Andreas Schwung
S. Ding
90
11
0
25 May 2020
LEAF: Latent Exploration Along the Frontier
Homanga Bharadhwaj
Animesh Garg
Florian Shkurti
55
1
0
21 May 2020
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
134
13
0
21 May 2020
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
136
87
0
20 May 2020
Entropy-Augmented Entropy-Regularized Reinforcement Learning and a Continuous Path from Policy Gradient to Q-Learning
Donghoon Lee
9
0
0
18 May 2020
Model-Augmented Actor-Critic: Backpropagating through Paths
I. Clavera
Yao Fu
Pieter Abbeel
91
88
0
16 May 2020
MOReL : Model-Based Offline Reinforcement Learning
Rahul Kidambi
Aravind Rajeswaran
Praneeth Netrapalli
Thorsten Joachims
OffRL
117
678
0
12 May 2020
Smooth Exploration for Robotic Reinforcement Learning
Antonin Raffin
Jens Kober
F. Stulp
81
58
0
12 May 2020
Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics
Arsenii Kuznetsov
Pavel Shvechikov
Alexander Grishin
Dmitry Vetrov
246
197
0
08 May 2020
Off-Policy Adversarial Inverse Reinforcement Learning
Samin Yeasar Arnob
31
12
0
03 May 2020
Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey
Ammar Haydari
Y. Yilmaz
AI4TS
107
468
0
02 May 2020