Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Eden: A Unified Environment Framework for Booming Reinforcement Learning Algorithms
Ruizhi Chen
Xiaoyu Wu
Yansong Pan
Kaizhao Yuan
Ling Li
...
Shaohui Peng
Xishan Zhang
Zidong Du
Qi Guo
Yunji Chen
OffRL
61
3
0
04 Sep 2021
Sensor-Based Navigation Using Hierarchical Reinforcement Learning
Christoph Gebauer
Nils Dengler
Maren Bennewitz
37
4
0
30 Aug 2021
Adaptive perturbation adversarial training: based on reinforcement learning
Zhi-pin Nie
Ying Lin
Sp Ren
Lan Zhang
AAML
35
1
0
30 Aug 2021
A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning
Tianchi Cai
Wenpeng Zhang
Lihong Gu
Xiaodong Zeng
Jinjie Gu
21
0
0
29 Aug 2021
Influence-Based Reinforcement Learning for Intrinsically-Motivated Agents
Ammar Fayad
M. Ibrahim
58
5
0
28 Aug 2021
WAD: A Deep Reinforcement Learning Agent for Urban Autonomous Driving
Arjit Sharma
Sahil Sharma
25
3
0
27 Aug 2021
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges
Jiaju Qi
Qihao Zhou
Lei Lei
Kan Zheng
FedML
111
159
0
26 Aug 2021
End-to-End Urban Driving by Imitating a Reinforcement Learning Coach
Zhejun Zhang
Alexander Liniger
Dengxin Dai
Feng Yu
Luc Van Gool
116
211
0
18 Aug 2021
Implicitly Regularized RL with Implicit Q-Values
Nino Vieillard
Marcin Andrychowicz
Anton Raichuk
Olivier Pietquin
Matthieu Geist
OffRL
71
9
0
16 Aug 2021
Optimal Actor-Critic Policy with Optimized Training Datasets
C. Banerjee
Zhiyong Chen
N. Noman
M. Zamani
OffRL
66
7
0
16 Aug 2021
Deep Reinforcement Learning for Demand Driven Services in Logistics and Transportation Systems: A Survey
Zefang Zong
Tao Feng
Tong Xia
Depeng Jin
Yong Li
50
3
0
10 Aug 2021
A Study on Dense and Sparse (Visual) Rewards in Robot Policy Learning
A. Mohtasib
Gerhard Neumann
Heriberto Cuayáhuitl
57
16
0
06 Aug 2021
Distilling Neuron Spike with High Temperature in Reinforcement Learning Agents
Ling Zhang
Jian Cao
Yuan Zhang
Bohan Zhou
Shuo Feng
46
9
0
05 Aug 2021
Policy Gradients Incorporating the Future
David Venuto
Elaine Lau
Doina Precup
Ofir Nachum
OffRL
97
9
0
04 Aug 2021
A Pragmatic Look at Deep Imitation Learning
Kai Arulkumaran
D. Lillrank
59
9
0
04 Aug 2021
Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations
Yuping Luo
Tengyu Ma
OffRL
101
44
0
04 Aug 2021
Deep Reinforcement Learning Based Networked Control with Network Delays for Signal Temporal Logic Specifications
Junya Ikemoto
T. Ushio
76
3
0
03 Aug 2021
Variational Actor-Critic Algorithms
Yuhua Zhu
Lexing Ying
OffRL
37
0
0
03 Aug 2021
Sequoia: A Software Framework to Unify Continual Learning Research
Fabrice Normandin
Florian Golemo
O. Ostapenko
Pau Rodríguez López
Matthew D Riemer
...
Dominic Zhao
Timothée Lesort
Laurent Charlin
Irina Rish
Massimo Caccia
CLL
94
21
0
02 Aug 2021
Physics-informed Dyna-Style Model-Based Deep Reinforcement Learning for Dynamic Control
Xin-Yang Liu
Jian-Xun Wang
AI4CE
103
41
0
31 Jul 2021
ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale Demonstrations
Tongzhou Mu
Z. Ling
Fanbo Xiang
Derek Yang
Xuanlin Li
Stone Tao
Zhiao Huang
Zhiwei Jia
Hao Su
156
138
0
30 Jul 2021
Tianshou: a Highly Modularized Deep Reinforcement Learning Library
Jiayi Weng
Huayu Chen
Dong Yan
Kaichao You
Alexis Duburcq
Minghao Zhang
Yi Su
Hang Su
Jun Zhu
NoLa
OffRL
114
204
0
29 Jul 2021
Value-Based Reinforcement Learning for Continuous Control Robotic Manipulation in Multi-Task Sparse Reward Settings
Sreehari Rammohan
Shangqun Yu
Bowen He
Eric Hsiung
Eric Rosen
Stefanie Tellex
George Konidaris
OffRL
18
4
0
28 Jul 2021
Reinforcement Learning with Formal Performance Metrics for Quadcopter Attitude Control under Non-nominal Contexts
Nicola Bernini
M. Bessa
R. Delmas
A. Gold
Eric Goubault
R. Pennec
S. Putot
Franccois Sillion
49
7
0
27 Jul 2021
Trajectory Design for UAV-Based Internet-of-Things Data Collection: A Deep Reinforcement Learning Approach
Yang Wang
Zhen Gao
Jun Zhang
Xianbin Cao
Dezhi Zheng
Yue Gao
Derrick Wing Kwan Ng
M. Di Renzo
70
99
0
23 Jul 2021
Accelerating Quadratic Optimization with Reinforcement Learning
Jeffrey Ichnowski
Paras Jain
Bartolomeo Stellato
G. Banjac
Michael Luo
Francesco Borrelli
Joseph E. Gonzalez
Ion Stoica
Ken Goldberg
OffRL
85
36
0
22 Jul 2021
DRIVE: Deep Reinforced Accident Anticipation with Visual Explanation
Wentao Bao
Qi Yu
Yu Kong
FAtt
75
41
0
21 Jul 2021
Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics
Krishan Rana
Vibhavari Dasagi
Jesse Haviland
Ben Talbot
Michael Milford
Niko Sünderhauf
BDL
OffRL
76
35
0
21 Jul 2021
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning
Denis Yarats
Rob Fergus
A. Lazaric
Lerrel Pinto
OffRL
130
351
0
20 Jul 2021
Active 3D Shape Reconstruction from Vision and Touch
Edward James Smith
David Meger
Luis Villaseñor-Pineda
Roberto Calandra
Jitendra Malik
Adriana Romero
M. Drozdzal
93
47
0
20 Jul 2021
An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
João Carvalho
Davide Tateo
Fabio Muratore
Jan Peters
OffRL
48
7
0
20 Jul 2021
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning
Haoran Xu
Xianyuan Zhan
Xiangyu Zhu
OffRL
76
91
0
19 Jul 2021
Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration
Lukas Schafer
Filippos Christianos
Josiah P. Hanna
Stefano V. Albrecht
92
23
0
19 Jul 2021
Co-designing Intelligent Control of Building HVACs and Microgrids
Rumia Masburah
Sayani Sinha
R. L. Jana
Soumyajit Dey
Qi Zhu
AI4CE
21
3
0
18 Jul 2021
Hierarchical Reinforcement Learning with Optimal Level Synchronization based on a Deep Generative Model
JaeYoon Kim
Junyu Xuan
Christy Jie Liang
F. Hussain
24
0
0
17 Jul 2021
Geometric Value Iteration: Dynamic Error-Aware KL Regularization for Reinforcement Learning
Toshinori Kitamura
Lingwei Zhu
Takamitsu Matsubara
92
2
0
16 Jul 2021
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Dibya Ghosh
Jad Rahme
Aviral Kumar
Amy Zhang
Ryan P. Adams
Sergey Levine
OffRL
365
118
0
13 Jul 2021
Cautious Policy Programming: Exploiting KL Regularization in Monotonic Policy Improvement for Reinforcement Learning
Lingwei Zhu
Toshinori Kitamura
Takamitsu Matsubara
OffRL
33
1
0
13 Jul 2021
The Role of Pretrained Representations for the OOD Generalization of Reinforcement Learning Agents
Andrea Dittadi
Frederik Trauble
M. Wuthrich
Felix Widmaier
Peter V. Gehler
Ole Winther
Francesco Locatello
Olivier Bachem
Bernhard Schölkopf
Stefan Bauer
OOD
108
16
0
12 Jul 2021
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
104
83
0
12 Jul 2021
Cautious Actor-Critic
Lingwei Zhu
Toshinori Kitamura
Takamitsu Matsubara
AAML
75
1
0
12 Jul 2021
Out-of-Distribution Dynamics Detection: RL-Relevant Benchmarks and Results
Mohamad H. Danesh
Alan Fern
152
14
0
11 Jul 2021
Aligning an optical interferometer with beam divergence control and continuous action space
Stepan Makarenko
Dmitry Sorokin
Alexander Ulanov
A. Lvovsky
AI4CE
34
4
0
09 Jul 2021
Safe Exploration by Solving Early Terminated MDP
Hao Sun
Ziping Xu
Meng Fang
Zhenghao Peng
Jiadong Guo
Bo Dai
Bolei Zhou
47
17
0
09 Jul 2021
RMA: Rapid Motor Adaptation for Legged Robots
Ashish Kumar
Zipeng Fu
Deepak Pathak
Jitendra Malik
182
584
0
08 Jul 2021
Adaptation of Quadruped Robot Locomotion with Meta-Learning
A. Kuzhamuratov
Dmitry Sorokin
Alexander Ulanov
A. Lvovsky
37
0
0
08 Jul 2021
Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning
Muhammad Rizki Maulana
W. Lee
46
1
0
05 Jul 2021
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Assaf Hallak
Gal Dalal
Steven Dalton
I. Frosio
Shie Mannor
Gal Chechik
OffRL
OnRL
100
10
0
04 Jul 2021
Low-Dimensional State and Action Representation Learning with MDP Homomorphism Metrics
N. Botteghi
M. Poel
B. Sirmaçek
C. Brune
58
3
0
04 Jul 2021
Hierarchical Policies for Cluttered-Scene Grasping with Latent Plans
Lirui Wang
Xiangyun Meng
Yu Xiang
Dieter Fox
3DPC
DRL
64
27
0
04 Jul 2021
Previous
1
2
3
...
31
32
33
...
42
43
44
Next