Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
125
18
0
23 Feb 2022
Cooperative Behavior Planning for Automated Driving using Graph Neural Networks
Marvin Klimke
Benjamin Völz
M. Buchholz
49
21
0
23 Feb 2022
Reinforcement Learning from Demonstrations by Novel Interactive Expert and Application to Automatic Berthing Control Systems for Unmanned Surface Vessel
Haoran Zhang
Chenkun Yin
Yanxin Zhang
S. Jin
Zhenxuan Li
OffRL
50
3
0
23 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
71
9
0
23 Feb 2022
A Behavior Regularized Implicit Policy for Offline Reinforcement Learning
Shentao Yang
Zhendong Wang
Huangjie Zheng
Yihao Feng
Mingyuan Zhou
OffRL
64
9
0
19 Feb 2022
Multi-task Safe Reinforcement Learning for Navigating Intersections in Dense Traffic
Yuqi Liu
Qichao Zhang
Dongbin Zhao
78
16
0
19 Feb 2022
CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban Driving
Yinuo Zhao
Kun Wu
Zhiyuan Xu
Zhengping Che
Qi Lu
Jian Tang
C. Liu
88
28
0
17 Feb 2022
VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning
Che Wang
Xufang Luo
George Andriopoulos
Dongsheng Li
OffRL
123
51
0
17 Feb 2022
Soft Actor-Critic Deep Reinforcement Learning for Fault Tolerant Flight Control
Killian Dally
E. Kampen
61
16
0
16 Feb 2022
Policy Learning and Evaluation with Randomized Quasi-Monte Carlo
Sébastien M. R. Arnold
P. LÉcuyer
Liyu Chen
Yi-fan Chen
Fei Sha
OffRL
82
4
0
16 Feb 2022
Safe Reinforcement Learning by Imagining the Near Future
G. Thomas
Yuping Luo
Tengyu Ma
OffRL
75
86
0
15 Feb 2022
Exploring Deep Reinforcement Learning-Assisted Federated Learning for Online Resource Allocation in Privacy-Persevering EdgeIoT
Jingjing Zheng
Kai Li
N. Mhaisen
Wei Ni
Eduardo Tovar
Mohsen Guizani
78
47
0
15 Feb 2022
L2C2: Locally Lipschitz Continuous Constraint towards Stable and Smooth Reinforcement Learning
Taisuke Kobayashi
94
16
0
15 Feb 2022
QuadSim: A Quadcopter Rotational Dynamics Simulation Framework For Reinforcement Learning Algorithms
Burak Han Demirbilek
22
0
0
14 Feb 2022
Supported Policy Optimization for Offline Reinforcement Learning
Jialong Wu
Haixu Wu
Zihan Qiu
Jianmin Wang
Mingsheng Long
OffRL
89
70
0
13 Feb 2022
REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer
Xingyu Liu
Deepak Pathak
Kris Kitani
93
20
0
10 Feb 2022
Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning
Stephen James
Pieter Abbeel
56
9
0
08 Feb 2022
skrl: Modular and Flexible Library for Reinforcement Learning
Antonio Serrano-Muñoz
D. Chrysostomou
Simon Boegh
N. Arana-Arexolaleiba
89
31
0
08 Feb 2022
Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning
Bryon Tjanaka
Matthew C. Fontaine
Julian Togelius
Stefanos Nikolaidis
86
54
0
08 Feb 2022
Model-Based Offline Meta-Reinforcement Learning with Regularization
Sen Lin
Jialin Wan
Tengyu Xu
Yingbin Liang
Junshan Zhang
OffRL
113
16
0
07 Feb 2022
Reinforcement Learning for Shared Autonomy Drone Landings
Kal Backman
Dana Kulic
Hoam Chung
44
16
0
07 Feb 2022
Soft Actor-Critic with Inhibitory Networks for Faster Retraining
J. Ide
Daria Mićović
Michael J. Guarino
K. Alcedo
D. Rosenbluth
Adrian P. Pope
54
3
0
07 Feb 2022
Learning Synthetic Environments and Reward Networks for Reinforcement Learning
Fabio Ferreira
Thomas Nierhoff
Andreas Saelinger
Frank Hutter
41
4
0
06 Feb 2022
Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning
Michael Teng
M. van de Panne
Frank Wood
OOD
OffRL
39
1
0
06 Feb 2022
Rethinking ValueDice: Does It Really Improve Performance?
Ziniu Li
Tian Xu
Yang Yu
Zhimin Luo
OffRL
75
17
0
05 Feb 2022
Adversarially Trained Actor Critic for Offline Reinforcement Learning
Ching-An Cheng
Tengyang Xie
Nan Jiang
Alekh Agarwal
OffRL
97
131
0
05 Feb 2022
Transfer Reinforcement Learning for Differing Action Spaces via Q-Network Representations
Nathan Beck
Abhiramon Rajasekharan
H. Tran
13
3
0
05 Feb 2022
A Temporal-Difference Approach to Policy Gradient Estimation
Samuele Tosatto
Andrew Patterson
Martha White
A. R. Mahmood
OffRL
114
2
0
04 Feb 2022
Learning Interpretable, High-Performing Policies for Autonomous Driving
Rohan R. Paleja
Yaru Niu
Andrew Silva
Chace Ritchie
Sugju Choi
Matthew C. Gombolay
87
17
0
04 Feb 2022
Federated Reinforcement Learning for Collective Navigation of Robotic Swarms
Seongin Na
Tomáš Rouček
Jiří Ulrich
Jan Pikman
T. Krajník
Barry Lennox
F. Arvin
61
34
0
02 Feb 2022
Tutorial on amortized optimization
Brandon Amos
OffRL
177
48
0
01 Feb 2022
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
Denis Yarats
David Brandfonbrener
Hao Liu
Michael Laskin
Pieter Abbeel
A. Lazaric
Lerrel Pinto
OffRL
OnRL
103
94
0
31 Jan 2022
DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning
Hassam Sheikh
Kizza M Nandyose Frisbee
Mariano Phielipp
62
8
0
31 Jan 2022
Steady-State Error Compensation in Reference Tracking and Disturbance Rejection Problems for Reinforcement Learning-Based Control
Daniel Weber
Maximilian Schenke
Oliver Wallscheid
16
2
0
31 Jan 2022
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes
Chao Qu
Jue Chen
Siqiao Xue
Xiaoming Shi
James Y. Zhang
Hongyuan Mei
OffRL
72
18
0
29 Jan 2022
Do You Need the Entropy Reward (in Practice)?
Haonan Yu
Haichao Zhang
Wei Xu
85
8
0
28 Jan 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
David Meger
Doina Precup
Ofir Nachum
S. Gu
115
32
0
28 Jan 2022
Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid
Yutaro Yamada
S. Gu
3DV
RALM
OffRL
240
96
0
28 Jan 2022
Constrained Variational Policy Optimization for Safe Reinforcement Learning
Zuxin Liu
Zhepeng Cen
Vladislav Isenbaev
Wei Liu
Zhiwei Steven Wu
Yue Liu
Ding Zhao
94
81
0
28 Jan 2022
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning
Haichao Zhang
Wei Xu
Haonan Yu
78
10
0
24 Jan 2022
State-Conditioned Adversarial Subgoal Generation
V. Wang
Joni Pajarinen
Tinghuai Wang
Joni-Kristian Kämäräinen
94
12
0
24 Jan 2022
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective
Ryan K. Tan
Yang Liu
Lei Xie
77
2
0
21 Jan 2022
Deep reinforcement learning under signal temporal logic constraints using Lagrangian relaxation
Junya Ikemoto
T. Ushio
126
10
0
21 Jan 2022
A Prescriptive Dirichlet Power Allocation Policy with Deep Reinforcement Learning
Yuan Tian
Minghao Han
Chetan S. Kulkarni
Olga Fink
71
13
0
20 Jan 2022
Programmatic Policy Extraction by Iterative Local Search
Rasmus Larsen
Mikkel N. Schmidt
27
0
0
18 Jan 2022
Recursive Least Squares Advantage Actor-Critic Algorithms
Yuan Wang
Chunyuan Zhang
Tianzong Yu
Meng-tao Ma
39
0
0
15 Jan 2022
Reinforcement Learning based Air Combat Maneuver Generation
Muhammed Murat Özbek
E. Koyuncu
29
4
0
14 Jan 2022
Evolutionary Action Selection for Gradient-based Policy Learning
Yan Ma
T. Liu
Bingsheng Wei
Yi Liu
Kang Xu
Wei Li
153
9
0
12 Jan 2022
Offline Reinforcement Learning for Road Traffic Control
Mayuresh Kunjir
Sanjay Chawla
OffRL
64
4
0
07 Jan 2022
Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients
Hanhan Zhou
Tian-Shing Lan
Vaneet Aggarwal
99
32
0
04 Jan 2022
Previous
1
2
3
...
27
28
29
...
42
43
44
Next