Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Just Round: Quantized Observation Spaces Enable Memory Efficient Learning of Dynamic Locomotion
Lev Grossman
Brian Plancher
MQ
65
4
0
14 Oct 2022
Mutual Information Regularized Offline Reinforcement Learning
Xiao Ma
Bingyi Kang
Zhongwen Xu
Min Lin
Shuicheng Yan
OffRL
95
8
0
14 Oct 2022
Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations
Albert Wilcox
Ashwin Balakrishna
Jules Dedieu
Wyame Benslimane
Daniel S. Brown
Ken Goldberg
OffRL
78
20
0
14 Oct 2022
Skill-Based Reinforcement Learning with Intrinsic Reward Matching
Ademi Adeniji
Amber Xie
Pieter Abbeel
OffRL
73
5
0
14 Oct 2022
Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter Aircrafts
Muhammed Murat Özbek
S. Yildirim
Muhammet Aksoy
Eric Kernin
E. Koyuncu
64
5
0
13 Oct 2022
Policy Gradient With Serial Markov Chain Reasoning
Edoardo Cetin
Oya Celiktutan
BDL
LRM
56
2
0
13 Oct 2022
Self-Validated Physics-Embedding Network: A General Framework for Inverse Modelling
Ruiyuan Kang
D. Kyritsis
P. Liatsis
AI4CE
PINN
75
5
0
12 Oct 2022
A Unified Framework for Alternating Offline Model Training and Policy Learning
Shentao Yang
Shujian Zhang
Yihao Feng
Mi Zhou
OffRL
118
17
0
12 Oct 2022
Discovered Policy Optimisation
Chris Xiaoxuan Lu
J. Kuba
Alistair Letcher
Luke Metz
Christian Schroeder de Witt
Jakob N. Foerster
OffRL
92
79
0
11 Oct 2022
Factors of Influence of the Overestimation Bias of Q-Learning
Julius Wagenbach
M. Sabatelli
141
1
0
11 Oct 2022
DHRL: A Graph-Based Approach for Long-Horizon and Sparse Hierarchical Reinforcement Learning
Seungjae Lee
Jigang Kim
Inkyu Jang
H. J. Kim
OffRL
105
13
0
11 Oct 2022
Benchmarking Reinforcement Learning Techniques for Autonomous Navigation
Zifan Xu
Bo Liu
Xuesu Xiao
Anirudh Nair
Peter Stone
136
47
0
10 Oct 2022
Long N-step Surrogate Stage Reward to Reduce Variances of Deep Reinforcement Learning in Complex Problems
Junmin Zhong
Ruofan Wu
J. Si
LRM
37
0
0
10 Oct 2022
Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS Configuration in RIS-aided MU-MISO Systems Under Hardware Impairments and Imperfect CSI
Baturay Saglam
Doğa Gürgünoğlu
Suleyman S. Kozat
36
12
0
10 Oct 2022
Reducing Action Space: Reference-Model-Assisted Deep Reinforcement Learning for Inverter-based Volt-Var Control
Qiong Liu
Ye Guo
Lirong Deng
Haotian Liu
Dongyu Li
Hongbin Sun
18
0
0
10 Oct 2022
State Advantage Weighting for Offline RL
Jiafei Lyu
Aicheng Gong
Le Wan
Zongqing Lu
Xiu Li
OffRL
89
9
0
09 Oct 2022
Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization
Jihwan Jeong
Xiaoyu Wang
Michael Gimelfarb
Hyunwoo J. Kim
Baher Abdulhai
Scott Sanner
OffRL
118
12
0
07 Oct 2022
Algorithmic Trading Using Continuous Action Space Deep Reinforcement Learning
Naseh Majidi
Mahdieh Shamsi
F. Marvasti
AIFin
37
8
0
07 Oct 2022
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets
Chen Gong
Zhou Yang
Yunru Bai
Junda He
Jieke Shi
...
Arunesh Sinha
Bowen Xu
Xinwen Hou
David Lo
Guoliang Fan
AAML
OffRL
95
13
0
07 Oct 2022
Elastic Step DQN: A novel multi-step algorithm to alleviate overestimation in Deep QNetworks
Adrian Ly
Richard Dazeley
Peter Vamplew
Francisco Cruz
Sunil Aryal
107
13
0
07 Oct 2022
Exploration via Planning for Information about the Optimal Trajectory
Viraj Mehta
I. Char
J. Abbate
R. Conlin
M. Boyer
Stefano Ermon
J. Schneider
Willie Neiswanger
OffRL
79
6
0
06 Oct 2022
Designing a Robust Low-Level Agnostic Controller for a Quadrotor with Actor-Critic Reinforcement Learning
Guilherme Siqueira Eduardo
W. Caarls
37
0
0
06 Oct 2022
Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery
Félix Chalumeau
Raphael Boige
Bryan Lim
Valentin Macé
Maxime Allard
Arthur Flajolet
Antoine Cully
Thomas Pierrot
113
25
0
06 Oct 2022
Training Diverse High-Dimensional Controllers by Scaling Covariance Matrix Adaptation MAP-Annealing
Bryon Tjanaka
Matthew C. Fontaine
David H. Lee
Aniruddha Kalkar
Stefanos Nikolaidis
124
10
0
06 Oct 2022
Learning Depth Vision-Based Personalized Robot Navigation From Dynamic Demonstrations in Virtual Reality
Jorge de Heuvel
Nathan Corral
Benedikt Kreis
Jacobus Conradi
Anne Driemel
Maren Bennewitz
71
13
0
04 Oct 2022
Handling Sparse Rewards in Reinforcement Learning Using Model Predictive Control
Murad Dawood
Nils Dengler
Jorge de Heuvel
Maren Bennewitz
96
11
0
04 Oct 2022
Safe Self-Supervised Learning in Real of Visuo-Tactile Feedback Policies for Industrial Insertion
Letian Fu
Huang Huang
Lars Berscheid
Hui Li
Ken Goldberg
Sachin Chitta
90
18
0
04 Oct 2022
Latent State Marginalization as a Low-cost Approach for Improving Exploration
Dinghuai Zhang
Aaron Courville
Yoshua Bengio
Qinqing Zheng
Amy Zhang
Ricky T. Q. Chen
OOD
101
10
0
03 Oct 2022
Deep Learning for Wireless Networked Systems: a joint Estimation-Control-Scheduling Approach
Zihuai Zhao
Wanchun Liu
Daniel E. Quevedo
Yonghui Li
Branka Vucetic
69
18
0
03 Oct 2022
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Yannick Hogewind
T. D. Simão
Tal Kachman
N. Jansen
59
10
0
02 Oct 2022
Deep Intrinsically Motivated Exploration in Continuous Control
Baturay Saglam
Suleyman S. Kozat
61
4
0
01 Oct 2022
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States
C. Banerjee
Zhiyong Chen
N. Noman
60
3
0
01 Oct 2022
Online Weighted Q-Ensembles for Reduced Hyperparameter Tuning in Reinforcement Learning
R. G. Oliveira
W. Caarls
OffRL
53
0
0
29 Sep 2022
Reinforcement Learning Algorithms: An Overview and Classification
Fadi AlMahamid
Katarina Grolinger
37
45
0
29 Sep 2022
Does Zero-Shot Reinforcement Learning Exist?
Ahmed Touati
Jérémy Rapin
Yann Ollivier
OffRL
116
46
0
29 Sep 2022
Accelerating Laboratory Automation Through Robot Skill Learning For Sample Scraping
Gabriella Pizzuto
Hetong Wang
Hatem Fakhruldeen
Bei Peng
K. Luck
Andrew I. Cooper
74
3
0
29 Sep 2022
Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training
Gang Chen
Victoria Huang
OffRL
111
1
0
29 Sep 2022
Generalization in Deep Reinforcement Learning for Robotic Navigation by Reward Shaping
Victor R. F. Miranda
A. A. Neto
G. Freitas
L. Mozelli
72
21
0
28 Sep 2022
DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body
A. Chiappa
Alessandro Marin Vargas
Alexander Mathis
71
10
0
28 Sep 2022
Neural Network Panning: Screening the Optimal Sparse Network Before Training
Xiatao Kang
P. Li
Jiayi Yao
Chengxi Li
VLM
45
1
0
27 Sep 2022
Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective
Lunjun Zhang
Bradly C. Stadie
52
1
0
26 Sep 2022
Delayed Geometric Discounts: An Alternative Criterion for Reinforcement Learning
Firas Jarboui
Ahmed Akakzia
41
0
0
26 Sep 2022
Deep Reinforcement Learning for Adaptive Mesh Refinement
C. Foucart
A. Charous
Pierre FJ Lermusiaux
AI4CE
81
23
0
25 Sep 2022
Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning
Kang Xu
Yan Ma
Wei Li
97
0
0
23 Sep 2022
Minimizing Human Assistance: Augmenting a Single Demonstration for Deep Reinforcement Learning
Abraham George
Alison Bartsch
A. Farimani
OffRL
57
5
0
22 Sep 2022
Modern Machine Learning Tools for Monitoring and Control of Industrial Processes: A Survey
R. Bhushan Gopaluni
Aditya Tulsyan
Benoît Chachuat
Biao Huang
J. M. Lee
Faraz Amjad
S. Damarla
Jong Woo Kim
Nathan P. Lawrence
AI4CE
74
38
0
22 Sep 2022
Bypassing the Simulation-to-reality Gap: Online Reinforcement Learning using a Supervisor
B. D. Evans
Johannes Betz
Hongrui Zheng
H. Engelbrecht
Rahul Mangharam
H. W. Jordaan
OffRL
59
7
0
22 Sep 2022
Model-Free Reinforcement Learning for Asset Allocation
Adebayo Oshingbesan
Eniola Ajiboye
Peruth Kamashazi
Timothy Mbaka
OffRL
59
1
0
21 Sep 2022
Revisiting Discrete Soft Actor-Critic
Haibin Zhou
Zichuan Lin
Junyou Li
Qiang Fu
Wei Yang
Deheng Ye
112
13
0
21 Sep 2022
ECSAS: Exploring Critical Scenarios from Action Sequence in Autonomous Driving
Shuting Kang
Heng Guo
Lijun Zhang
Guangzhen Liu
Yunzhi Xue
Yanjun Wu
75
5
0
21 Sep 2022
Previous
1
2
3
...
22
23
24
...
42
43
44
Next