1802.09477
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"
50 / 849 papers shown
Title
Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning
Yi Zhao
Rinu Boney
Alexander Ilin
Arno Solin
Joni Pajarinen
OffRL
OnRL
28
39
0
25 Oct 2022
Empirical analysis of PGA-MAP-Elites for Neuroevolution in Uncertain Domains
Manon Flageat
Félix Chalumeau
Antoine Cully
36
26
0
24 Oct 2022
MetaEMS: A Meta Reinforcement Learning-based Control Framework for Building Energy Management System
Huiliang Zhang
Di Wu
Benoit Boulet
41
6
0
23 Oct 2022
Bridging the Gap Between Target Networks and Functional Regularization
Alexandre Piché
Valentin Thomas
Joseph Marino
Rafael Pardiñas
Gian Maria Marconi
C. Pal
Mohammad Emtiyaz Khan
14
1
0
21 Oct 2022
Learning Robust Dynamics through Variational Sparse Gating
A. Jain
Shivakanth Sujit
S. Joshi
Vincent Michalski
Danijar Hafner
Samira Ebrahimi Kahou
27
8
0
21 Oct 2022
RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control
Yanfei Xiang
Xin Wang
Shu Hu
Bin Zhu
Xiaomeng Huang
Xi Wu
Siwei Lyu
SSL
29
5
0
20 Oct 2022
Integrated Decision and Control for High-Level Automated Vehicles by Mixed Policy Gradient and Its Experiment Verification
Yang Guan
Liye Tang
Chuanxiao Li
Shengbo Eben Li
Yangang Ren
Junqing Wei
Bo Zhang
Ke Li
23
0
0
19 Oct 2022
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
Chengqian Gao
Kelvin Xu
Liu Liu
Deheng Ye
P. Zhao
Zhiqiang Xu
OffRL
45
2
0
19 Oct 2022
Deep Black-Box Reinforcement Learning with Movement Primitives
Fabian Otto
Onur Celik
Hongyi Zhou
Hanna Ziesche
Ngo Anh Vien
Gerhard Neumann
OffRL
24
19
0
18 Oct 2022
Planning for Sample Efficient Imitation Learning
Zhao-Heng Yin
Weirui Ye
Qifeng Chen
Yang Gao
OffRL
33
21
0
18 Oct 2022
When to Update Your Model: Constrained Model-based Reinforcement Learning
Tianying Ji
Yu-Juan Luo
Gang Hua
Mingxuan Jing
Fengxiang He
Wen-bing Huang
24
18
0
15 Oct 2022
A Policy-Guided Imitation Approach for Offline Reinforcement Learning
Haoran Xu
Li Jiang
Jianxiong Li
Xianyuan Zhan
OffRL
26
62
0
15 Oct 2022
PI-QT-Opt: Predictive Information Improves Multi-Task Robotic Reinforcement Learning at Scale
Kuang-Huei Lee
Ted Xiao
A. Li
Paul Wohlhart
Ian S. Fischer
Yao Lu
53
10
0
15 Oct 2022
CUP: Critic-Guided Policy Reuse
Jin Zhang
Siyuan Li
Chongjie Zhang
34
8
0
15 Oct 2022
Self-Validated Physics-Embedding Network: A General Framework for Inverse Modelling
Ruiyuan Kang
D. Kyritsis
P. Liatsis
AI4CE
PINN
18
5
0
12 Oct 2022
Factors of Influence of the Overestimation Bias of Q-Learning
Julius Wagenbach
M. Sabatelli
20
1
0
11 Oct 2022
Benchmarking Reinforcement Learning Techniques for Autonomous Navigation
Zifan Xu
Bo Liu
Xuesu Xiao
Anirudh Nair
Peter Stone
39
42
0
10 Oct 2022
Long N-step Surrogate Stage Reward to Reduce Variances of Deep Reinforcement Learning in Complex Problems
Junmin Zhong
Ruofan Wu
J. Si
LRM
30
0
0
10 Oct 2022
Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS Configuration in RIS-aided MU-MISO Systems Under Hardware Impairments and Imperfect CSI
Baturay Saglam
Doğa Gürgünoğlu
Suleyman Serdar Kozat
24
12
0
10 Oct 2022
Algorithmic Trading Using Continuous Action Space Deep Reinforcement Learning
Naseh Majidi
Mahdieh Shamsi
F. Marvasti
AIFin
32
7
0
07 Oct 2022
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets
Chen Gong
Zhou Yang
Yunru Bai
Junda He
Jieke Shi
...
Arunesh Sinha
Bowen Xu
Xinwen Hou
David Lo
Guoliang Fan
AAML
OffRL
29
7
0
07 Oct 2022
Exploration via Planning for Information about the Optimal Trajectory
Viraj Mehta
I. Char
J. Abbate
R. Conlin
M. Boyer
Stefano Ermon
J. Schneider
Willie Neiswanger
OffRL
27
6
0
06 Oct 2022
Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery
Félix Chalumeau
Raphael Boige
Bryan Lim
Valentin Macé
Maxime Allard
Arthur Flajolet
Antoine Cully
Thomas Pierrot
31
21
0
06 Oct 2022
Training Diverse High-Dimensional Controllers by Scaling Covariance Matrix Adaptation MAP-Annealing
Bryon Tjanaka
Matthew C. Fontaine
David H. Lee
Aniruddha Kalkar
Stefanos Nikolaidis
68
8
0
06 Oct 2022
Learning Depth Vision-Based Personalized Robot Navigation From Dynamic Demonstrations in Virtual Reality
Jorge de Heuvel
Nathan Corral
Benedikt Kreis
Jacobus Conradi
Anne Driemel
Maren Bennewitz
42
13
0
04 Oct 2022
Handling Sparse Rewards in Reinforcement Learning Using Model Predictive Control
Murad Dawood
Nils Dengler
Jorge de Heuvel
Maren Bennewitz
29
9
0
04 Oct 2022
Safe Self-Supervised Learning in Real of Visuo-Tactile Feedback Policies for Industrial Insertion
Letian Fu
Huang Huang
Lars Berscheid
Hui Li
Ken Goldberg
Sachin Chitta
44
18
0
04 Oct 2022
Deep Learning for Wireless Networked Systems: a joint Estimation-Control-Scheduling Approach
Zihuai Zhao
Wanchun Liu
Daniel E. Quevedo
Yonghui Li
Branka Vucetic
32
18
0
03 Oct 2022
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Yannick Hogewind
T. D. Simão
Tal Kachman
N. Jansen
21
10
0
02 Oct 2022
Deep Intrinsically Motivated Exploration in Continuous Control
Baturay Saglam
Suleyman Serdar Kozat
26
4
0
01 Oct 2022
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States
C. Banerjee
Zhiyong Chen
N. Noman
31
3
0
01 Oct 2022
Online Weighted Q-Ensembles for Reduced Hyperparameter Tuning in Reinforcement Learning
R. G. Oliveira
W. Caarls
OffRL
29
0
0
29 Sep 2022
Reinforcement Learning Algorithms: An Overview and Classification
Fadi AlMahamid
Katarina Grolinger
21
40
0
29 Sep 2022
Does Zero-Shot Reinforcement Learning Exist?
Ahmed Touati
Jérémy Rapin
Yann Ollivier
OffRL
42
39
0
29 Sep 2022
Accelerating Laboratory Automation Through Robot Skill Learning For Sample Scraping
Gabriella Pizzuto
Hetong Wang
Hatem Fakhruldeen
Bei Peng
K. Luck
Andrew I. Cooper
28
2
0
29 Sep 2022
Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training
Gang Chen
Victoria Huang
OffRL
42
0
0
29 Sep 2022
Generalization in Deep Reinforcement Learning for Robotic Navigation by Reward Shaping
Victor R. F. Miranda
A. A. Neto
G. Freitas
L. Mozelli
37
18
0
28 Sep 2022
DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body
A. Chiappa
Alessandro Marin Vargas
Alexander Mathis
34
7
0
28 Sep 2022
Delayed Geometric Discounts: An Alternative Criterion for Reinforcement Learning
Firas Jarboui
Ahmed Akakzia
19
0
0
26 Sep 2022
Minimizing Human Assistance: Augmenting a Single Demonstration for Deep Reinforcement Learning
Abraham George
Alison Bartsch
A. Farimani
OffRL
19
5
0
22 Sep 2022
Modern Machine Learning Tools for Monitoring and Control of Industrial Processes: A Survey
R. Bhushan Gopaluni
Aditya Tulsyan
Benoît Chachuat
Biao Huang
J. M. Lee
Faraz Amjad
S. Damarla
Jong Woo Kim
Nathan P. Lawrence
AI4CE
21
38
0
22 Sep 2022
Model-Free Reinforcement Learning for Asset Allocation
Adebayo Oshingbesan
Eniola Ajiboye
Peruth Kamashazi
Timothy Mbaka
OffRL
27
1
0
21 Sep 2022
Revisiting Discrete Soft Actor-Critic
Haibin Zhou
Zichuan Lin
Junyou Li
Qiang Fu
Wei Yang
Deheng Ye
51
12
0
21 Sep 2022
ECSAS: Exploring Critical Scenarios from Action Sequence in Autonomous Driving
Shuting Kang
Heng Guo
Lijun Zhang
Guangzhen Liu
Yunzhi Xue
Yanjun Wu
29
5
0
21 Sep 2022
DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
Khaled Nakhleh
I.-Hong Hou
75
6
0
18 Sep 2022
Evolutionary Deep Reinforcement Learning Using Elite Buffer: A Novel Approach Towards DRL Combined with EA in Continuous Control Tasks
Marzie Esmaeeli
H. Malek
27
2
0
18 Sep 2022
Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning
David Bertoin
Adil Zouitine
Mehdi Zouitine
Emmanuel Rachelson
40
30
0
16 Sep 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
OOD
27
16
0
16 Sep 2022
Continuous MDP Homomorphisms and Homomorphic Policy Gradient
S. Rezaei-Shoshtari
Rosie Zhao
Prakash Panangaden
David Meger
Doina Precup
35
18
0
15 Sep 2022
On the Reuse Bias in Off-Policy Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Dong Yan
Jun Zhu
OffRL
45
3
0
15 Sep 2022