Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Active Predicting Coding: Brain-Inspired Reinforcement Learning for Sparse Reward Robotic Control Problems
Alexander Ororbia
A. Mali
93
8
0
19 Sep 2022
DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
Khaled Nakhleh
I.-Hong Hou
148
6
0
18 Sep 2022
A Computational Model of Learning Flexible Navigation in a Maze by Layout-Conforming Replay of Place Cells
Yuan Z Gao
56
1
0
18 Sep 2022
Evolutionary Deep Reinforcement Learning Using Elite Buffer: A Novel Approach Towards DRL Combined with EA in Continuous Control Tasks
Marzie Esmaeeli
H. Malek
66
2
0
18 Sep 2022
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
Raj Ghugare
Homanga Bharadhwaj
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
106
27
0
18 Sep 2022
Robust Reinforcement Learning Algorithm for Vision-based Ship Landing of UAVs
Vishnu Saj
Bochan Lee
D. Kalathil
Moble Benedict
56
5
0
17 Sep 2022
Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning
David Bertoin
Adil Zouitine
Mehdi Zouitine
Emmanuel Rachelson
74
32
0
16 Sep 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
OOD
89
18
0
16 Sep 2022
Understanding Deep Neural Function Approximation in Reinforcement Learning via
ε
ε
ε
-Greedy Exploration
Fanghui Liu
Luca Viano
Volkan Cevher
116
19
0
15 Sep 2022
Continuous MDP Homomorphisms and Homomorphic Policy Gradient
S. Rezaei-Shoshtari
Rosie Zhao
Prakash Panangaden
David Meger
Doina Precup
97
20
0
15 Sep 2022
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping
Hao Sun
Lei Han
Rui Yang
Xiaoteng Ma
Jian Guo
Bolei Zhou
OffRL
OnRL
78
11
0
15 Sep 2022
On the Reuse Bias in Off-Policy Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Dong Yan
Jun Zhu
OffRL
72
3
0
15 Sep 2022
C^2:Co-design of Robots via Concurrent Networks Coupling Online and Offline Reinforcement Learning
Ci Chen
Pingyu Xiang
Haojian Lu
Yue Wang
R. Xiong
OffRL
84
3
0
14 Sep 2022
Mapless Navigation of a Hybrid Aerial Underwater Vehicle with Deep Reinforcement Learning Through Environmental Generalization
Ricardo B. Grando
J. C. Jesus
V. A. Kich
A. H. Kolling
R. S. Guerra
Paulo L. J. Drews-Jr
80
13
0
13 Sep 2022
Deep Reinforcement Learning for Cryptocurrency Trading: Practical Approach to Address Backtest Overfitting
Berend Gort
Xiao-Yang Liu
Xinghang Sun
Jiechao Gao
Shuai Chen
Chris Wang
100
13
0
12 Sep 2022
Non-iterative generation of an optimal mesh for a blade passage using deep reinforcement learning
Innyoung Kim
Sejin Kim
D. You
AI4CE
16
6
0
08 Sep 2022
A Deep Reinforcement Learning Strategy for UAV Autonomous Landing on a Platform
Zhiling Jiang
Guang-hua Song
60
9
0
07 Sep 2022
When Bioprocess Engineering Meets Machine Learning: A Survey from the Perspective of Automated Bioprocess Development
Nghia Duong-Trung
Stefan Born
Jong Woo Kim
M. Schermeyer
Katharina Paulick
...
Thorben Werner
Randolf Scholz
Lars Schmidt-Thieme
Peter Neubauer
Ernesto Martinez
80
20
0
02 Sep 2022
Actor Prioritized Experience Replay
Baturay Saglam
Furkan B. Mutlu
Dogan C. Cicek
Suleyman S. Kozat
78
27
0
01 Sep 2022
Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning
Pihe Hu
L. Pan
Yu Chen
Zhixuan Fang
Longbo Huang
34
5
0
30 Aug 2022
Goal-Conditioned Q-Learning as Knowledge Distillation
Alexander Levine
Soheil Feizi
OffRL
111
3
0
28 Aug 2022
Normality-Guided Distributional Reinforcement Learning for Continuous Control
Ju-Seung Byun
Andrew Perrault
OffRL
103
0
0
28 Aug 2022
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy Treatment Strategies with Deep Reinforcement Learning
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
OffRL
86
12
0
27 Aug 2022
Risk Verification of Stochastic Systems with Neural Network Controllers
Matthew Cleaveland
Lars Lindemann
Radoslav Ivanov
George Pappas
91
9
0
26 Aug 2022
Exploiting Deep Reinforcement Learning for Edge Caching in Cell-Free Massive MIMO Systems
Yu Zhang
Shuaifei Chen
Jiayi Zhang
57
0
0
26 Aug 2022
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement Learning: A Systematic Review
Fadi AlMahamid
Katarina Grolinger
61
76
0
25 Aug 2022
Turning Mathematics Problems into Games: Reinforcement Learning and Gröbner bases together solve Integer Feasibility Problems
Yue Wu
J. D. Loera
38
4
0
25 Aug 2022
A Comparison of Reinforcement Learning Frameworks for Software Testing Tasks
Paulina Stevia Nouwou Mindom
Amin Nikanjam
Foutse Khomh
OffRL
67
11
0
25 Aug 2022
An intelligent algorithmic trading based on a risk-return reinforcement learning algorithm
Boyin Jin
24
1
0
23 Aug 2022
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph Learning for Continuous Action Space
Yining Chen
Ke Wang
Guang-hua Song
Xiaohong Jiang
52
3
0
23 Aug 2022
Improving Sample Efficiency in Evolutionary RL Using Off-Policy Ranking
R. EshwarS
Shishir Kolathaya
Gugan Thoppe
45
0
0
22 Aug 2022
Prioritizing Samples in Reinforcement Learning with Reducible Loss
Shivakanth Sujit
Somjit Nath
Pedro H. M. Braga
Samira Ebrahimi Kahou
88
16
0
22 Aug 2022
Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning
B. Liu
Yihao Feng
Qian Liu
Peter Stone
94
3
0
17 Aug 2022
PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm
T. Basaklar
S. Gumussoy
Ümit Y. Ogras
50
41
0
16 Aug 2022
Trustworthy Federated Learning via Blockchain
Zhanpeng Yang
Yuanming Shi
Yong Zhou
Zixin Wang
Kai Yang
80
72
0
13 Aug 2022
Bayesian Soft Actor-Critic: A Directed Acyclic Strategy Graph Based Deep Reinforcement Learning
Qin Yang
Ramviyas Parasuraman
BDL
39
0
0
11 Aug 2022
Fairness Based Energy-Efficient 3D Path Planning of a Portable Access Point: A Deep Reinforcement Learning Approach
N. Babu
I. Donevski
Álvaro Valcarce
P. Popovski
J. J. Nielsen
C. Papadias
44
12
0
10 Aug 2022
Robust Reinforcement Learning using Offline Data
Kishan Panaganti
Zaiyan Xu
D. Kalathil
Mohammad Ghavamzadeh
OffRL
111
79
0
10 Aug 2022
Multi-Task Fusion via Reinforcement Learning for Long-Term User Satisfaction in Recommender Systems
Qihua Zhang
Junning Liu
Yuzhuo Dai
Yiyan Qi
Yifan Yuan
Kunlun Zheng
Fan Huang
Xianfeng Tan
OffRL
80
51
0
09 Aug 2022
Automating DBSCAN via Deep Reinforcement Learning
Ruitong Zhang
Hao Peng
Yingtong Dou
Hongzhi Zhang
Qingyun Sun
Jingyi Zhang
Philip S. Yu
OffRL
50
20
0
09 Aug 2022
Maximum Correntropy Value Decomposition for Multi-agent Deep Reinforcemen Learning
Kai Liu
Tianxian Zhang
L. Kong
78
0
0
07 Aug 2022
Transferable Multi-Agent Reinforcement Learning with Dynamic Participating Agents
Xuting Tang
Jia Xu
Shusen Wang
51
1
0
04 Aug 2022
Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Tasks with Sparse Rewards
Yongle Luo
Yuxin Wang
Kun Dong
Qiaosheng Zhang
Erkang Cheng
Zhiyong Sun
Bo Song
62
18
0
01 Aug 2022
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach
Baturay Saglam
Dogan C. Cicek
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
OnRL
85
1
0
01 Aug 2022
Performance Comparison of Deep RL Algorithms for Energy Systems Optimal Scheduling
Shengren Hou
Edgar Mauricio Salazar Duque
Pedro P. Vergara
Peter Palensky
26
19
0
01 Aug 2022
Biologically Plausible Training of Deep Neural Networks Using a Top-down Credit Assignment Network
Jian-Hui Chen
Cheng-Lin Liu
Zuoren Wang
55
0
0
01 Aug 2022
Sampling, Communication, and Prediction Co-Design for Synchronizing the Real-World Device and Digital Model in Metaverse
Zhen Meng
Changyang She
Guodong Zhao
D. Martini
28
42
0
31 Jul 2022
Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination
A. Mohtasib
Gerhard Neumann
Heriberto Cuayáhuitl
OffRL
81
2
0
31 Jul 2022
Unified Automatic Control of Vehicular Systems with Reinforcement Learning
Zhongxia Yan
Abdul Rahman Kreidieh
Eugene Vinitsky
Alexandre M. Bayen
Cathy Wu
AI4CE
86
43
0
30 Jul 2022
Meta Reinforcement Learning with Successor Feature Based Context
Xu Han
Feng Wu
OffRL
LRM
76
3
0
29 Jul 2022
Previous
1
2
3
...
23
24
25
...
42
43
44
Next