Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Active Predicting Coding: Brain-Inspired Reinforcement Learning for Sparse Reward Robotic Control Problems
Alexander Ororbia
A. Mali
93
8
0
19 Sep 2022
DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
Khaled Nakhleh
I.-Hong Hou
148
6
0
18 Sep 2022
A Computational Model of Learning Flexible Navigation in a Maze by Layout-Conforming Replay of Place Cells
Yuan Z Gao
56
1
0
18 Sep 2022
Evolutionary Deep Reinforcement Learning Using Elite Buffer: A Novel Approach Towards DRL Combined with EA in Continuous Control Tasks
Marzie Esmaeeli
H. Malek
66
2
0
18 Sep 2022
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
Raj Ghugare
Homanga Bharadhwaj
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
106
27
0
18 Sep 2022
Robust Reinforcement Learning Algorithm for Vision-based Ship Landing of UAVs
Vishnu Saj
Bochan Lee
D. Kalathil
Moble Benedict
56
5
0
17 Sep 2022
Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning
David Bertoin
Adil Zouitine
Mehdi Zouitine
Emmanuel Rachelson
74
32
0
16 Sep 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
OOD
89
18
0
16 Sep 2022
Understanding Deep Neural Function Approximation in Reinforcement Learning via ε-Greedy Exploration
Fanghui Liu
Luca Viano
Volkan Cevher
116
19
0
15 Sep 2022
Continuous MDP Homomorphisms and Homomorphic Policy Gradient
S. Rezaei-Shoshtari
Rosie Zhao
Prakash Panangaden
David Meger
Doina Precup
97
20
0
15 Sep 2022
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping
Hao Sun
Lei Han
Rui Yang
Xiaoteng Ma
Jian Guo
Bolei Zhou
OffRL, OnRL
78
11
0
15 Sep 2022
On the Reuse Bias in Off-Policy Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Dong Yan
Jun Zhu
OffRL
72
3
0
15 Sep 2022
C^2:Co-design of Robots via Concurrent Networks Coupling Online and Offline Reinforcement Learning
Ci Chen
Pingyu Xiang
Haojian Lu
Yue Wang
R. Xiong
OffRL
84
3
0
14 Sep 2022
Mapless Navigation of a Hybrid Aerial Underwater Vehicle with Deep Reinforcement Learning Through Environmental Generalization
Ricardo B. Grando
J. C. Jesus
V. A. Kich
A. H. Kolling
R. S. Guerra
Paulo L. J. Drews-Jr
80
13
0
13 Sep 2022
Deep Reinforcement Learning for Cryptocurrency Trading: Practical Approach to Address Backtest Overfitting
Berend Gort
Xiao-Yang Liu
Xinghang Sun
Jiechao Gao
Shuai Chen
Chris Wang
100
13
0
12 Sep 2022
Non-iterative generation of an optimal mesh for a blade passage using deep reinforcement learning
Innyoung Kim
Sejin Kim
D. You
AI4CE
16
6
0
08 Sep 2022
A Deep Reinforcement Learning Strategy for UAV Autonomous Landing on a Platform
Zhiling Jiang
Guang-hua Song
60
9
0
07 Sep 2022
When Bioprocess Engineering Meets Machine Learning: A Survey from the Perspective of Automated Bioprocess Development
Nghia Duong-Trung
Stefan Born
Jong Woo Kim
M. Schermeyer
Katharina Paulick
...
Thorben Werner
Randolf Scholz
Lars Schmidt-Thieme
Peter Neubauer
Ernesto Martinez
80
20
0
02 Sep 2022
Actor Prioritized Experience Replay
Baturay Saglam
Furkan B. Mutlu
Dogan C. Cicek
Suleyman S. Kozat
78
27
0
01 Sep 2022
Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning
Pihe Hu
L. Pan
Yu Chen
Zhixuan Fang
Longbo Huang
34
5
0
30 Aug 2022
Goal-Conditioned Q-Learning as Knowledge Distillation
Alexander Levine
Soheil Feizi
OffRL
111
3
0
28 Aug 2022
Normality-Guided Distributional Reinforcement Learning for Continuous Control
Ju-Seung Byun
Andrew Perrault
OffRL
103
0
0
28 Aug 2022
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy Treatment Strategies with Deep Reinforcement Learning
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
OffRL
86
12
0
27 Aug 2022
Risk Verification of Stochastic Systems with Neural Network Controllers
Matthew Cleaveland
Lars Lindemann
Radoslav Ivanov
George Pappas
91
9
0
26 Aug 2022
Exploiting Deep Reinforcement Learning for Edge Caching in Cell-Free Massive MIMO Systems
Yu Zhang
Shuaifei Chen
Jiayi Zhang
57
0
0
26 Aug 2022
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement Learning: A Systematic Review
Fadi AlMahamid
Katarina Grolinger
61
76
0
25 Aug 2022
Turning Mathematics Problems into Games: Reinforcement Learning and Gröbner bases together solve Integer Feasibility Problems
Yue Wu
J. D. Loera
38
4
0
25 Aug 2022
A Comparison of Reinforcement Learning Frameworks for Software Testing Tasks
Paulina Stevia Nouwou Mindom
Amin Nikanjam
Foutse Khomh
OffRL
67
11
0
25 Aug 2022
An intelligent algorithmic trading based on a risk-return reinforcement learning algorithm
Boyin Jin
24
1
0
23 Aug 2022
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph Learning for Continuous Action Space
Yining Chen
Ke Wang
Guang-hua Song
Xiaohong Jiang
52
3
0
23 Aug 2022
Improving Sample Efficiency in Evolutionary RL Using Off-Policy Ranking
R. EshwarS
Shishir Kolathaya
Gugan Thoppe
45
0
0
22 Aug 2022
Prioritizing Samples in Reinforcement Learning with Reducible Loss
Shivakanth Sujit
Somjit Nath
Pedro H. M. Braga
Samira Ebrahimi Kahou
88
16
0
22 Aug 2022
Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning
B. Liu
Yihao Feng
Qian Liu
Peter Stone
94
3
0
17 Aug 2022
PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm
T. Basaklar
S. Gumussoy
Ümit Y. Ogras
50
41
0
16 Aug 2022
Trustworthy Federated Learning via Blockchain
Zhanpeng Yang
Yuanming Shi
Yong Zhou
Zixin Wang
Kai Yang
80
72
0
13 Aug 2022
Bayesian Soft Actor-Critic: A Directed Acyclic Strategy Graph Based Deep Reinforcement Learning
Qin Yang
Ramviyas Parasuraman
BDL
39
0
0
11 Aug 2022
Fairness Based Energy-Efficient 3D Path Planning of a Portable Access Point: A Deep Reinforcement Learning Approach
N. Babu
I. Donevski
Álvaro Valcarce
P. Popovski
J. J. Nielsen
C. Papadias
44
12
0
10 Aug 2022
Robust Reinforcement Learning using Offline Data
Kishan Panaganti
Zaiyan Xu
D. Kalathil
Mohammad Ghavamzadeh
OffRL
111
79
0
10 Aug 2022
Multi-Task Fusion via Reinforcement Learning for Long-Term User Satisfaction in Recommender Systems
Qihua Zhang
Junning Liu
Yuzhuo Dai
Yiyan Qi
Yifan Yuan
Kunlun Zheng
Fan Huang
Xianfeng Tan
OffRL
80
51
0
09 Aug 2022
Automating DBSCAN via Deep Reinforcement Learning
Ruitong Zhang
Hao Peng
Yingtong Dou
Hongzhi Zhang
Qingyun Sun
Jingyi Zhang
Philip S. Yu
OffRL
50
20
0
09 Aug 2022
Maximum Correntropy Value Decomposition for Multi-agent Deep Reinforcemen Learning
Kai Liu
Tianxian Zhang
L. Kong
78
0
0
07 Aug 2022
Transferable Multi-Agent Reinforcement Learning with Dynamic Participating Agents
Xuting Tang
Jia Xu
Shusen Wang
51
1
0
04 Aug 2022
Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Tasks with Sparse Rewards
Yongle Luo
Yuxin Wang
Kun Dong
Qiaosheng Zhang
Erkang Cheng
Zhiyong Sun
Bo Song
62
18
0
01 Aug 2022
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach
Baturay Saglam
Dogan C. Cicek
Furkan B. Mutlu
Suleyman S. Kozat
OffRL, OnRL
85
1
0
01 Aug 2022
Performance Comparison of Deep RL Algorithms for Energy Systems Optimal Scheduling
Shengren Hou
Edgar Mauricio Salazar Duque
Pedro P. Vergara
Peter Palensky
26
19
0
01 Aug 2022
Biologically Plausible Training of Deep Neural Networks Using a Top-down Credit Assignment Network
Jian-Hui Chen
Cheng-Lin Liu
Zuoren Wang
55
0
0
01 Aug 2022
Sampling, Communication, and Prediction Co-Design for Synchronizing the Real-World Device and Digital Model in Metaverse
Zhen Meng
Changyang She
Guodong Zhao
D. Martini
28
42
0
31 Jul 2022
Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination
A. Mohtasib
Gerhard Neumann
Heriberto Cuayáhuitl
OffRL
81
2
0
31 Jul 2022
Unified Automatic Control of Vehicular Systems with Reinforcement Learning
Zhongxia Yan
Abdul Rahman Kreidieh
Eugene Vinitsky
Alexandre M. Bayen
Cathy Wu
AI4CE
86
43
0
30 Jul 2022
Meta Reinforcement Learning with Successor Feature Based Context
Xu Han
Feng Wu
OffRL, LRM
76
3
0
29 Jul 2022