ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXivPDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 849 papers shown
Title
Mapless Navigation of a Hybrid Aerial Underwater Vehicle with Deep
  Reinforcement Learning Through Environmental Generalization
Mapless Navigation of a Hybrid Aerial Underwater Vehicle with Deep Reinforcement Learning Through Environmental Generalization
Ricardo B. Grando
J. C. Jesus
V. A. Kich
A. H. Kolling
R. S. Guerra
Paulo L. J. Drews-Jr
41
13
0
13 Sep 2022
Deep Reinforcement Learning for Cryptocurrency Trading: Practical
  Approach to Address Backtest Overfitting
Deep Reinforcement Learning for Cryptocurrency Trading: Practical Approach to Address Backtest Overfitting
Berend Gort
Xiao-Yang Liu
Xinghang Sun
Jiechao Gao
Shuai Chen
Chris Wang
32
13
0
12 Sep 2022
A Deep Reinforcement Learning Strategy for UAV Autonomous Landing on a
  Platform
A Deep Reinforcement Learning Strategy for UAV Autonomous Landing on a Platform
Zhiling Jiang
Guang-hua Song
13
9
0
07 Sep 2022
When Bioprocess Engineering Meets Machine Learning: A Survey from the
  Perspective of Automated Bioprocess Development
When Bioprocess Engineering Meets Machine Learning: A Survey from the Perspective of Automated Bioprocess Development
Nghia Duong-Trung
Stefan Born
Jong Woo Kim
M. Schermeyer
Katharina Paulick
...
Thorben Werner
Randolf Scholz
Lars Schmidt-Thieme
Peter Neubauer
Ernesto Martinez
39
20
0
02 Sep 2022
Goal-Conditioned Q-Learning as Knowledge Distillation
Goal-Conditioned Q-Learning as Knowledge Distillation
Alexander Levine
S. Feizi
OffRL
24
2
0
28 Aug 2022
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy
  Treatment Strategies with Deep Reinforcement Learning
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy Treatment Strategies with Deep Reinforcement Learning
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
OffRL
29
12
0
27 Aug 2022
Turning Mathematics Problems into Games: Reinforcement Learning and
  Gröbner bases together solve Integer Feasibility Problems
Turning Mathematics Problems into Games: Reinforcement Learning and Gröbner bases together solve Integer Feasibility Problems
Yue Wu
J. D. Loera
24
4
0
25 Aug 2022
A Comparison of Reinforcement Learning Frameworks for Software Testing
  Tasks
A Comparison of Reinforcement Learning Frameworks for Software Testing Tasks
Paulina Stevia Nouwou Mindom
Amin Nikanjam
Foutse Khomh
OffRL
22
10
0
25 Aug 2022
An intelligent algorithmic trading based on a risk-return reinforcement
  learning algorithm
An intelligent algorithmic trading based on a risk-return reinforcement learning algorithm
Boyin Jin
24
1
0
23 Aug 2022
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph
  Learning for Continuous Action Space
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph Learning for Continuous Action Space
Yining Chen
Ke Wang
Guang-hua Song
Xiaohong Jiang
28
3
0
23 Aug 2022
Trustworthy Federated Learning via Blockchain
Trustworthy Federated Learning via Blockchain
Zhanpeng Yang
Yuanming Shi
Yong Zhou
Zixin Wang
Kai Yang
39
68
0
13 Aug 2022
Fairness Based Energy-Efficient 3D Path Planning of a Portable Access
  Point: A Deep Reinforcement Learning Approach
Fairness Based Energy-Efficient 3D Path Planning of a Portable Access Point: A Deep Reinforcement Learning Approach
N. Babu
I. Donevski
Álvaro Valcarce
P. Popovski
J. J. Nielsen
C. Papadias
16
12
0
10 Aug 2022
Multi-Task Fusion via Reinforcement Learning for Long-Term User
  Satisfaction in Recommender Systems
Multi-Task Fusion via Reinforcement Learning for Long-Term User Satisfaction in Recommender Systems
Qihua Zhang
Junning Liu
Yuzhuo Dai
Yiyan Qi
Yifan Yuan
Kunlun Zheng
Fan Huang
Xianfeng Tan
OffRL
35
50
0
09 Aug 2022
Automating DBSCAN via Deep Reinforcement Learning
Automating DBSCAN via Deep Reinforcement Learning
Ruitong Zhang
Hao Peng
Yingtong Dou
Jia Wu
Qingyun Sun
Jingyi Zhang
Philip S. Yu
OffRL
24
19
0
09 Aug 2022
Biologically Plausible Training of Deep Neural Networks Using a Top-down
  Credit Assignment Network
Biologically Plausible Training of Deep Neural Networks Using a Top-down Credit Assignment Network
Jian-Hui Chen
Cheng-Lin Liu
Zuoren Wang
28
0
0
01 Aug 2022
Robot Policy Learning from Demonstration Using Advantage Weighting and
  Early Termination
Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination
A. Mohtasib
Gerhard Neumann
Heriberto Cuayáhuitl
OffRL
44
2
0
31 Jul 2022
Unified Automatic Control of Vehicular Systems with Reinforcement
  Learning
Unified Automatic Control of Vehicular Systems with Reinforcement Learning
Zhongxia Yan
Abdul Rahman Kreidieh
Eugene Vinitsky
Alexandre M. Bayen
Cathy Wu
AI4CE
32
41
0
30 Jul 2022
Meta Reinforcement Learning with Successor Feature Based Context
Meta Reinforcement Learning with Successor Feature Based Context
Xu Han
Feng Wu
OffRL
LRM
42
3
0
29 Jul 2022
An Enhanced Graph Representation for Machine Learning Based Automatic
  Intersection Management
An Enhanced Graph Representation for Machine Learning Based Automatic Intersection Management
Marvin Klimke
Jasper Gerigk
Benjamin Völz
M. Buchholz
30
9
0
18 Jul 2022
Asset Allocation: From Markowitz to Deep Reinforcement Learning
Asset Allocation: From Markowitz to Deep Reinforcement Learning
Ricard Durall
21
4
0
14 Jul 2022
Vessel-following model for inland waterways based on deep reinforcement
  learning
Vessel-following model for inland waterways based on deep reinforcement learning
Fabian Hart
Ostap Okhrin
M. Treiber
48
11
0
07 Jul 2022
Offline RL Policies Should be Trained to be Adaptive
Offline RL Policies Should be Trained to be Adaptive
Dibya Ghosh
Anurag Ajay
Pulkit Agrawal
Sergey Levine
OffRL
35
45
0
05 Jul 2022
Goal-Conditioned Generators of Deep Policies
Goal-Conditioned Generators of Deep Policies
Francesco Faccio
Vincent Herrmann
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
42
8
0
04 Jul 2022
General Policy Evaluation and Improvement by Learning to Identify Few
  But Crucial States
General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States
Francesco Faccio
Aditya A. Ramesh
Vincent Herrmann
J. Harb
Jürgen Schmidhuber
OffRL
44
8
0
04 Jul 2022
Asynchronous Curriculum Experience Replay: A Deep Reinforcement Learning
  Approach for UAV Autonomous Motion Control in Unknown Dynamic Environments
Asynchronous Curriculum Experience Replay: A Deep Reinforcement Learning Approach for UAV Autonomous Motion Control in Unknown Dynamic Environments
Zijian Hu
Xiao-guang Gao
Kaifang Wan
Qianglong Wang
Yiwei Zhai
40
10
0
04 Jul 2022
USHER: Unbiased Sampling for Hindsight Experience Replay
USHER: Unbiased Sampling for Hindsight Experience Replay
Liam Schramm
Yunfu Deng
Edgar Granados
Abdeslam Boularias
19
4
0
03 Jul 2022
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Edoardo Cetin
Philip J. Ball
Steve Roberts
Oya Celiktutan
40
36
0
03 Jul 2022
Learning fast and agile quadrupedal locomotion over complex terrain
Learning fast and agile quadrupedal locomotion over complex terrain
Xu Chang
Zhitong Zhang
Honglei An
Hongxu Ma
Qing Wei
29
0
0
02 Jul 2022
Generalized Policy Improvement Algorithms with Theoretically Supported
  Sample Reuse
Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
32
2
0
28 Jun 2022
Analysis of Stochastic Processes through Replay Buffers
Analysis of Stochastic Processes through Replay Buffers
Shirli Di-Castro Shashua
Shie Mannor
Dotan Di-Castro
36
6
0
26 Jun 2022
Interactive Visual Reasoning under Uncertainty
Interactive Visual Reasoning under Uncertainty
Manjie Xu
Guangyuan Jiang
Wei Liang
Song-Chun Zhu
Yixin Zhu
LRM
49
5
0
18 Jun 2022
Fast Population-Based Reinforcement Learning on a Single Machine
Fast Population-Based Reinforcement Learning on a Single Machine
Arthur Flajolet
Claire Bizon Monroc
Karim Beguir
Thomas Pierrot
OffRL
35
10
0
17 Jun 2022
SMPL: Simulated Industrial Manufacturing and Process Control Learning
  Environments
SMPL: Simulated Industrial Manufacturing and Process Control Learning Environments
Mohan Zhang
Xiaozhou Wang
Benjamin Decardi-Nelson
Bo Song
A. Zhang
...
Jiayi Cheng
Xiaohong Liu
DengDeng Yu
Matthew Poon
Animesh Garg
26
4
0
17 Jun 2022
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement
  Learning
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen
Tianhao Wu
Shengjie Wang
Xidong Feng
Jiechuan Jiang
...
Yiran Geng
Hao Dong
Zongqing Lu
Song-Chun Zhu
Yaodong Yang
OffRL
51
110
0
17 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
42
141
0
15 Jun 2022
Safe-FinRL: A Low Bias and Variance Deep Reinforcement Learning
  Implementation for High-Freq Stock Trading
Safe-FinRL: A Low Bias and Variance Deep Reinforcement Learning Implementation for High-Freq Stock Trading
Zitao Song
Xuyang Jin
Chenliang Li
OffRL
AIFin
29
1
0
13 Jun 2022
Challenges and Opportunities in Offline Reinforcement Learning from
  Visual Observations
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
Cong Lu
Philip J. Ball
Tim G. J. Rudner
Jack Parker-Holder
Michael A. Osborne
Yee Whye Teh
OffRL
32
52
0
09 Jun 2022
Overcoming the Spectral Bias of Neural Value Approximation
Overcoming the Spectral Bias of Neural Value Approximation
Ge Yang
Anurag Ajay
Pulkit Agrawal
36
25
0
09 Jun 2022
Biologically Inspired Dynamic Thresholds for Spiking Neural Networks
Biologically Inspired Dynamic Thresholds for Spiking Neural Networks
Jianchuan Ding
B. Dong
Felix Heide
Yufei Ding
Yunduo Zhou
Baocai Yin
Xin Yang
20
24
0
09 Jun 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on
  Exploration and Performance
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance
Jakob J. Hollenstein
Sayantan Auddy
Matteo Saveriano
Erwan Renaudo
J. Piater
46
17
0
08 Jun 2022
Meta-Learning Parameterized Skills
Meta-Learning Parameterized Skills
Haotian Fu
Shangqun Yu
Saket Tiwari
Michael Littman
George Konidaris
38
6
0
07 Jun 2022
Robust Adversarial Attacks Detection based on Explainable Deep
  Reinforcement Learning For UAV Guidance and Planning
Robust Adversarial Attacks Detection based on Explainable Deep Reinforcement Learning For UAV Guidance and Planning
Tom Hickling
Nabil Aouf
P. Spencer
AAML
30
50
0
06 Jun 2022
Offline RL for Natural Language Generation with Implicit Language Q
  Learning
Offline RL for Natural Language Generation with Implicit Language Q Learning
Charles Burton Snell
Ilya Kostrikov
Yi Su
Mengjiao Yang
Sergey Levine
OffRL
144
103
0
05 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to
  Accelerate Progress
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
37
63
0
03 Jun 2022
Deep Transformer Q-Networks for Partially Observable Reinforcement
  Learning
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
Kevin Esslinger
Robert W. Platt
Chris Amato
OffRL
35
35
0
02 Jun 2022
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation
  with Residual Actor
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor
Wanqi Xue
Qingpeng Cai
Ruohan Zhan
Dong Zheng
Peng Jiang
Kun Gai
Bo An
OffRL
32
24
0
01 Jun 2022
On the Robustness of Safe Reinforcement Learning under Observational
  Perturbations
On the Robustness of Safe Reinforcement Learning under Observational Perturbations
Zuxin Liu
Zijian Guo
Zhepeng Cen
Huan Zhang
Jie Tan
Bo-wen Li
Ding Zhao
OOD
OffRL
48
35
0
29 May 2022
Constrained Reinforcement Learning for Short Video Recommendation
Constrained Reinforcement Learning for Short Video Recommendation
Qingpeng Cai
Ruohan Zhan
Chi Zhang
Jie Zheng
Guangwei Ding
Pinghua Gong
Dong Zheng
Peng Jiang
33
6
0
26 May 2022
Memory-efficient Reinforcement Learning with Value-based Knowledge
  Consolidation
Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation
Qingfeng Lan
Yangchen Pan
Jun Luo
A. R. Mahmood
OffRL
36
8
0
22 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still
  Insufficient according to an Off-Policy Measure
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
43
8
0
20 May 2022
Previous
123...8910...151617
Next