ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline
  Reinforcement Learning
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Jianye Hao
Zhuoran Yang
Bin Zhao
Zhen Wang
Xuelong Li
OffRL
84
9
0
30 Apr 2024
Evaluating Collaborative Autonomy in Opposed Environments using Maritime
  Capture-the-Flag Competitions
Evaluating Collaborative Autonomy in Opposed Environments using Maritime Capture-the-Flag Competitions
Jordan Beason
Michael Novitzky
John Kliem
Tyler Errico
Zachary Serlin
...
Michael R. Benjamin
Prithviraj Dasgupta
Peter Crowley
Charles O'Donnell
John James
69
2
0
25 Apr 2024
DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks
DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks
Tongzhou Mu
Minghua Liu
Hao Su
OffRL
88
4
0
25 Apr 2024
Offline Reinforcement Learning with Behavioral Supervisor Tuning
Offline Reinforcement Learning with Behavioral Supervisor Tuning
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
67
2
0
25 Apr 2024
A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
Bram De Cooman
Johan A. K. Suykens
98
0
0
25 Apr 2024
AFU: Actor-Free critic Updates in off-policy RL for continuous control
AFU: Actor-Free critic Updates in off-policy RL for continuous control
Nicolas Perrin-Gilbert
OffRL
108
0
0
24 Apr 2024
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
Lang Qin
Ziming Wang
Runhao Jiang
Rui Yan
Huajin Tang
68
1
0
24 Apr 2024
Cache-Aware Reinforcement Learning in Large-Scale Recommender Systems
Cache-Aware Reinforcement Learning in Large-Scale Recommender Systems
Xiaoshuang Chen
Gengrui Zhang
Yao Wang
Yulin Wu
Shuo Su
Kaiqiao Zhan
Ben Wang
OffRL
83
2
0
23 Apr 2024
Towards Multi-Morphology Controllers with Diversity and Knowledge
  Distillation
Towards Multi-Morphology Controllers with Diversity and Knowledge Distillation
Alican Mertan
Nick Cheney
104
0
0
22 Apr 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for
  Mobile Edge Computing, its Applications, and Future Research Trajectories
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
104
9
0
22 Apr 2024
Explicit Lipschitz Value Estimation Enhances Policy Robustness Against
  Perturbation
Explicit Lipschitz Value Estimation Enhances Policy Robustness Against Perturbation
Xulin Chen
Ruipeng Liu
Garret E. Katz
74
0
0
22 Apr 2024
Adaptive Regularization of Representation Rank as an Implicit Constraint
  of Bellman Equation
Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation
Qiang He
Dinesh Manocha
Meng Fang
S. Maghsudi
93
3
0
19 Apr 2024
Learning to Cut via Hierarchical Sequence/Set Model for Efficient
  Mixed-Integer Programming
Learning to Cut via Hierarchical Sequence/Set Model for Efficient Mixed-Integer Programming
Jie Wang
Zhihai Wang
Xijun Li
Yufei Kuang
Zhihao Shi
Fangzhou Zhu
Mingxuan Yuan
Jianguo Zeng
Yongdong Zhang
Feng Wu
83
8
0
19 Apr 2024
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement
  Learning Agents
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents
Chen Gong
Kecen Li
Jin Yao
Tianhao Wang
OnRL
75
1
0
18 Apr 2024
Trajectory Planning for Autonomous Vehicle Using Iterative Reward
  Prediction in Reinforcement Learning
Trajectory Planning for Autonomous Vehicle Using Iterative Reward Prediction in Reinforcement Learning
Hyunwoo Park
70
0
0
18 Apr 2024
Actor-Critic Reinforcement Learning with Phased Actor
Actor-Critic Reinforcement Learning with Phased Actor
Ruofan Wu
Junmin Zhong
Jennie Si
39
0
0
18 Apr 2024
Continual Offline Reinforcement Learning via Diffusion-based Dual
  Generative Replay
Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay
Jinmei Liu
Wenbin Li
Xiangyu Yue
Shilin Zhang
Chunlin Chen
Zhi Wang
OffRLDiffM
75
6
0
16 Apr 2024
Continuous Control Reinforcement Learning: Distributed Distributional
  DrQ Algorithms
Continuous Control Reinforcement Learning: Distributed Distributional DrQ Algorithms
Zehao Zhou
OffRL
31
0
0
16 Apr 2024
Offline Trajectory Generalization for Offline Reinforcement Learning
Offline Trajectory Generalization for Offline Reinforcement Learning
Ziqi Zhao
Zhaochun Ren
Liu Yang
Fajie Yuan
Pengjie Ren
Zhumin Chen
Jun Ma
Xin Xin
OffRL
75
1
0
16 Apr 2024
Developing An Attention-Based Ensemble Learning Framework for Financial
  Portfolio Optimisation
Developing An Attention-Based Ensemble Learning Framework for Financial Portfolio Optimisation
Zhenglong Li
Vincent Tam
95
1
0
13 Apr 2024
Enhancing Policy Gradient with the Polyak Step-Size Adaption
Enhancing Policy Gradient with the Polyak Step-Size Adaption
Yunxiang Li
Rui Yuan
Chen Fan
Mark Schmidt
Samuel Horváth
Robert Mansel Gower
Martin Takávc
72
0
0
11 Apr 2024
Generative Probabilistic Planning for Optimizing Supply Chain Networks
Generative Probabilistic Planning for Optimizing Supply Chain Networks
Hyung-il Ahn
Santiago Olivar
Hershel Mehta
Young Chol Song
61
0
0
11 Apr 2024
UAV-enabled Collaborative Beamforming via Multi-Agent Deep Reinforcement
  Learning
UAV-enabled Collaborative Beamforming via Multi-Agent Deep Reinforcement Learning
Saichao Liu
Geng Sun
Jiahui Li
Shuang Liang
Qingqing Wu
Pengfei Wang
Dusit Niyato
81
6
0
11 Apr 2024
Rethinking Out-of-Distribution Detection for Reinforcement Learning:
  Advancing Methods for Evaluation and Detection
Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection
L. Nasvytis
Kai Sandbrink
Jakob N. Foerster
Tim Franzmeyer
Christian Schroeder de Witt
OffRLOODD
51
9
0
10 Apr 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey
  and Unifying Perspective
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
121
8
0
09 Apr 2024
Computing Transition Pathways for the Study of Rare Events Using Deep
  Reinforcement Learning
Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning
Bo Lin
Yangzheng Zhong
Weiqing Ren
50
0
0
08 Apr 2024
IA2: Leveraging Instance-Aware Index Advisor with Reinforcement Learning
  for Diverse Workloads
IA2: Leveraging Instance-Aware Index Advisor with Reinforcement Learning for Diverse Workloads
Taiyi Wang
Eiko Yoneki
53
2
0
08 Apr 2024
Model-based Reinforcement Learning for Parameterized Action Spaces
Model-based Reinforcement Learning for Parameterized Action Spaces
Renhao Zhang
Haotian Fu
Yilin Miao
George Konidaris
75
3
0
03 Apr 2024
AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning
  with Value-based Dataset
AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based Dataset
Dongsu Lee
Chanin Eom
Minhae Kwon
GPOffRL
43
9
0
03 Apr 2024
Imitation Game: A Model-based and Imitation Learning Deep Reinforcement
  Learning Hybrid
Imitation Game: A Model-based and Imitation Learning Deep Reinforcement Learning Hybrid
Eric M. S. P. Veith
Torben Logemann
Aleksandr Berezin
Arlena Wellßow
Stephan Balduin
66
2
0
02 Apr 2024
Learning to Control Camera Exposure via Reinforcement Learning
Learning to Control Camera Exposure via Reinforcement Learning
Kyunghyun Lee
Ukcheol Shin
Byeong-uk Lee
75
4
0
02 Apr 2024
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Ya-Chien Chang
Sicun Gao
123
0
0
02 Apr 2024
Entity-Centric Reinforcement Learning for Object Manipulation from
  Pixels
Entity-Centric Reinforcement Learning for Object Manipulation from Pixels
Dan Haramati
Tal Daniel
Aviv Tamar
LM&RoOffRLOCL
88
14
0
01 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active
  Online Exploration
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRLOnRL
90
0
0
31 Mar 2024
Biologically-Plausible Topology Improved Spiking Actor Network for
  Efficient Deep Reinforcement Learning
Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning
Duzhen Zhang
Qingyu Wang
Tielin Zhang
Bo Xu
324
2
0
29 Mar 2024
Closed-form congestion control via deep symbolic regression
Closed-form congestion control via deep symbolic regression
Jean Martins
Igor Almeida
Ricardo Souza
Silvia Lins
27
0
0
28 Mar 2024
Retentive Decision Transformer with Adaptive Masking for Reinforcement
  Learning based Recommendation Systems
Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems
Siyu Wang
Xiaocong Chen
Lina Yao
OffRL
89
2
0
26 Mar 2024
Exploring CausalWorld: Enhancing robotic manipulation via knowledge
  transfer and curriculum learning
Exploring CausalWorld: Enhancing robotic manipulation via knowledge transfer and curriculum learning
Xinrui Wang
Yan Jin
100
2
0
25 Mar 2024
Parametric PDE Control with Deep Reinforcement Learning and
  Differentiable L0-Sparse Polynomial Policies
Parametric PDE Control with Deep Reinforcement Learning and Differentiable L0-Sparse Polynomial Policies
N. Botteghi
Urban Fasel
AI4CE
103
6
0
22 Mar 2024
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation,
  Transferable Reward Recovery and Algebraic Equilibrium Proof
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation, Transferable Reward Recovery and Algebraic Equilibrium Proof
Yangchun Zhang
Qiang Liu
Weiming Li
Yirui Zhou
76
0
0
21 Mar 2024
POLICEd RL: Learning Closed-Loop Robot Control Policies with Provable
  Satisfaction of Hard Constraints
POLICEd RL: Learning Closed-Loop Robot Control Policies with Provable Satisfaction of Hard Constraints
Jean-Baptiste Bouvier
Kartik Nagpal
Negar Mehr
87
5
0
20 Mar 2024
Simple Ingredients for Offline Reinforcement Learning
Simple Ingredients for Offline Reinforcement Learning
Edoardo Cetin
Andrea Tirinzoni
Matteo Pirotta
A. Lazaric
Yann Ollivier
Ahmed Touati
OffRL
106
2
0
19 Mar 2024
FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal
  Footstep Planning and Forecasting
FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal Footstep Planning and Forecasting
Clément Gaspard
G. Passault
Mélodie Daniel
Olivier Ly
42
1
0
19 Mar 2024
Phasic Diversity Optimization for Population-Based Reinforcement
  Learning
Phasic Diversity Optimization for Population-Based Reinforcement Learning
Jingcheng Jiang
Haiyin Piao
Yu Fu
Yihang Hao
Chuanlu Jiang
Ziqi Wei
Xin Yang
60
0
0
17 Mar 2024
Scheduling Drone and Mobile Charger via Hybrid-Action Deep Reinforcement
  Learning
Scheduling Drone and Mobile Charger via Hybrid-Action Deep Reinforcement Learning
Jizhe Dou
Haotian Zhang
Guodong Sun
89
0
0
16 Mar 2024
Stimulate the Potential of Robots via Competition
Stimulate the Potential of Robots via Competition
K. Huang
Di Guo
Xinyu Zhang
Xiangyang Ji
Huaping Liu
95
3
0
15 Mar 2024
Offline Goal-Conditioned Reinforcement Learning for Shape Control of
  Deformable Linear Objects
Offline Goal-Conditioned Reinforcement Learning for Shape Control of Deformable Linear Objects
Rita Laezza
Mohammadreza Shetab-Bushehri
Gabriel Arslan Waltersson
Erol Özgür
Y. Mezouar
Y. Karayiannidis
OffRL
80
1
0
15 Mar 2024
Quality-Diversity Actor-Critic: Learning High-Performing and Diverse
  Behaviors via Value and Successor Features Critics
Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics
Luca Grillotti
Maxence Faldor
Borja G. León
Antoine Cully
118
3
0
15 Mar 2024
Symmetric Q-learning: Reducing Skewness of Bellman Error in Online
  Reinforcement Learning
Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning
Motoki Omura
Takayuki Osa
Yusuke Mukuta
Tatsuya Harada
OffRL
46
0
0
12 Mar 2024
A2PO: Towards Effective Offline Reinforcement Learning from an
  Advantage-aware Perspective
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective
Yunpeng Qing
Shunyu Liu
Jingyuan Cong
Kaixuan Chen
Yihe Zhou
Mingli Song
OffRL
110
5
0
12 Mar 2024
Previous
123...91011...424344
Next