Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
Herke van Hoof
David Meger
    OffRL

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Hyunseung Kim
Jun Jet Tai
K. Subramanian
Peter R. Wurman
Jaegul Choo
Peter Stone
Takuma Seno
OffRL
183
17
0
13 Oct 2024
HG2P: Hippocampus-inspired High-reward Graph and Model-Free Q-Gradient Penalty for Path Planning and Motion Control
Haoran Wang
Yaoru Sun
Zeshen Tang
Haibo Shi
Chenyuan Jiao
113
0
0
12 Oct 2024
TOP-ERL: Transformer-based Off-Policy Episodic Reinforcement Learning
Ge Li
Dong Tian
Hongyi Zhou
Xinkai Jiang
Rudolf Lioutikov
Gerhard Neumann
OffRL
527
4
0
12 Oct 2024
Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning
Xinran Li
Ling Pan
Jun Zhang
92
3
0
11 Oct 2024
FRASA: An End-to-End Reinforcement Learning Agent for Fall Recovery and Stand Up of Humanoid Robots
Clément Gaspard
Marc Duclusaud
G. Passault
Mélodie Daniel
Olivier Ly
90
4
0
11 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
126
5
0
11 Oct 2024
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control
Devdhar Patel
H. Siegelmann
OffRL
120
0
0
11 Oct 2024
Neuroplastic Expansion in Deep Reinforcement Learning
Jiashun Liu
J. Obando-Ceron
Rameswar Panda
L. Pan
117
6
0
10 Oct 2024
Zero-Shot Generalization of Vision-Based RL Without Data Augmentation
Sumeet Batra
Gaurav Sukhatme
OffRL, DRL
81
2
0
09 Oct 2024
A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering
Qihan Qi
Xinsong Yang
Gang Xia
Daniel W. C. Ho
Pengyang Tang
89
0
0
09 Oct 2024
Learning in complex action spaces without policy gradients
Arash Tavakoli
Sina Ghiassian
Nemanja Rakićević
OffRL
74
0
0
08 Oct 2024
Reinforcement Learning From Imperfect Corrective Actions And Proxy Rewards
Zhaohui Jiang
Xuening Feng
Paul Weng
Yifei Zhu
Yan Song
Tianze Zhou
Yujing Hu
Tangjie Lv
Changjie Fan
133
1
0
08 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
101
1
0
07 Oct 2024
Bisimulation metric for Model Predictive Control
Yutaka Shimizu
Masayoshi Tomizuka
101
0
0
06 Oct 2024
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
Zhaolin Gao
Wenhao Zhan
Jonathan D. Chang
Gokul Swamy
Kianté Brantley
Jason D. Lee
Wen Sun
OffRL
140
7
0
06 Oct 2024
Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization
Tung M. Luu
Thanh Nguyen
Tee Joshua Tian Jin
Sungwoon Kim
Chang D. Yoo
AAML
75
0
0
04 Oct 2024
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
131
5
0
03 Oct 2024
Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks
Zeyu Feng
Hao Luan
Kevin Yuchen Ma
Harold Soh
83
2
0
03 Oct 2024
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization
The Viet Bui
Thanh Hong Nguyen
Tien Mai
OffRL
118
3
0
02 Oct 2024
From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge
Xiefeng Wu
OffRL
100
2
0
02 Oct 2024
Dual Approximation Policy Optimization
Zhihan Xiong
Maryam Fazel
Lin Xiao
66
1
0
02 Oct 2024
Mimicking Human Intuition: Cognitive Belief-Driven Reinforcement Learning
Xingrui Gu
Guanren Qiao
Chuyi Jiang
OffRL
87
0
0
02 Oct 2024
Generalizability of Graph Neural Networks for Decentralized Unlabeled Motion Planning
Shreyas Muthusamy
Damian Owerko
Charilaos I. Kanatsoulis
Saurav Agarwal
Alejandro Ribeiro
58
1
0
29 Sep 2024
Double Actor-Critic with TD Error-Driven Regularization in Reinforcement Learning
Haohui Chen
Zhiyong Chen
Aoxiang Liu
Wentuo Fang
OffRL
73
1
0
28 Sep 2024
DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors
Joseph Ortiz
Antoine Dedieu
Wolfgang Lehrach
Swaroop Guntupalli
Carter Wendelken
Ahmad Humayun
Guangyao Zhou
Sivaramakrishnan Swaminathan
Miguel Lázaro-Gredilla
Kevin P. Murphy
OffRL
76
1
0
26 Sep 2024
OffRIPP: Offline RL-based Informative Path Planning
Srikar Babu Gadipudi
Srujan Deolasee
Siva Kailas
Wenhao Luo
Katia Sycara
Woojun Kim
OffRL
60
2
0
25 Sep 2024
MSARS: A Meta-Learning and Reinforcement Learning Framework for SLO Resource Allocation and Adaptive Scaling for Microservices
Kan Hu
Linfeng Wen
Minxian Xu
Kejiang Ye
73
0
0
23 Sep 2024
RPAF: A Reinforcement Prediction-Allocation Framework for Cache Allocation in Large-Scale Recommender Systems
Shuo Su
Xiaoshuang Chen
Yao Wang
Yulin Wu
Ziqiang Zhang
Kaiqiao Zhan
Ben Wang
Kun Gai
AI4TS
80
1
0
20 Sep 2024
Morphology and Behavior Co-Optimization of Modular Satellites for Attitude Control
Yuxing Wang
Jie Li
Cong Yu
Xinyang Li
Simeng Huang
Yongzhe Chang
Xueqian Wang
Bin Liang
55
0
0
20 Sep 2024
Autonomous Driving at Unsignalized Intersections: A Review of Decision-Making Challenges and Reinforcement Learning-Based Solutions
Mohammad K. Al-Sharman
Luc Edes
Bert Sun
Vishal Jayakumar
Mohamed A. Daoud
Derek Rayside
W. Melek
72
2
0
20 Sep 2024
Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning
Jonas Günster
Puze Liu
Jan Peters
Davide Tateo
OffRL
55
3
0
18 Sep 2024
Representing Positional Information in Generative World Models for Object Manipulation
Stefano Ferraro
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Sai Rajeswar
LM&Ro, OCL
66
0
0
18 Sep 2024
DIGIMON: Diagnosis and Mitigation of Sampling Skew for Reinforcement Learning based Meta-Planner in Robot Navigation
Shiwei Feng
Xuan Chen
Zhiyuan Cheng
Zikang Xiong
Yifei Gao
Siyuan Cheng
Sayali Kate
Xiangyu Zhang
OffRL
69
0
0
17 Sep 2024
Vision-driven UAV River Following: Benchmarking with Safe Reinforcement Learning
Zihan Wang
N. Mahmoudian
75
2
0
13 Sep 2024
BetterBodies: Reinforcement Learning guided Diffusion for Antibody Sequence Design
Yannick Vogt
Mehdi Naouar
M. Kalweit
Christoph Cornelius Miething
Justus Duyster
Joschka Boedecker
Gabriel Kalweit
DiffM
78
0
0
09 Sep 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Hongyao Tang
Glen Berseth
OffRL
93
2
0
07 Sep 2024
Causality-Driven Reinforcement Learning for Joint Communication and Sensing
Anik Roy
Serene Banerjee
Jishnu Sadasivan
Arnab Sarkar
Soumyajit Dey
25
0
0
07 Sep 2024
Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance
Renming Huang
Shaochong Liu
Yunqiang Pei
Peng Wang
Guoqing Wang
Yang Yang
Hengtao Shen
OffRL
78
0
0
06 Sep 2024
Simplex-enabled Safe Continual Learning Machine
H. Cao
Y. Mao
Yihao Cai
L. Sha
Marco Caccamo
78
3
0
05 Sep 2024
Sparsifying Parametric Models with L0 Regularization
N. Botteghi
Urban Fasel
86
1
0
05 Sep 2024
Surgical Task Automation Using Actor-Critic Frameworks and Self-Supervised Imitation Learning
Jingshuai Liu
Alain Andres
Yonghang Jiang
Xichun Luo
Wenmiao Shu
Sotirios A. Tsaftaris
123
0
0
04 Sep 2024
Compatible Gradient Approximations for Actor-Critic Algorithms
Baturay Saglam
Dionysis Kalogerias
134
0
0
02 Sep 2024
RAIN: Reinforcement Algorithms for Improving Numerical Weather and Climate Models
Pritthijit Nath
Henry Moss
Emily Shuckburgh
Mark Webb
AI4Cl, AI4CE
155
0
0
28 Aug 2024
Unsupervised-to-Online Reinforcement Learning
Junsu Kim
Seohong Park
Sergey Levine
OnRL
100
5
0
27 Aug 2024
Benchmarking Reinforcement Learning Methods for Dexterous Robotic Manipulation with a Three-Fingered Gripper
Elizabeth Cutler
Yuning Xing
Tony Cui
Brendan Zhou
Koen van Rijnsoever
...
David Valencia
Lee Violet C. Ong
Trevor Gee
Minas V. Liarokapis
Henry Williams
OffRL
42
0
0
27 Aug 2024
Optimizing TD3 for 7-DOF Robotic Arm Grasping: Overcoming Suboptimality with Exploration-Enhanced Contrastive Learning
Wen-Han Hsieh
Jen-Yuan Chang
49
0
0
26 Aug 2024
LSR-IGRU: Stock Trend Prediction Based on Long Short-Term Relationships and Improved GRU
Peng Zhu
Yuante Li
Yifan Hu
Qinyuan Liu
Dawei Cheng
Yuqi Liang
AIFin, AI4TS
163
6
0
26 Aug 2024
SAMBO-RL: Shifts-aware Model-based Offline Reinforcement Learning
Wang Luo
Haoran Li
Zicheng Zhang
Congying Han
Jiayu Lv
Tiande Guo
OffRL
134
1
0
23 Aug 2024
Offline Model-Based Reinforcement Learning with Anti-Exploration
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
106
0
0
20 Aug 2024
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks
Yun Qu
Boyuan Wang
Jianzhun Shao
Yuhang Jiang
Chen Chen
...
Qiang Fu
Wei Yang
Guang Yang
Lanxiao Huang
Xiangyang Ji
OffRL
106
10
0
20 Aug 2024