Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Multi-fidelity Reinforcement Learning Control for Complex Dynamical Systems
Luning Sun
Xin-Yang Liu
Siyan Zhao
Aditya Grover
Jian-Xun Wang
Jayaraman J. Thiagarajan
AI4CE
108
0
0
08 Apr 2025
xMTF: A Formula-Free Model for Reinforcement-Learning-Based Multi-Task Fusion in Recommender Systems
Yang Cao
Changhao Zhang
Xiaoshuang Chen
Kaiqiao Zhan
Ben Wang
69
1
0
08 Apr 2025
TW-CRL: Time-Weighted Contrastive Reward Learning for Efficient Inverse Reinforcement Learning
Yuxuan Li
Yicheng Gao
Ning Yang
Stephen Xia
OffRL
131
0
0
08 Apr 2025
Stratified Expert Cloning with Adaptive Selection for User Retention in Large-Scale Recommender Systems
Chengzhi Lin
Annan Xie
Shuchang Liu
Wuhong Wang
Chuyuan Wang
Yongqi Liu
OffRL
59
0
0
08 Apr 2025
AEGIS: Human Attention-based Explainable Guidance for Intelligent Vehicle Systems
Zhuoli Zhuang
Cheng-You Lu
Yu-Cheng Chang
Yu-Kai Wang
T. Do
Chin-Teng Lin
107
0
0
08 Apr 2025
A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks
L. Felizardo
Edoardo Fadda
Paolo Brandimarte
E. Del-Moral-Hernandez
Mariá Cristina Vasconcelos Nascimento
OffRL
79
0
0
07 Apr 2025
Deep Reinforcement Learning Algorithms for Option Hedging
Andrei Neagu
Frédéric Godin
Leila Kosseim
78
0
0
07 Apr 2025
MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning
Shiguang Sun
Hanbo Zhang
Zeyang Liu
Xinrui Yang
Lipeng Wan
Bing Yan
Xingyu Chen
219
0
0
05 Apr 2025
Exploration-Driven Generative Interactive Environments
N. Savov
Naser Kazemi
Mohammad Mahdi
Danda Pani Paudel
Xi Wang
Luc Van Gool
VGen
3DV
107
1
0
03 Apr 2025
Overcoming Deceptiveness in Fitness Optimization with Unsupervised Quality-Diversity
Lisa Coiffard
Paul Templier
Antoine Cully
OffRL
123
0
0
02 Apr 2025
Immersive Explainability: Visualizing Robot Navigation Decisions through XAI Semantic Scene Projections in Virtual Reality
Jorge de Heuvel
Sebastian Müller
Marlene Wessels
Aftab Akhtar
Christian Bauckhage
Maren Bennewitz
73
0
0
01 Apr 2025
MPCritic: A plug-and-play MPC architecture for reinforcement learning
Nathan P. Lawrence
Thomas Banker
Ali Mesbah
101
0
0
01 Apr 2025
A Reactive Framework for Whole-Body Motion Planning of Mobile Manipulators Combining Reinforcement Learning and SDF-Constrained Quadratic Programmi
Chenyu Zhang
Shiying Sun
Kuan Liu
Chuanbao Zhou
Xiaoguang Zhao
M. Tan
Yuanmin Huang
91
0
0
31 Mar 2025
RL2Grid: Benchmarking Reinforcement Learning in Power Grid Operations
Enrico Marchesini
Benjamin Donnot
Constance Crozier
Ian Dytham
Christian Merz
Lars Schewe
Nico Westerbeck
Cathy Wu
Antoine Marot
P. Donti
OffRL
84
1
0
29 Mar 2025
NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios
Songyi Gao
Zuolin Tu
Rong-Jun Qin
Yi-Hao Sun
Xiong-Hui Chen
Yang Yu
OffRL
79
0
0
25 Mar 2025
Reinforcement Learning for Adaptive Planner Parameter Tuning: A Perspective on Hierarchical Architecture
Lu Wangtao
Wei Yufei
Xu Jiadong
Jia Wenhao
Li Liang
Xiong Rong
Wang Yue
82
0
0
24 Mar 2025
Sample-Efficient Reinforcement Learning of Koopman eNMPC
Daniel Mayfrank
M. Velioglu
Alexander Mitsos
Manuel Dahmen
OffRL
89
0
0
24 Mar 2025
Evolutionary Policy Optimization
Jianren Wang
Yifan Su
Abhinav Gupta
Deepak Pathak
82
0
0
24 Mar 2025
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
Yexin Li
Pring Wong
Hanfang Zhang
Shuo Chen
Siyuan Qi
OffRL
86
1
0
23 Mar 2025
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
Maria Makarova
Qian Liu
Dzmitry Tsetserukou
OffRL
74
0
0
20 Mar 2025
FLEX: A Framework for Learning Robot-Agnostic Force-based Skills Involving Sustained Contact Object Manipulation
Shijie Fang
Wenchang Gao
Shivam Goel
Christopher Thierauf
Matthias Scheutz
Jivko Sinapov
95
0
0
17 Mar 2025
Evaluation-Time Policy Switching for Offline Reinforcement Learning
Natinael Solomon Neggatu
Jeremie Houssineau
Giovanni Montana
OffRL
OnRL
110
0
0
15 Mar 2025
Low-cost Real-world Implementation of the Swing-up Pendulum for Deep Reinforcement Learning Experiments
Peter Böhm
Pauline Pounds
Archie C. Chapman
70
0
0
14 Mar 2025
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Moritz A. Zanger
Pascal R. van der Vaart
Wendelin Bohmer
M. Spaan
UQCV
BDL
507
2
0
14 Mar 2025
Sketch-to-Skill: Bootstrapping Robot Learning with Human Drawn Trajectory Sketches
Peihong Yu
Amisha Bhaskar
Anukriti Singh
Zahiruddin Mahammad
Pratap Tokekar
96
2
0
14 Mar 2025
Training Directional Locomotion for Quadrupedal Low-Cost Robotic Systems via Deep Reinforcement Learning
Peter Böhm
Archie C. Chapman
Pauline Pounds
166
0
0
14 Mar 2025
Temporal Difference Flows
Jesse Farebrother
Matteo Pirotta
Andrea Tirinzoni
Rémi Munos
A. Lazaric
Ahmed Touati
AI4TS
AIFin
166
1
0
12 Mar 2025
The Impact of VR and 2D Interfaces on Human Feedback in Preference-Based Robot Learning
Jorge de Heuvel
Daniel Marta
Simon Holk
Iolanda Leite
Maren Bennewitz
101
1
0
11 Mar 2025
Goal Conditioned Reinforcement Learning for Photo Finishing Tuning
Jiarui Wu
Yujin Wang
Lingen Li
Zhang Fan
Tianfan Xue
91
0
0
10 Mar 2025
PER-DPP Sampling Framework and Its Application in Path Planning
Junzhe Wang
65
0
0
10 Mar 2025
Performance Comparisons of Reinforcement Learning Algorithms for Sequential Experimental Design
Yasir Zubayr Barlas
Kizito Salako
59
1
0
07 Mar 2025
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
Hyungkyu Kang
Min-hwan Oh
OffRL
119
0
0
07 Mar 2025
Policy Constraint by Only Support Constraint for Offline Reinforcement Learning
Yunkai Gao
Jiaming Guo
Fan Wu
Rui Zhang
OffRL
116
0
0
07 Mar 2025
High-Precision Transformer-Based Visual Servoing for Humanoid Robots in Aligning Tiny Objects
Jialong Xue
Wei Gao
Yu Wang
Chao Ji
Dongdong Zhao
Shi Yan
Shiwu Zhang
89
1
0
06 Mar 2025
Supervised Visual Docking Network for Unmanned Surface Vehicles Using Auto-labeling in Real-world Water Environments
Yijie Chu
Ziniu Wu
Yong Yue
Eng Gee Lim
Paolo Paoletti
Xiaohui Zhu
59
0
0
05 Mar 2025
Variable-Friction In-Hand Manipulation for Arbitrary Objects via Diffusion-Based Imitation Learning
Qiyang Yan
Zihan Ding
Xin Zhou
Adam J. Spiers
75
1
0
04 Mar 2025
Enhancing Deep Reinforcement Learning-based Robot Navigation Generalization through Scenario Augmentation
Shanze Wang
Mingao Tan
Zhiyong Yang
Xinyu Wang
Xiaoyu Shen
Hailong Huang
Wei Zhang
99
0
0
03 Mar 2025
Behavior Preference Regression for Offline Reinforcement Learning
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
65
0
0
02 Mar 2025
Discrete Codebook World Models for Continuous Control
Aidan Scannell
Mohammadreza Nakhaei
Kalle Kujanpää
Yi Zhao
Kevin Sebastian Luck
Dieter Büchler
Joni Pajarinen
OffRL
91
2
0
01 Mar 2025
Never too Prim to Swim: An LLM-Enhanced RL-based Adaptive S-Surface Controller for AUVs under Extreme Sea Conditions
Guanwen Xie
Jingzehua Xu
Yimian Ding
Zhi Zhang
Shuai Zhang
Yongqian Li
76
0
0
01 Mar 2025
Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control
T. Lee
Donghwan Lee
148
0
0
28 Feb 2025
Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application
Thomas Hickling
Maxwell Hogan
Abdulla Tammam
Nabil Aouf
104
1
0
27 Feb 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel
Luca Viano
Volkan Cevher
188
1
0
27 Feb 2025
Equivariant Reinforcement Learning Frameworks for Quadrotor Low-Level Control
Beomyeol Yu
Taeyoung Lee
133
0
0
27 Feb 2025
A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications
Jefferson Silveira
Joshua A. Marshall
Sidney N. Givigi Jr
122
0
0
24 Feb 2025
A Reinforcement Learning Approach to Non-prehensile Manipulation through Sliding
Hamidreza Raei
Elena De Momi
Arash Ajoudani
146
0
0
24 Feb 2025
Yes, Q-learning Helps Offline In-Context RL
Denis Tarasov
Alexander Nikulin
Ilya Zisman
Albina Klepach
Andrei Polubarov
Nikita Lyubaykin
Alexander Derevyagin
Igor Kiselev
Vladislav Kurenkov
OffRL
OnRL
490
3
0
24 Feb 2025
TDMPBC: Self-Imitative Reinforcement Learning for Humanoid Robot Control
Zifeng Zhuang
Diyuan Shi
Runze Suo
Xiao He
Hongyin Zhang
Ting Wang
Shangke Lyu
Donglin Wang
84
1
0
24 Feb 2025
CurricuVLM: Towards Safe Autonomous Driving via Personalized Safety-Critical Curriculum Learning with Vision-Language Models
Zihao Sheng
Zilin Huang
Yansong Qu
Yue Leng
Sruthi Bhavanam
Sikai Chen
111
4
0
24 Feb 2025
Estimating Control Barriers from Offline Data
Hongzhan Yu
Seth Farrell
Ryo Yoshimitsu
Zhizhen Qin
Henrik I. Christensen
Sicun Gao
OffRL
86
3
0
21 Feb 2025