Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Reinforcement Twinning for Hybrid Control of Flapping-Wing Drones
Romain Poletti
Lorenzo Schena
Lilla Koloszar
Joris Degroote
M. A. Mendez
60
0
0
21 May 2025
GCNT: Graph-Based Transformer Policies for Morphology-Agnostic Reinforcement Learning
Yingbo Luo
Meibao Yao
Xueming Xiao
75
0
0
21 May 2025
Embedded Mean Field Reinforcement Learning for Perimeter-defense Game
Li Wang
Xin Yu
Xuxin Lv
Gangzheng Ai
Wenjun Wu
AAML
69
0
0
20 May 2025
SafeMove-RL: A Certifiable Reinforcement Learning Framework for Dynamic Motion Constraints in Trajectory Planning
Tengfei Liu
Haoyang Zhong
Jiazheng Hu
Tan Zhang
41
0
0
19 May 2025
Counterfactual Explanations for Continuous Action Reinforcement Learning
Shuyang Dong
Shangtong Zhang
Lu Feng
OffRL LRM
91
0
0
19 May 2025
Seeing the Unseen: How EMoE Unveils Bias in Text-to-Image Diffusion Models
Lucas Berry
Axel Brando
Wei-Di Chang
Juan Camilo Gamboa Higuera
David Meger
DiffM
58
0
0
19 May 2025
DIMM: Decoupled Multi-hierarchy Kalman Filter for 3D Object Tracking
Jirong Zha
Yuxuan Fan
Kai Li
Han Li
Chen Gao
Xinlei Chen
Yong Li
35
0
0
18 May 2025
DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained Optimization
Gang Li
Ming Lin
Tomer Galanti
Zhengzhong Tu
Tianbao Yang
93
1
0
18 May 2025
Multi-CALF: A Policy Combination Approach with Statistical Guarantees
Georgiy Malaniya
Anton Bolychev
Grigory Yaremenko
Anastasia Krasnaya
Pavel Osinenko
118
0
0
18 May 2025
A universal policy wrapper with guarantees
Anton Bolychev
Georgiy Malaniya
Grigory Yaremenko
Anastasia Krasnaya
Pavel Osinenko
OffRL
90
0
0
18 May 2025
SAINT: Attention-Based Modeling of Sub-Action Dependencies in Multi-Action Policies
Matthew Landers
Taylor W. Killian
Thomas Hartvigsen
Afsaneh Doryab
61
0
0
17 May 2025
Retrospex: Language Agent Meets Offline Reinforcement Learning Critic
Yufei Xiang
Yiqun Shen
Yeqin Zhang
Cam-Tu Nguyen
OffRL LLMAG KELM LRM
226
3
0
17 May 2025
Bi-Level Policy Optimization with Nyström Hypergradients
Arjun Prakash
Naicheng He
Denizalp Goktas
Amy Greenwald
71
0
0
16 May 2025
ShiQ: Bringing back Bellman to LLMs
Pierre Clavier
Nathan Grinsztajn
Raphaël Avalos
Yannis Flet-Berliac
Irem Ergun
...
Eugene Tarassov
Olivier Pietquin
Pierre Harvey Richemond
Florian Strub
Matthieu Geist
OffRL
64
0
0
16 May 2025
Exploration by Random Distribution Distillation
Zhirui Fang
Kai Yang
Jian Tao
Jiafei Lyu
Lusong Li
Li Shen
Xiu Li
117
1
0
16 May 2025
Zero-Shot Visual Generalization in Robot Manipulation
Sumeet Batra
Gaurav Sukhatme
75
0
0
16 May 2025
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts
Jing-Cheng Pang
Kaiyuan Li
Yansen Wang
Si-Hang Yang
Shengyi Jiang
Yang Yu
OffRL LLMAG LM&Ro LRM
65
0
0
15 May 2025
Monte Carlo Beam Search for Actor-Critic Reinforcement Learning in Continuous Control
Hazim Alzorgan
Abolfazl Razi
58
0
0
13 May 2025
A critical assessment of reinforcement learning methods for microswimmer navigation in complex flows
Selim Mecanna
Aurore Loisy
Christophe Eloy
88
0
0
08 May 2025
Optimization of Infectious Disease Intervention Measures Based on Reinforcement Learning - Empirical analysis based on UK COVID-19 epidemic data
Baida Zhang
Yakai Chen
Huichun Li
Zhenghu Zu
57
0
0
07 May 2025
Adaptive and Robust DBSCAN with Multi-agent Reinforcement Learning
Hao Peng
Xiang Huang
Shuo Sun
Ruitong Zhang
Philip S. Yu
80
0
0
07 May 2025
Joint Resource Management for Energy-efficient UAV-assisted SWIPT-MEC: A Deep Reinforcement Learning Approach
Yue Chen
Hui Kang
Jiahui Li
Geng Sun
Boxiong Wang
Jiacheng Wang
Cong Liang
Shuang Liang
Dusit Niyato
230
0
0
06 May 2025
Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks: The GATTACA Framework
Andrzej Mizera
Jakub Zarzycki
GNN AI4CE
81
0
0
05 May 2025
Aerodynamic and structural airfoil shape optimisation via Transfer Learning-enhanced Deep Reinforcement Learning
David Ramos
Lucas Lacasa
E. Valero
G. Rubio
AI4CE
109
0
0
05 May 2025
Constructing an Optimal Behavior Basis for the Option Keyboard
L. N. Alegre
A. Bazzan
André Barreto
Bruno C. da Silva
53
0
0
01 May 2025
FAST-Q: Fast-track Exploration with Adversarially Balanced State Representations for Counterfactual Action Estimation in Offline Reinforcement Learning
Pulkit Agrawal
Rukma Talwadker
Aditya Pareek
Tridib Mukherjee
OffRL
91
0
0
30 Apr 2025
Model-based controller assisted domain randomization in deep reinforcement learning: application to nonlinear powertrain control
Heisei Yonezawa
Ansei Yonezawa
Itsuro Kajiwara
76
0
0
28 Apr 2025
Interactive Double Deep Q-network: Integrating Human Interventions and Evaluative Predictions in Reinforcement Learning of Autonomous Driving
Alkis Sygkounas
Ioannis Athanasiadis
Andreas Persson
Michael Felsberg
Amy Loutfi
OffRL
99
0
0
28 Apr 2025
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments
Yun Qu
Wenjie Wang
Yixiu Mao
Yiqin Lv
Xiangyang Ji
TTA
170
0
0
27 Apr 2025
Dynamic Action Interpolation: A Universal Approach for Accelerating Reinforcement Learning with Expert Guidance
Wenjun Cao
86
0
0
26 Apr 2025
KETCHUP: K-Step Return Estimation for Sequential Knowledge Distillation
Jiabin Fan
Guoqing Luo
Michael Bowling
Lili Mou
OffRL
140
0
0
26 Apr 2025
BiasBench: A reproducible benchmark for tuning the biases of event cameras
Andreas Ziegler
David Joseph
Thomas Gossard
Emil Moldovan
A. Zell
63
0
0
25 Apr 2025
Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
Mingqi Yuan
Qi Wang
Guozheng Ma
Yue Liu
Xin Jin
Yunbo Wang
Xiaokang Yang
Wenjun Zeng
D. Tao
OffRL AI4CE
106
0
0
24 Apr 2025
Hybrid Reinforcement Learning and Model Predictive Control for Adaptive Control of Hydrogen-Diesel Dual-Fuel Combustion
Julian Bedei
Murray McBain
Alexander Winkler
C. R. Koch
Jakob Andert
David C. Gordon
AI4CE
50
0
0
23 Apr 2025
CaRoSaC: A Reinforcement Learning-Based Kinematic Control of Cable-Driven Parallel Robots by Addressing Cable Sag through Simulation
Rohit Dhakate
Thomas Jantos
Eren Allak
Stephan Weiss
J. Steinbrener
78
0
0
22 Apr 2025
AlphaGrad: Non-Linear Gradient Normalization Optimizer
Soham Sane
ODL
143
0
0
22 Apr 2025
Never too Cocky to Cooperate: An FIM and RL-based USV-AUV Collaborative System for Underwater Tasks in Extreme Sea Conditions
Jingzehua Xu
Guanwen Xie
Jiwei Tang
Yimian Ding
Weiyi Liu
Shuai Zhang
Yongqian Li
47
0
0
21 Apr 2025
Learning to Reason under Off-Policy Guidance
Jianhao Yan
Yafu Li
Zican Hu
Zhi Wang
Ganqu Cui
Xiaoye Qu
Yu Cheng
Yue Zhang
OffRL LRM
165
17
0
21 Apr 2025
Accelerating Visual Reinforcement Learning with Separate Primitive Policy for Peg-in-Hole Tasks
Zichun Xu
Zhaomin Wang
Yuntao Li
Lei Zhuang
Zhiyuan Zhao
Guocai Yang
Jingdong Zhao
57
0
0
21 Apr 2025
Improving Sequential Recommenders through Counterfactual Augmentation of System Exposure
Ziqi Zhao
Zhaochun Ren
Jiyuan Yang
Zuming Yan
Zihan Wang
Liu Yang
Pengjie Ren
Zhumin Chen
Maarten de Rijke
Xin Xin
CML
115
1
0
18 Apr 2025
Coordinating Spinal and Limb Dynamics for Enhanced Sprawling Robot Mobility
Merve Atasever
Ali Okhovat
Azhang Nazaripouya
John Nisbet
Omer Kurkutlu
Jyotirmoy V. Deshmukh
Yasemin Ozkan Aydin
24
0
0
18 Apr 2025
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
118
1
0
17 Apr 2025
pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild
Jonas Myhre Schiøtt
Viktor Sebastian Petersen
Dimitrios P. Papadopoulos
VLM
127
0
0
16 Apr 2025
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
Xuyang Chen
Guojian Wang
Keyu Yan
Lin Zhao
OffRL
94
1
0
16 Apr 2025
A Clean Slate for Offline Reinforcement Learning
Matthew Jackson
Uljad Berdica
Jarek Liesen
Shimon Whiteson
Jakob Foerster
OffRL OnRL
91
1
0
15 Apr 2025
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Ukjo Hwang
Songnam Hong
OffRL
76
0
0
14 Apr 2025
HEAT: History-Enhanced Dual-phase Actor-Critic Algorithm with A Shared Transformer
Hong Yang
OffRL
62
0
0
13 Apr 2025
State Estimation Using Particle Filtering in Adaptive Machine Learning Methods: Integrating Q-Learning and NEAT Algorithms with Noisy Radar Measurements
Wonjin Song
Feng Bao
64
0
0
10 Apr 2025
GRAIN: Multi-Granular and Implicit Information Aggregation Graph Neural Network for Heterophilous Graphs
Songwei Zhao
Yuan Jiang
Zijing Zhang
Yang Yu
Hechang Chen
75
0
0
09 Apr 2025
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments
Licheng Luo
Mingyu Cai
101
0
0
09 Apr 2025