Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Multi-fidelity Reinforcement Learning Control for Complex Dynamical Systems
Luning Sun
Xin-Yang Liu
Siyan Zhao
Aditya Grover
Jian-Xun Wang
Jayaraman J. Thiagarajan
AI4CE
108
0
0
08 Apr 2025
xMTF: A Formula-Free Model for Reinforcement-Learning-Based Multi-Task Fusion in Recommender Systems
Yang Cao
Changhao Zhang
Xiaoshuang Chen
Kaiqiao Zhan
Ben Wang
69
1
0
08 Apr 2025
TW-CRL: Time-Weighted Contrastive Reward Learning for Efficient Inverse Reinforcement Learning
Yuxuan Li
Yicheng Gao
Ning Yang
Stephen Xia
OffRL
131
0
0
08 Apr 2025
Stratified Expert Cloning with Adaptive Selection for User Retention in Large-Scale Recommender Systems
Chengzhi Lin
Annan Xie
Shuchang Liu
Wuhong Wang
Chuyuan Wang
Yongqi Liu
OffRL
59
0
0
08 Apr 2025
AEGIS: Human Attention-based Explainable Guidance for Intelligent Vehicle Systems
Zhuoli Zhuang
Cheng-You Lu
Yu-Cheng Chang
Yu-Kai Wang
T. Do
Chin-Teng Lin
107
0
0
08 Apr 2025
A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks
L. Felizardo
Edoardo Fadda
Paolo Brandimarte
E. Del-Moral-Hernandez
Mariá Cristina Vasconcelos Nascimento
OffRL
79
0
0
07 Apr 2025
Deep Reinforcement Learning Algorithms for Option Hedging
Andrei Neagu
Frédéric Godin
Leila Kosseim
78
0
0
07 Apr 2025
MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning
Shiguang Sun
Hanbo Zhang
Zeyang Liu
Xinrui Yang
Lipeng Wan
Bing Yan
Xingyu Chen
219
0
0
05 Apr 2025
Exploration-Driven Generative Interactive Environments
N. Savov
Naser Kazemi
Mohammad Mahdi
Danda Pani Paudel
Xi Wang
Luc Van Gool
VGen
3DV
107
1
0
03 Apr 2025
Overcoming Deceptiveness in Fitness Optimization with Unsupervised Quality-Diversity
Lisa Coiffard
Paul Templier
Antoine Cully
OffRL
123
0
0
02 Apr 2025
Immersive Explainability: Visualizing Robot Navigation Decisions through XAI Semantic Scene Projections in Virtual Reality
Jorge de Heuvel
Sebastian Müller
Marlene Wessels
Aftab Akhtar
Christian Bauckhage
Maren Bennewitz
73
0
0
01 Apr 2025
MPCritic: A plug-and-play MPC architecture for reinforcement learning
Nathan P. Lawrence
Thomas Banker
Ali Mesbah
101
0
0
01 Apr 2025
A Reactive Framework for Whole-Body Motion Planning of Mobile Manipulators Combining Reinforcement Learning and SDF-Constrained Quadratic Programmi
Chenyu Zhang
Shiying Sun
Kuan Liu
Chuanbao Zhou
Xiaoguang Zhao
M. Tan
Yuanmin Huang
91
0
0
31 Mar 2025
RL2Grid: Benchmarking Reinforcement Learning in Power Grid Operations
Enrico Marchesini
Benjamin Donnot
Constance Crozier
Ian Dytham
Christian Merz
Lars Schewe
Nico Westerbeck
Cathy Wu
Antoine Marot
P. Donti
OffRL
84
1
0
29 Mar 2025
NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios
Songyi Gao
Zuolin Tu
Rong-Jun Qin
Yi-Hao Sun
Xiong-Hui Chen
Yang Yu
OffRL
79
0
0
25 Mar 2025
Reinforcement Learning for Adaptive Planner Parameter Tuning: A Perspective on Hierarchical Architecture
Lu Wangtao
Wei Yufei
Xu Jiadong
Jia Wenhao
Li Liang
Xiong Rong
Wang Yue
82
0
0
24 Mar 2025
Sample-Efficient Reinforcement Learning of Koopman eNMPC
Daniel Mayfrank
M. Velioglu
Alexander Mitsos
Manuel Dahmen
OffRL
89
0
0
24 Mar 2025
Evolutionary Policy Optimization
Jianren Wang
Yifan Su
Abhinav Gupta
Deepak Pathak
82
0
0
24 Mar 2025
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
Yexin Li
Pring Wong
Hanfang Zhang
Shuo Chen
Siyuan Qi
OffRL
86
1
0
23 Mar 2025
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
Maria Makarova
Qian Liu
Dzmitry Tsetserukou
OffRL
74
0
0
20 Mar 2025
FLEX: A Framework for Learning Robot-Agnostic Force-based Skills Involving Sustained Contact Object Manipulation
Shijie Fang
Wenchang Gao
Shivam Goel
Christopher Thierauf
matthias. scheutz
Jivko Sinapov
95
0
0
17 Mar 2025
Evaluation-Time Policy Switching for Offline Reinforcement Learning
Natinael Solomon Neggatu
Jeremie Houssineau
Giovanni Montana
OffRL
OnRL
110
0
0
15 Mar 2025
Low-cost Real-world Implementation of the Swing-up Pendulum for Deep Reinforcement Learning Experiments
Peter Böhm
Pauline Pounds
Archie C. Chapman
70
0
0
14 Mar 2025
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Moritz A. Zanger
Pascal R. van der Vaart
Wendelin Bohmer
M. Spaan
UQCV
BDL
507
2
0
14 Mar 2025
Sketch-to-Skill: Bootstrapping Robot Learning with Human Drawn Trajectory Sketches
Peihong Yu
Amisha Bhaskar
Anukriti Singh
Zahiruddin Mahammad
Pratap Tokekar
96
2
0
14 Mar 2025
Training Directional Locomotion for Quadrupedal Low-Cost Robotic Systems via Deep Reinforcement Learning
Peter Böhm
Archie C. Chapman
Pauline Pounds
166
0
0
14 Mar 2025
Temporal Difference Flows
Jesse Farebrother
Matteo Pirotta
Andrea Tirinzoni
Rémi Munos
A. Lazaric
Ahmed Touati
AI4TS
AIFin
166
1
0
12 Mar 2025
The Impact of VR and 2D Interfaces on Human Feedback in Preference-Based Robot Learning
Jorge de Heuvel
Daniel Marta
Simon Holk
Iolanda Leite
Maren Bennewitz
101
1
0
11 Mar 2025
Goal Conditioned Reinforcement Learning for Photo Finishing Tuning
Jiarui Wu
Yujin Wang
Lingen Li
Zhang Fan
Tianfan Xue
91
0
0
10 Mar 2025
PER-DPP Sampling Framework and Its Application in Path Planning
Junzhe Wang
65
0
0
10 Mar 2025
Performance Comparisons of Reinforcement Learning Algorithms for Sequential Experimental Design
Yasir Zubayr Barlas
Kizito Salako
59
1
0
07 Mar 2025
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
Hyungkyu Kang
Min-hwan Oh
OffRL
119
0
0
07 Mar 2025
Policy Constraint by Only Support Constraint for Offline Reinforcement Learning
Yunkai Gao
Jiaming Guo
Fan Wu
Rui Zhang
OffRL
116
0
0
07 Mar 2025
High-Precision Transformer-Based Visual Servoing for Humanoid Robots in Aligning Tiny Objects
Jialong Xue
Wei Gao
Yu Wang
Chao Ji
Dongdong Zhao
Shi Yan
Shiwu Zhang
89
1
0
06 Mar 2025
Supervised Visual Docking Network for Unmanned Surface Vehicles Using Auto-labeling in Real-world Water Environments
Yijie Chu
Ziniu Wu
Yong Yue
Eng Gee Lim
Paolo Paoletti
Xiaohui Zhu
59
0
0
05 Mar 2025
Variable-Friction In-Hand Manipulation for Arbitrary Objects via Diffusion-Based Imitation Learning
Qiyang Yan
Zihan Ding
Xin Zhou
Adam J. Spiers
75
1
0
04 Mar 2025
Enhancing Deep Reinforcement Learning-based Robot Navigation Generalization through Scenario Augmentation
Shanze Wang
Mingao Tan
Zhiyong Yang
Xinyu Wang
Xiaoyu Shen
Hailong Huang
Wei Zhang
99
0
0
03 Mar 2025
Behavior Preference Regression for Offline Reinforcement Learning
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
65
0
0
02 Mar 2025
Discrete Codebook World Models for Continuous Control
Aidan Scannell
Mohammadreza Nakhaei
Kalle Kujanpää
Yi Zhao
Kevin Sebastian Luck
Dieter Büchler
Joni Pajarinen
OffRL
91
2
0
01 Mar 2025
Never too Prim to Swim: An LLM-Enhanced RL-based Adaptive S-Surface Controller for AUVs under Extreme Sea Conditions
Guanwen Xie
Jingzehua Xu
Yimian Ding
Zhi Zhang
Shuai Zhang
Yongqian Li
76
0
0
01 Mar 2025
Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control
T. Lee
Donghwan Lee
148
0
0
28 Feb 2025
Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application
Thomas Hickling
Maxwell Hogan
Abdulla Tammam
Nabil Aouf
104
1
0
27 Feb 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel
Luca Viano
Volkan Cevher
188
1
0
27 Feb 2025
Equivariant Reinforcement Learning Frameworks for Quadrotor Low-Level Control
Beomyeol Yu
Taeyoung Lee
133
0
0
27 Feb 2025
A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications
Jefferson Silveira
Joshua A. Marshall
Sidney N. Givigi Jr
122
0
0
24 Feb 2025
A Reinforcement Learning Approach to Non-prehensile Manipulation through Sliding
Hamidreza Raei
Elena De Momi
Arash Ajoudani
146
0
0
24 Feb 2025
Yes, Q-learning Helps Offline In-Context RL
Denis Tarasov
Alexander Nikulin
Ilya Zisman
Albina Klepach
Andrei Polubarov
Nikita Lyubaykin
Alexander Derevyagin
Igor Kiselev
Vladislav Kurenkov
OffRL
OnRL
490
3
0
24 Feb 2025
TDMPBC: Self-Imitative Reinforcement Learning for Humanoid Robot Control
Zifeng Zhuang
Diyuan Shi
Runze Suo
Xiao He
Hongyin Zhang
Ting Wang
Shangke Lyu
Donglin Wang
84
1
0
24 Feb 2025
CurricuVLM: Towards Safe Autonomous Driving via Personalized Safety-Critical Curriculum Learning with Vision-Language Models
Zihao Sheng
Zilin Huang
Yansong Qu
Yue Leng
Sruthi Bhavanam
Sikai Chen
111
4
0
24 Feb 2025
Estimating Control Barriers from Offline Data
Hongzhan Yu
Seth Farrell
Ryo Yoshimitsu
Zhizhen Qin
Henrik I. Christensen
Sicun Gao
OffRL
86
3
0
21 Feb 2025
Previous
1
2
3
4
5
6
...
42
43
44
Next