ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Denis Tarasov
Vladislav Kurenkov
Alexander Nikulin
Sergey Kolesnikov
OffRL
100
50
0
16 May 2023
Prompt-Tuning Decision Transformer with Preference Ranking
Prompt-Tuning Decision Transformer with Preference Ranking
Shengchao Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
83
14
0
16 May 2023
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning
  Research
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research
Jiaming Ji
Jiayi Zhou
Borong Zhang
Juntao Dai
Xuehai Pan
Ruiyang Sun
Weidong Huang
Yiran Geng
Mickel Liu
Yaodong Yang
OffRL
146
52
0
16 May 2023
A Deep RL Approach on Task Placement and Scaling of Edge Resources for Cellular Vehicle-to-Network Service Provisioning
A Deep RL Approach on Task Placement and Scaling of Edge Resources for Cellular Vehicle-to-Network Service Provisioning
Cyril Shih-Huan Hsu
Jorge Martín-Pérez
D. D. Vleeschauwer
K. Kondepu
Xi Li
Xi Li
61
0
0
16 May 2023
What Matters in Reinforcement Learning for Tractography
What Matters in Reinforcement Learning for Tractography
Antoine Théberge
Christian Desrosiers
Maxime Descoteaux
Pierre-Marc Jodoin
OffRL
46
2
0
15 May 2023
On Practical Robust Reinforcement Learning: Practical Uncertainty Set
  and Double-Agent Algorithm
On Practical Robust Reinforcement Learning: Practical Uncertainty Set and Double-Agent Algorithm
Ukjo Hwang
Songnam Hong
47
1
0
11 May 2023
GFlowNets with Human Feedback
GFlowNets with Human Feedback
Yinchuan Li
Shuang Luo
Yunfeng Shao
Jianye Hao
AI4CE
68
5
0
11 May 2023
Deep Reinforcement Learning Based Resource Allocation for Cloud Native
  Wireless Network
Deep Reinforcement Learning Based Resource Allocation for Cloud Native Wireless Network
L. Wang
Jiasheng Wu
Yueyuan Gao
Jingjing Zhang
27
3
0
10 May 2023
Reducing the Cost of Cycle-Time Tuning for Real-World Policy
  Optimization
Reducing the Cost of Cycle-Time Tuning for Real-World Policy Optimization
Homayoon Farrahi
Rupam Mahmood
68
5
0
09 May 2023
Policy Gradient Methods in the Presence of Symmetries and State
  Abstractions
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Prakash Panangaden
S. Rezaei-Shoshtari
Rosie Zhao
David Meger
Doina Precup
74
4
0
09 May 2023
Communication-Robust Multi-Agent Learning by Adaptable Auxiliary
  Multi-Agent Adversary Generation
Communication-Robust Multi-Agent Learning by Adaptable Auxiliary Multi-Agent Adversary Generation
Lei Yuan
F. Chen
Zhongzhan Zhang
Yang Yu
AAML
100
10
0
09 May 2023
Local Optimization Achieves Global Optimality in Multi-Agent
  Reinforcement Learning
Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
Yulai Zhao
Zhuoran Yang
Zhaoran Wang
Jason D. Lee
77
3
0
08 May 2023
DEFENDER: DTW-Based Episode Filtering Using Demonstrations for Enhancing
  RL Safety
DEFENDER: DTW-Based Episode Filtering Using Demonstrations for Enhancing RL Safety
André Rosa de Sousa Porfírio Correia
L. A. Alexandre
OffRL
15
0
0
08 May 2023
A Minimal Approach for Natural Language Action Space in Text-based Games
A Minimal Approach for Natural Language Action Space in Text-based Games
Dongwon Kelvin Ryu
Meng Fang
Shirui Pan
Gholamreza Haffari
Ehsan Shareghi
LLMAG
75
2
0
06 May 2023
HACMan: Learning Hybrid Actor-Critic Maps for 6D Non-Prehensile
  Manipulation
HACMan: Learning Hybrid Actor-Critic Maps for 6D Non-Prehensile Manipulation
Wen-Min Zhou
Bowen Jiang
Fan Yang
Chris Paxton
David Held
128
33
0
06 May 2023
Federated Ensemble-Directed Offline Reinforcement Learning
Federated Ensemble-Directed Offline Reinforcement Learning
Desik Rengarajan
N. Ragothaman
D. Kalathil
S. Shakkottai
OffRL
93
1
0
04 May 2023
Masked Trajectory Models for Prediction, Representation, and Control
Masked Trajectory Models for Prediction, Representation, and Control
Philipp Wu
Arjun Majumdar
Kevin Stone
Yixin Lin
Igor Mordatch
Pieter Abbeel
Aravind Rajeswaran
OffRL
67
39
0
04 May 2023
Simple Noisy Environment Augmentation for Reinforcement Learning
Simple Noisy Environment Augmentation for Reinforcement Learning
Raad Khraishi
Ramin Okhrati
OffRL
45
1
0
04 May 2023
CCIL: Context-conditioned imitation learning for urban driving
CCIL: Context-conditioned imitation learning for urban driving
Ke Guo
Wei Jing
Junbo Chen
Jia Pan
101
10
0
04 May 2023
An Imitation Learning Based Algorithm Enabling Priori Knowledge Transfer in Modern Electricity Markets for Bayesian Nash Equilibrium Estimation
Ziqing Zhu
K. Chan
S. Bu
Ze Hu
S. Xia
50
2
0
04 May 2023
Map-based Experience Replay: A Memory-Efficient Solution to Catastrophic
  Forgetting in Reinforcement Learning
Map-based Experience Replay: A Memory-Efficient Solution to Catastrophic Forgetting in Reinforcement Learning
Muhammad Burhan Hafez
Tilman Immisch
Tom Weber
S. Wermter
CLL
97
6
0
03 May 2023
More Than an Arm: Using a Manipulator as a Tail for Enhanced Stability
  in Legged Locomotion
More Than an Arm: Using a Manipulator as a Tail for Enhanced Stability in Legged Locomotion
Huang Huang
Antonio Loquercio
Ashish Kumar
Neerja Thakkar
Ken Goldberg
Jitendra Malik
57
2
0
02 May 2023
Mixed-Integer Optimal Control via Reinforcement Learning: A Case Study
  on Hybrid Electric Vehicle Energy Management
Mixed-Integer Optimal Control via Reinforcement Learning: A Case Study on Hybrid Electric Vehicle Energy Management
Jinming Xu
N. L. Azad
Yuan Lin
58
0
0
02 May 2023
A Coupled Flow Approach to Imitation Learning
A Coupled Flow Approach to Imitation Learning
G. Freund
Elad Sarafian
Sarit Kraus
OOD
73
13
0
29 Apr 2023
Can Agents Run Relay Race with Strangers? Generalization of RL to
  Out-of-Distribution Trajectories
Can Agents Run Relay Race with Strangers? Generalization of RL to Out-of-Distribution Trajectories
Li-Cheng Lan
Huan Zhang
Cho-Jui Hsieh
OODD
73
10
0
26 Apr 2023
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling
  in Offline Reinforcement Learning
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning
Cheng Lu
Huayu Chen
Jianfei Chen
Hang Su
Chongxuan Li
Jun Zhu
DiffMOffRL
143
75
0
25 Apr 2023
A Multi-Task Approach to Robust Deep Reinforcement Learning for Resource
  Allocation
A Multi-Task Approach to Robust Deep Reinforcement Learning for Resource Allocation
Steffen Gracla
C. Bockelmann
Armin Dekorsy
52
3
0
25 Apr 2023
Parallel bootstrap-based on-policy deep reinforcement learning for
  continuous flow control applications
Parallel bootstrap-based on-policy deep reinforcement learning for continuous flow control applications
J. Viquerat
E. Hachem
56
3
0
24 Apr 2023
How to Control Hydrodynamic Force on Fluidic Pinball via Deep
  Reinforcement Learning
How to Control Hydrodynamic Force on Fluidic Pinball via Deep Reinforcement Learning
Haodong Feng
Yue Wang
Hui Xiang
Zhiyang Jin
Dixia Fan
AI4CE
68
10
0
23 Apr 2023
Approximate Shielding of Atari Agents for Safe Exploration
Approximate Shielding of Atari Agents for Safe Exploration
Alexander W. Goodall
Francesco Belardinelli
45
2
0
21 Apr 2023
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Qiyang Li
Aviral Kumar
Ilya Kostrikov
Sergey Levine
OffRL
118
36
0
20 Apr 2023
Aiding reinforcement learning for set point control
Aiding reinforcement learning for set point control
Ruoqing Zhang
Per Mattsson
T. Wigren
120
3
0
20 Apr 2023
Filter-Aware Model-Predictive Control
Filter-Aware Model-Predictive Control
Baris Kayalibay
Atanas Mirchev
Ahmed Agha
Patrick van der Smagt
Justin Bayer
98
0
0
20 Apr 2023
Robust Deep Reinforcement Learning Scheduling via Weight Anchoring
Robust Deep Reinforcement Learning Scheduling via Weight Anchoring
Steffen Gracla
Edgar Beck
C. Bockelmann
Armin Dekorsy
OOD
62
3
0
20 Apr 2023
CASOG: Conservative Actor-critic with SmOoth Gradient for Skill Learning
  in Robot-Assisted Intervention
CASOG: Conservative Actor-critic with SmOoth Gradient for Skill Learning in Robot-Assisted Intervention
Hao Li
Xiao-Hu Zhou
Xiaoliang Xie
Shiqi Liu
Zhen-Qiu Feng
Z. Hou
OffRL
55
11
0
19 Apr 2023
Heterogeneous-Agent Reinforcement Learning
Heterogeneous-Agent Reinforcement Learning
Yifan Zhong
J. Kuba
Xidong Feng
Siyi Hu
Jiaming Ji
Yaodong Yang
82
47
0
19 Apr 2023
Long-Term Fairness with Unknown Dynamics
Long-Term Fairness with Unknown Dynamics
Tongxin Yin
Reilly P. Raab
M. Liu
Yang Liu
FaML
94
28
0
19 Apr 2023
Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for
  Robotics Control with Action Constraints
Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints
Kazumi Kasaura
Shuwa Miura
Tadashi Kozuno
Ryo Yonetani
Kenta Hoshino
Y. Hosoe
75
14
0
18 Apr 2023
Integration of Reinforcement Learning Based Behavior Planning With
  Sampling Based Motion Planning for Automated Driving
Integration of Reinforcement Learning Based Behavior Planning With Sampling Based Motion Planning for Automated Driving
Marvin Klimke
Benjamin Völz
M. Buchholz
48
6
0
17 Apr 2023
Causal Decision Transformer for Recommender Systems via Offline
  Reinforcement Learning
Causal Decision Transformer for Recommender Systems via Offline Reinforcement Learning
Siyu Wang
Xiaocong Chen
Dietmar Jannach
Lina Yao
CMLOffRL
114
30
0
17 Apr 2023
Efficient Quality-Diversity Optimization through Diverse Quality Species
Efficient Quality-Diversity Optimization through Diverse Quality Species
Ryan Wickman
Bibek Poudel
Taylor Michael Villarreal
Xiaofei Zhang
Weizi Li
102
6
0
14 Apr 2023
Model Predictive Control with Self-supervised Representation Learning
Model Predictive Control with Self-supervised Representation Learning
Jonas A. Matthies
Muhammad Burhan Hafez
Mostafa Kotb
S. Wermter
SSL
25
0
0
14 Apr 2023
A Platform-Agnostic Deep Reinforcement Learning Framework for Effective
  Sim2Real Transfer in Autonomous Driving
A Platform-Agnostic Deep Reinforcement Learning Framework for Effective Sim2Real Transfer in Autonomous Driving
Dian-Tao Li
Ostap Okhrin
104
3
0
14 Apr 2023
Multi-agent Policy Reciprocity with Theoretical Guarantee
Multi-agent Policy Reciprocity with Theoretical Guarantee
Haozhi Wang
Yinchuan Li
Qing Wang
Yunfeng Shao
Jianye Hao
58
0
0
12 Apr 2023
Reinforcement Learning-Based Black-Box Model Inversion Attacks
Reinforcement Learning-Based Black-Box Model Inversion Attacks
Gyojin Han
Jaehyun Choi
Haeil Lee
Junmo Kim
MIACV
65
37
0
10 Apr 2023
RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning
RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning
Kevin Zakka
Philipp Wu
Laura M. Smith
Nimrod Gileadi
Taylor A. Howell
...
Sumeet Singh
Yuval Tassa
Pete Florence
Andy Zeng
Pieter Abbeel
109
32
0
09 Apr 2023
Deep Reinforcement Learning-Based Mapless Crowd Navigation with
  Perceived Risk of the Moving Crowd for Mobile Robots
Deep Reinforcement Learning-Based Mapless Crowd Navigation with Perceived Risk of the Moving Crowd for Mobile Robots
Hafiq Anas
Ong Wee Hong
O. A. Malik
16
3
0
07 Apr 2023
UAV Obstacle Avoidance by Human-in-the-Loop Reinforcement in Arbitrary
  3D Environment
UAV Obstacle Avoidance by Human-in-the-Loop Reinforcement in Arbitrary 3D Environment
Xuyang Li
Jianwu Fang
Kai Du
K. Mei
Jianru Xue
68
6
0
07 Apr 2023
A modular framework for stabilizing deep reinforcement learning control
A modular framework for stabilizing deep reinforcement learning control
Nathan P. Lawrence
Philip D. Loewen
Shuyuan Wang
M. Forbes
R. Bhushan Gopaluni
71
1
0
07 Apr 2023
Optimal Energy Storage Scheduling for Wind Curtailment Reduction and
  Energy Arbitrage: A Deep Reinforcement Learning Approach
Optimal Energy Storage Scheduling for Wind Curtailment Reduction and Energy Arbitrage: A Deep Reinforcement Learning Approach
Jinhao Li
Changlong Wang
Hao Wang
27
3
0
05 Apr 2023
Previous
123...171819...424344
Next