ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and
  Stable Online Fine-Tuning
Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning
Alex Beeson
Giovanni Montana
OffRLOnRL
60
24
0
21 Nov 2022
Model-based Trajectory Stitching for Improved Offline Reinforcement
  Learning
Model-based Trajectory Stitching for Improved Offline Reinforcement Learning
Charles A. Hepburn
Giovanni Montana
OffRL
84
14
0
21 Nov 2022
A Low Latency Adaptive Coding Spiking Framework for Deep Reinforcement
  Learning
A Low Latency Adaptive Coding Spiking Framework for Deep Reinforcement Learning
Lang Qin
Rui Yan
Huajin Tang
OffRL
58
6
0
21 Nov 2022
Let Offline RL Flow: Training Conservative Agents in the Latent Space of
  Normalizing Flows
Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
D. Akimov
Vladislav Kurenkov
Alexander Nikulin
Denis Tarasov
Sergey Kolesnikov
OffRL
86
11
0
20 Nov 2022
PIC4rl-gym: a ROS2 modular framework for Robots Autonomous Navigation
  with Deep Reinforcement Learning
PIC4rl-gym: a ROS2 modular framework for Robots Autonomous Navigation with Deep Reinforcement Learning
Mauro Martini
Andrea Eirale
Simone Cerrato
Marcello Chiaberge
60
11
0
19 Nov 2022
Offline Reinforcement Learning with Adaptive Behavior Regularization
Offline Reinforcement Learning with Adaptive Behavior Regularization
Yunfan Zhou
Xijun Li
Qingyu Qu
OffRL
78
1
0
15 Nov 2022
Legged Locomotion in Challenging Terrains using Egocentric Vision
Legged Locomotion in Challenging Terrains using Egocentric Vision
Ananye Agarwal
Ashish Kumar
Jitendra Malik
Deepak Pathak
91
216
0
14 Nov 2022
CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards
  global optimality
CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards global optimality
Gianluigi Grandesso
Elisa Alboni
G. P. R. Papini
Patrick M. Wensing
Andrea Del Prete
78
16
0
12 Nov 2022
A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges
A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges
Yunpeng Qing
Shunyu Liu
Mingli Song
Huiqiong Wang
Mingli Song
XAI
85
1
0
12 Nov 2022
When is Realizability Sufficient for Off-Policy Reinforcement Learning?
When is Realizability Sufficient for Off-Policy Reinforcement Learning?
Andrea Zanette
OffRL
65
15
0
10 Nov 2022
Detecting and Accommodating Novel Types and Concepts in an Embodied
  Simulation Environment
Detecting and Accommodating Novel Types and Concepts in an Embodied Simulation Environment
Sadaf Ghaffari
Nikhil Krishnaswamy
38
7
0
08 Nov 2022
Progress and summary of reinforcement learning on energy management of
  MPS-EV
Progress and summary of reinforcement learning on energy management of MPS-EV
Jincheng Hu
Yang Lin
Liang Chu
Zhuoran Hou
Jihan Li
Jingjing Jiang
Yuanjian Zhang
130
13
0
08 Nov 2022
Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness
  to Model Misspecification
Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification
Takumi Tanabe
Reimi Sato
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
OffRL
73
11
0
07 Nov 2022
On learning history based policies for controlling Markov decision
  processes
On learning history based policies for controlling Markov decision processes
Gandharv Patil
Aditya Mahajan
Doina Precup
OffRL
94
5
0
06 Nov 2022
Graph Reinforcement Learning Application to Co-operative Decision-Making
  in Mixed Autonomy Traffic: Framework, Survey, and Challenges
Graph Reinforcement Learning Application to Co-operative Decision-Making in Mixed Autonomy Traffic: Framework, Survey, and Challenges
Qi Liu
Xueyuan Li
Zirui Li
Jingda Wu
Guodong Du
Xinlu Gao
Fan Yang
Shihua Yuan
68
8
0
06 Nov 2022
Leveraging Fully Observable Policies for Learning under Partial
  Observability
Leveraging Fully Observable Policies for Learning under Partial Observability
Hai V. Nguyen
Andrea Baisero
Dian Wang
Chris Amato
Robert Platt
OffRL
97
20
0
03 Nov 2022
Causal Counterfactuals for Improving the Robustness of Reinforcement
  Learning
Causal Counterfactuals for Improving the Robustness of Reinforcement Learning
Tom He
Jasmina Gajcin
Ivana Dusparic
CML
71
5
0
02 Nov 2022
Offline RL With Realistic Datasets: Heteroskedasticity and Support
  Constraints
Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints
Anika Singh
Aviral Kumar
Q. Vuong
Yevgen Chebotar
Sergey Levine
OffRL
62
14
0
02 Nov 2022
Spatial-temporal recurrent reinforcement learning for autonomous ships
Spatial-temporal recurrent reinforcement learning for autonomous ships
Martin Waltz
Ostap Okhrin
96
9
0
02 Nov 2022
Model-based Reinforcement Learning with a Hamiltonian Canonical ODE
  Network
Model-based Reinforcement Learning with a Hamiltonian Canonical ODE Network
Yao Feng
Yuhong Jiang
Hang Su
Dong Yan
Jun Zhu
90
1
0
02 Nov 2022
Reinforcement Learning for Solving Robotic Reaching Tasks in the
  Neurorobotics Platform
Reinforcement Learning for Solving Robotic Reaching Tasks in the Neurorobotics Platform
Márton Szep
Leander Lauenburg
Kevin Farkas
Xiyan Su
Chuanlong Zang
59
0
0
31 Oct 2022
LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight
  Grouping for Multi-Agent Reinforcement Learning
LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight Grouping for Multi-Agent Reinforcement Learning
Jenny Yang
Jaeuk Kim
Joo-Young Kim
54
2
0
29 Oct 2022
Meta-Reinforcement Learning Using Model Parameters
Meta-Reinforcement Learning Using Model Parameters
G. Hartmann
A. Azaria
73
0
0
27 Oct 2022
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via
  Differentiable Physics-Based Simulation and Rendering
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering
Jun Lv
Yunhai Feng
Cheng Zhang
Shu Zhao
Lin Shao
Cewu Lu
84
26
0
27 Oct 2022
Reachability Verification Based Reliability Assessment for Deep
  Reinforcement Learning Controlled Robotics and Autonomous Systems
Reachability Verification Based Reliability Assessment for Deep Reinforcement Learning Controlled Robotics and Autonomous Systems
Yizhen Dong
Xingyu Zhao
Sen Wang
Xiaowei Huang
106
8
0
26 Oct 2022
Low-Rank Modular Reinforcement Learning via Muscle Synergy
Low-Rank Modular Reinforcement Learning via Muscle Synergy
Heng Dong
Tonghan Wang
Jiayuan Liu
Chongjie Zhang
130
18
0
26 Oct 2022
ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared
  State Representation and Individual Policy Representation
ERL-Re2^22: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation
Jianye Hao
Pengyi Li
Hongyao Tang
Yan Zheng
Xian Fu
Zhaopeng Meng
89
26
0
26 Oct 2022
A Bibliometric Analysis and Review on Reinforcement Learning for
  Transportation Applications
A Bibliometric Analysis and Review on Reinforcement Learning for Transportation Applications
Can Li
Lei Bai
L. Yao
S. Waller
Wei Liu
81
15
0
26 Oct 2022
Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving
  Without Real Data
Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data
John So
Amber Xie
Sunggoo Jung
J. Edlund
Rohan Thakker
Ali Agha-mohammadi
Pieter Abbeel
Stephen James
92
9
0
25 Oct 2022
Adaptive Behavior Cloning Regularization for Stable Offline-to-Online
  Reinforcement Learning
Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning
Yi Zhao
Rinu Boney
Alexander Ilin
Arno Solin
Joni Pajarinen
OffRLOnRL
84
40
0
25 Oct 2022
Empirical analysis of PGA-MAP-Elites for Neuroevolution in Uncertain
  Domains
Empirical analysis of PGA-MAP-Elites for Neuroevolution in Uncertain Domains
Manon Flageat
Félix Chalumeau
Antoine Cully
82
26
0
24 Oct 2022
MetaEMS: A Meta Reinforcement Learning-based Control Framework for
  Building Energy Management System
MetaEMS: A Meta Reinforcement Learning-based Control Framework for Building Energy Management System
Huiliang Zhang
Di Wu
Benoit Boulet
78
6
0
23 Oct 2022
Bridging the Gap Between Target Networks and Functional Regularization
Alexandre Piché
Valentin Thomas
Joseph Marino
Rafael Pardiñas
Gian Maria Marconi
C. Pal
Mohammad Emtiyaz Khan
63
2
0
21 Oct 2022
STAP: Sequencing Task-Agnostic Policies
STAP: Sequencing Task-Agnostic Policies
Christopher Agia
Toki Migimatsu
Jiajun Wu
Jeannette Bohg
111
20
0
21 Oct 2022
Learning Robust Dynamics through Variational Sparse Gating
Learning Robust Dynamics through Variational Sparse Gating
A. Jain
Shivakanth Sujit
S. Joshi
Vincent Michalski
Danijar Hafner
Samira Ebrahimi Kahou
68
9
0
21 Oct 2022
RMBench: Benchmarking Deep Reinforcement Learning for Robotic
  Manipulator Control
RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control
Yanfei Xiang
Xin Wang
Shu Hu
Bin Zhu
Xiaomeng Huang
Xi Wu
Siwei Lyu
SSL
89
5
0
20 Oct 2022
Task Phasing: Automated Curriculum Learning from Demonstrations
Task Phasing: Automated Curriculum Learning from Demonstrations
Vaibhav Bajaj
Guni Sharon
Peter Stone
66
8
0
20 Oct 2022
Integrated Decision and Control for High-Level Automated Vehicles by
  Mixed Policy Gradient and Its Experiment Verification
Integrated Decision and Control for High-Level Automated Vehicles by Mixed Policy Gradient and Its Experiment Verification
Yang Guan
Liye Tang
Chuanxiao Li
Shengbo Eben Li
Yangang Ren
Junqing Wei
Bo Zhang
Ke Li
40
0
0
19 Oct 2022
Robust Offline Reinforcement Learning with Gradient Penalty and
  Constraint Relaxation
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
Chengqian Gao
Kelvin Xu
Liu Liu
Deheng Ye
P. Zhao
Zhiqiang Xu
OffRL
95
2
0
19 Oct 2022
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Wei Qiu
Xiao Ma
Bo An
S. Obraztsova
Shuicheng Yan
Zhongwen Xu
70
2
0
18 Oct 2022
Deep Black-Box Reinforcement Learning with Movement Primitives
Deep Black-Box Reinforcement Learning with Movement Primitives
Fabian Otto
Onur Celik
Hongyi Zhou
Hanna Ziesche
Ngo Anh Vien
Gerhard Neumann
OffRL
71
19
0
18 Oct 2022
Planning for Sample Efficient Imitation Learning
Planning for Sample Efficient Imitation Learning
Zhao-Heng Yin
Weirui Ye
Qifeng Chen
Yang Gao
OffRL
89
21
0
18 Oct 2022
The Impact of Task Underspecification in Evaluating Deep Reinforcement
  Learning
The Impact of Task Underspecification in Evaluating Deep Reinforcement Learning
Vindula Jayawardana
Catherine Tang
Sirui Li
Da Suo
Cathy Wu
OffRL
108
13
0
16 Oct 2022
When to Update Your Model: Constrained Model-based Reinforcement
  Learning
When to Update Your Model: Constrained Model-based Reinforcement Learning
Tianying Ji
Yu-Juan Luo
Gang Hua
Mingxuan Jing
Fengxiang He
Wen-bing Huang
76
19
0
15 Oct 2022
A Policy-Guided Imitation Approach for Offline Reinforcement Learning
A Policy-Guided Imitation Approach for Offline Reinforcement Learning
Haoran Xu
Li Jiang
Jianxiong Li
Xianyuan Zhan
OffRL
153
64
0
15 Oct 2022
A Scalable Reinforcement Learning Approach for Attack Allocation in
  Swarm to Swarm Engagement Problems
A Scalable Reinforcement Learning Approach for Attack Allocation in Swarm to Swarm Engagement Problems
Umut Demir
N. K. Üre
48
1
0
15 Oct 2022
PI-QT-Opt: Predictive Information Improves Multi-Task Robotic
  Reinforcement Learning at Scale
PI-QT-Opt: Predictive Information Improves Multi-Task Robotic Reinforcement Learning at Scale
Kuang-Huei Lee
Ted Xiao
A. Li
Paul Wohlhart
Ian S. Fischer
Yao Lu
120
10
0
15 Oct 2022
DyFEn: Agent-Based Fee Setting in Payment Channel Networks
DyFEn: Agent-Based Fee Setting in Payment Channel Networks
Kian Asgari
Aida Mohammadian
M. Tefagh
18
7
0
15 Oct 2022
CUP: Critic-Guided Policy Reuse
CUP: Critic-Guided Policy Reuse
Jin Zhang
Siyuan Li
Chongjie Zhang
86
8
0
15 Oct 2022
Geometric Reinforcement Learning For Robotic Manipulation
Geometric Reinforcement Learning For Robotic Manipulation
Naseem Alhousani
Matteo Saveriano
Ibrahim Sevinc
Talha Abdulkuddus
Hatice Kose
Fares J. Abu-Dakka
70
6
0
14 Oct 2022
Previous
123...212223...424344
Next