ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXivPDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 849 papers shown
Title
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Qiyang Li
Aviral Kumar
Ilya Kostrikov
Sergey Levine
OffRL
32
31
0
20 Apr 2023
Aiding reinforcement learning for set point control
Aiding reinforcement learning for set point control
Ruoqing Zhang
Per Mattsson
T. Wigren
21
3
0
20 Apr 2023
Filter-Aware Model-Predictive Control
Filter-Aware Model-Predictive Control
Baris Kayalibay
Atanas Mirchev
Ahmed Agha
Patrick van der Smagt
Justin Bayer
44
0
0
20 Apr 2023
Robust Deep Reinforcement Learning Scheduling via Weight Anchoring
Robust Deep Reinforcement Learning Scheduling via Weight Anchoring
Steffen Gracla
Edgar Beck
C. Bockelmann
Armin Dekorsy
OOD
38
3
0
20 Apr 2023
Integration of Reinforcement Learning Based Behavior Planning With
  Sampling Based Motion Planning for Automated Driving
Integration of Reinforcement Learning Based Behavior Planning With Sampling Based Motion Planning for Automated Driving
Marvin Klimke
Benjamin Völz
M. Buchholz
26
5
0
17 Apr 2023
Efficient Quality-Diversity Optimization through Diverse Quality Species
Efficient Quality-Diversity Optimization through Diverse Quality Species
Ryan Wickman
Bibek Poudel
Taylor Michael Villarreal
Xiaofei Zhang
Weizi Li
38
6
0
14 Apr 2023
Model Predictive Control with Self-supervised Representation Learning
Model Predictive Control with Self-supervised Representation Learning
Jonas A. Matthies
Muhammad Burhan Hafez
Mostafa Kotb
S. Wermter
SSL
13
0
0
14 Apr 2023
UAV Obstacle Avoidance by Human-in-the-Loop Reinforcement in Arbitrary
  3D Environment
UAV Obstacle Avoidance by Human-in-the-Loop Reinforcement in Arbitrary 3D Environment
Xuyang Li
Jianwu Fang
Kai Du
K. Mei
Jianru Xue
34
6
0
07 Apr 2023
Generative Adversarial Neuroevolution for Control Behaviour Imitation
Generative Adversarial Neuroevolution for Control Behaviour Imitation
Maximilien Le Clei
Pierre C. Bellec
21
0
0
03 Apr 2023
Adaptive formation motion planning and control of autonomous underwater
  vehicles using deep reinforcement learning
Adaptive formation motion planning and control of autonomous underwater vehicles using deep reinforcement learning
Behnaz Hadi
A. Khosravi
Pouria Sarhadi
20
18
0
01 Apr 2023
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning
  from Observations
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Anqi Li
Byron Boots
Ching-An Cheng
OffRL
28
16
0
30 Mar 2023
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value
  Regularization
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Haoran Xu
Li Jiang
Jianxiong Li
Zhuoran Yang
Zhaoran Wang
Victor Chan
Xianyuan Zhan
OffRL
36
73
0
28 Mar 2023
The Quality-Diversity Transformer: Generating Behavior-Conditioned
  Trajectories with Decision Transformers
The Quality-Diversity Transformer: Generating Behavior-Conditioned Trajectories with Decision Transformers
Valentin Macé
Raphael Boige
Félix Chalumeau
Thomas Pierrot
Guillaume Richard
Nicolas Perrin-Gilbert
OffRL
38
12
0
27 Mar 2023
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic
  Environments
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic Environments
Hongyi Chen
Changliu Liu
OffRL
27
14
0
24 Mar 2023
Multi-Task Reinforcement Learning in Continuous Control with Successor
  Feature-Based Concurrent Composition
Multi-Task Reinforcement Learning in Continuous Control with Successor Feature-Based Concurrent Composition
Y. Liu
Aamir Ahmad
29
4
0
24 Mar 2023
SACPlanner: Real-World Collision Avoidance with a Soft Actor Critic
  Local Planner and Polar State Representations
SACPlanner: Real-World Collision Avoidance with a Soft Actor Critic Local Planner and Polar State Representations
Khaled Nakhleh
Minahil Raza
Mack Tang
M. Andrews
Rinu Boney
I. Hadžić
Jeongran Lee
Atefeh Mohajeri
Karina Palyutina
20
4
0
21 Mar 2023
Towards Real-World Applications of Personalized Anesthesia Using Policy
  Constraint Q Learning for Propofol Infusion Control
Towards Real-World Applications of Personalized Anesthesia Using Policy Constraint Q Learning for Propofol Infusion Control
Xiuding Cai
Jiao Chen
Yaoyao Zhu
Beiming Wang
Yu Yao
OffRL
38
5
0
17 Mar 2023
Psychotherapy AI Companion with Reinforcement Learning Recommendations
  and Interpretable Policy Dynamics
Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
OffRL
AI4TS
AI4MH
43
10
0
16 Mar 2023
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
Han Zheng
Xufang Luo
Pengfei Wei
Xuan Song
Dongsheng Li
Jing Jiang
OffRL
OnRL
18
21
0
14 Mar 2023
Evolving Populations of Diverse RL Agents with MAP-Elites
Evolving Populations of Diverse RL Agents with MAP-Elites
Thomas Pierrot
Arthur Flajolet
35
8
0
09 Mar 2023
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Taisuke Kobayashi
53
3
0
08 Mar 2023
The Ladder in Chaos: A Simple and Effective Improvement to General DRL
  Algorithms by Policy Path Trimming and Boosting
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting
Hongyao Tang
Hao Fei
Jianye Hao
28
1
0
02 Mar 2023
Hallucinated Adversarial Control for Conservative Offline Policy
  Evaluation
Hallucinated Adversarial Control for Conservative Offline Policy Evaluation
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
OffRL
21
10
0
02 Mar 2023
A Variational Approach to Mutual Information-Based Coordination for
  Multi-Agent Reinforcement Learning
A Variational Approach to Mutual Information-Based Coordination for Multi-Agent Reinforcement Learning
Woojun Kim
Whiyoung Jung
Myungsik Cho
Young-Jin Sung
32
7
0
01 Mar 2023
Human-Inspired Framework to Accelerate Reinforcement Learning
Human-Inspired Framework to Accelerate Reinforcement Learning
Ali Beikmohammadi
Sindri Magnússon
OffRL
29
4
0
28 Feb 2023
The In-Sample Softmax for Offline Reinforcement Learning
The In-Sample Softmax for Offline Reinforcement Learning
Chenjun Xiao
Han Wang
Yangchen Pan
Adam White
Martha White
OffRL
29
26
0
28 Feb 2023
High-Precise Robot Arm Manipulation based on Online Iterative Learning
  and Forward Simulation with Positioning Error Below End-Effector Physical
  Minimum Displacement
High-Precise Robot Arm Manipulation based on Online Iterative Learning and Forward Simulation with Positioning Error Below End-Effector Physical Minimum Displacement
Weiming Qu
Tianlin Liu
D. Luo
21
2
0
26 Feb 2023
Gauss-Newton Temporal Difference Learning with Nonlinear Function
  Approximation
Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation
Zhifa Ke
Junyu Zhang
Zaiwen Wen
24
0
0
25 Feb 2023
Neural Laplace Control for Continuous-time Delayed Systems
Neural Laplace Control for Continuous-time Delayed Systems
Samuel Holt
Alihan Huyuk
Zhaozhi Qian
Hao Sun
M. Schaar
OffRL
34
10
0
24 Feb 2023
Why Target Networks Stabilise Temporal Difference Methods
Why Target Networks Stabilise Temporal Difference Methods
Matt Fellows
Matthew Smith
Shimon Whiteson
OOD
AAML
21
7
0
24 Feb 2023
A Supervisory Learning Control Framework for Autonomous & Real-time Task
  Planning for an Underactuated Cooperative Robotic task
A Supervisory Learning Control Framework for Autonomous & Real-time Task Planning for an Underactuated Cooperative Robotic task
Sander De Witte
Tom Lefebvre
Thijs Van Hauwermeiren
Guillaume Crevecoeur
21
0
0
22 Feb 2023
Behavior Proximal Policy Optimization
Behavior Proximal Policy Optimization
Zifeng Zhuang
Kun Lei
Jinxin Liu
Donglin Wang
Yilang Guo
OffRL
30
34
0
22 Feb 2023
Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement
  Learning
Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement Learning
Jiong Li
Pratik Gajane
37
4
0
21 Feb 2023
Improving Deep Policy Gradients with Value Function Search
Improving Deep Policy Gradients with Value Function Search
Enrico Marchesini
Chris Amato
26
9
0
20 Feb 2023
Price of Anarchy in a Double-Sided Critical Distribution System
Price of Anarchy in a Double-Sided Critical Distribution System
David Sychrovský
Jakub Cerny
Sylvain Lichau
M. Loebl
23
1
0
20 Feb 2023
Demonstration-Guided Reinforcement Learning with Efficient Exploration
  for Task Automation of Surgical Robot
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
Tao Huang
Kai-xiang Chen
Bin Li
Yunhui Liu
Qingxu Dou
37
23
0
20 Feb 2023
Stochastic Generative Flow Networks
Stochastic Generative Flow Networks
L. Pan
Dinghuai Zhang
Moksh Jain
Longbo Huang
Yoshua Bengio
BDL
49
31
0
19 Feb 2023
Swapped goal-conditioned offline reinforcement learning
Swapped goal-conditioned offline reinforcement learning
Wenyan Yang
Huiling Wang
Dingding Cai
Joni Pajarinen
Joni-Kristen Kämäräinen
OffRL
OnRL
36
1
0
17 Feb 2023
Learning a model is paramount for sample efficiency in reinforcement
  learning control of PDEs
Learning a model is paramount for sample efficiency in reinforcement learning control of PDEs
Stefan Werner
Sebastian Peitz
41
9
0
14 Feb 2023
Time-attenuating Twin Delayed DDPG Reinforcement Learning for Trajectory
  Tracking Control of Quadrotors
Time-attenuating Twin Delayed DDPG Reinforcement Learning for Trajectory Tracking Control of Quadrotors
Boyuan Deng
Jian Sun
Zhuo Li
G. Wang
35
0
0
13 Feb 2023
Learning Interaction-aware Motion Prediction Model for Decision-making
  in Autonomous Driving
Learning Interaction-aware Motion Prediction Model for Decision-making in Autonomous Driving
Zhiyu Huang
Haochen Liu
Jingda Wu
Wenhui Huang
Chen Lv
33
17
0
08 Feb 2023
Efficient Online Reinforcement Learning with Offline Data
Efficient Online Reinforcement Learning with Offline Data
Philip J. Ball
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
OnRL
42
163
0
06 Feb 2023
A Strong Baseline for Batch Imitation Learning
A Strong Baseline for Batch Imitation Learning
Matthew Smith
Lucas Maystre
Zhenwen Dai
K. Ciosek
OffRL
25
4
0
06 Feb 2023
Open Problems and Modern Solutions for Deep Reinforcement Learning
Open Problems and Modern Solutions for Deep Reinforcement Learning
Weiqin Chen
OffRL
23
0
0
05 Feb 2023
Reinforcing User Retention in a Billion Scale Short Video Recommender
  System
Reinforcing User Retention in a Billion Scale Short Video Recommender System
Qingpeng Cai
Shuchang Liu
Xueliang Wang
Tianyou Zuo
Wentao Xie
Bin Yang
Dong Zheng
Peng Jiang
Kun Gai
OffRL
33
41
0
03 Feb 2023
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Haichao Zhang
Weiwen Xu
Haonan Yu
CLL
OffRL
OnRL
42
62
0
02 Feb 2023
Distillation Policy Optimization
Distillation Policy Optimization
Jianfei Ma
OffRL
26
1
0
01 Feb 2023
Anti-Exploration by Random Network Distillation
Anti-Exploration by Random Network Distillation
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Sergey Kolesnikov
38
24
0
31 Jan 2023
PAC-Bayesian Soft Actor-Critic Learning
PAC-Bayesian Soft Actor-Critic Learning
Bahareh Tasdighi
Abdullah Akgul
Manuel Haussmann
Kenny Kazimirzak Brink
M. Kandemir
34
3
0
30 Jan 2023
Automatic Intersection Management in Mixed Traffic Using Reinforcement
  Learning and Graph Neural Networks
Automatic Intersection Management in Mixed Traffic Using Reinforcement Learning and Graph Neural Networks
Marvin Klimke
Benjamin Völz
M. Buchholz
41
13
0
30 Jan 2023
Previous
123...567...151617
Next