ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Empirical Design in Reinforcement Learning
Empirical Design in Reinforcement Learning
Andrew Patterson
Samuel Neumann
Martha White
Adam White
112
30
0
03 Apr 2023
Generative Adversarial Neuroevolution for Control Behaviour Imitation
Generative Adversarial Neuroevolution for Control Behaviour Imitation
Maximilien Le Clei
Pierre C. Bellec
47
0
0
03 Apr 2023
Neuroevolution of Recurrent Architectures on Control Tasks
Neuroevolution of Recurrent Architectures on Control Tasks
Maximilien Le Clei
Pierre C. Bellec
28
4
0
03 Apr 2023
TacGNN:Learning Tactile-based In-hand Manipulation with a Blind Robot
TacGNN:Learning Tactile-based In-hand Manipulation with a Blind Robot
Linhan Yang
Bidan Huang
Qingbiao Li
Ya-Yen Tsai
Wang Wei Lee
Chaoyang Song
Jia Pan
51
23
0
03 Apr 2023
Adaptive formation motion planning and control of autonomous underwater
  vehicles using deep reinforcement learning
Adaptive formation motion planning and control of autonomous underwater vehicles using deep reinforcement learning
Behnaz Hadi
A. Khosravi
Pouria Sarhadi
83
20
0
01 Apr 2023
Understanding Reinforcement Learning Algorithms: The Progress from Basic
  Q-learning to Proximal Policy Optimization
Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization
M. Chadi
H. Mousannif
OffRL
43
4
0
31 Mar 2023
Learning Human-to-Robot Handovers from Point Clouds
Learning Human-to-Robot Handovers from Point Clouds
Sammy Christen
Wei Yang
Claudia Pérez-DÁrpino
Otmar Hilliges
Dieter Fox
Yu-Wei Chao
73
43
0
30 Mar 2023
Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs
  and Practical Solutions
Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions
Yicheng Luo
Jackie Kay
Edward Grefenstette
M. Deisenroth
OffRLOnRL
69
16
0
30 Mar 2023
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning
  from Observations
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Anqi Li
Byron Boots
Ching-An Cheng
OffRL
90
16
0
30 Mar 2023
Dependent Task Offloading in Edge Computing Using GNN and Deep
  Reinforcement Learning
Dependent Task Offloading in Edge Computing Using GNN and Deep Reinforcement Learning
Zequn Cao
Xiaoheng Deng
32
12
0
30 Mar 2023
Importance Sampling for Stochastic Gradient Descent in Deep Neural
  Networks
Importance Sampling for Stochastic Gradient Descent in Deep Neural Networks
Thibault Lahire
31
2
0
29 Mar 2023
Learning Complicated Manipulation Skills via Deterministic Policy with
  Limited Demonstrations
Learning Complicated Manipulation Skills via Deterministic Policy with Limited Demonstrations
Li Haofeng
C. Yiwen
Tan Jiayi
Marcelo H. Ang Jr
OffRL
35
2
0
29 Mar 2023
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value
  Regularization
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Haoran Xu
Li Jiang
Jianxiong Li
Zhuoran Yang
Zhaoran Wang
Victor Chan
Xianyuan Zhan
OffRL
98
85
0
28 Mar 2023
The Quality-Diversity Transformer: Generating Behavior-Conditioned
  Trajectories with Decision Transformers
The Quality-Diversity Transformer: Generating Behavior-Conditioned Trajectories with Decision Transformers
Valentin Macé
Raphael Boige
Félix Chalumeau
Thomas Pierrot
Guillaume Richard
Nicolas Perrin-Gilbert
OffRL
123
13
0
27 Mar 2023
Balancing policy constraint and ensemble size in uncertainty-based
  offline reinforcement learning
Balancing policy constraint and ensemble size in uncertainty-based offline reinforcement learning
Alex Beeson
Giovanni Montana
OffRL
70
13
0
26 Mar 2023
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic
  Environments
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic Environments
Hongyi Chen
Changliu Liu
OffRL
53
14
0
24 Mar 2023
Multi-Task Reinforcement Learning in Continuous Control with Successor
  Feature-Based Concurrent Composition
Multi-Task Reinforcement Learning in Continuous Control with Successor Feature-Based Concurrent Composition
Y. Liu
Aamir Ahmad
77
4
0
24 Mar 2023
Boosting Reinforcement Learning and Planning with Demonstrations: A
  Survey
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey
Tongzhou Mu
H. Su
OffRL
74
1
0
23 Mar 2023
EDGI: Equivariant Diffusion for Planning with Embodied Agents
EDGI: Equivariant Diffusion for Planning with Embodied Agents
Johann Brehmer
Joey Bose
P. D. Haan
Taco S. Cohen
DiffM
105
36
0
22 Mar 2023
SACPlanner: Real-World Collision Avoidance with a Soft Actor Critic
  Local Planner and Polar State Representations
SACPlanner: Real-World Collision Avoidance with a Soft Actor Critic Local Planner and Polar State Representations
Khaled Nakhleh
Minahil Raza
Mack Tang
M. Andrews
Rinu Boney
I. Hadžić
Jeongran Lee
Atefeh Mohajeri
Karina Palyutina
66
6
0
21 Mar 2023
Style Miner: Find Significant and Stable Explanatory Factors in Time
  Series with Constrained Reinforcement Learning
Style Miner: Find Significant and Stable Explanatory Factors in Time Series with Constrained Reinforcement Learning
Dapeng Li
Feiyang Pan
Jia He
Zhiwei Xu
Dandan Tu
Guoliang Fan
AI4TS
56
2
0
21 Mar 2023
Towards Real-World Applications of Personalized Anesthesia Using Policy
  Constraint Q Learning for Propofol Infusion Control
Towards Real-World Applications of Personalized Anesthesia Using Policy Constraint Q Learning for Propofol Infusion Control
Xiuding Cai
Jiao Chen
Yaoyao Zhu
Beiming Wang
Yu Yao
OffRL
71
5
0
17 Mar 2023
Efficient Learning of High Level Plans from Play
Efficient Learning of High Level Plans from Play
Núria Armengol Urpí
Marco Bagatella
Otmar Hilliges
Georg Martius
Stelian Coros
OffRL
50
3
0
16 Mar 2023
Psychotherapy AI Companion with Reinforcement Learning Recommendations
  and Interpretable Policy Dynamics
Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
OffRLAI4TSAI4MH
107
11
0
16 Mar 2023
Goal-conditioned Offline Reinforcement Learning through State Space
  Partitioning
Goal-conditioned Offline Reinforcement Learning through State Space Partitioning
Mianchu Wang
Yue Jin
Giovanni Montana
OffRL
38
3
0
16 Mar 2023
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
Han Zheng
Xufang Luo
Pengfei Wei
Xuan Song
Dongsheng Li
Jing Jiang
OffRLOnRL
69
24
0
14 Mar 2023
Understanding the Synergies between Quality-Diversity and Deep
  Reinforcement Learning
Understanding the Synergies between Quality-Diversity and Deep Reinforcement Learning
Bryan Lim
Manon Flageat
Antoine Cully
OnRL
81
7
0
10 Mar 2023
Evolving Populations of Diverse RL Agents with MAP-Elites
Evolving Populations of Diverse RL Agents with MAP-Elites
Thomas Pierrot
Arthur Flajolet
118
10
0
09 Mar 2023
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online
  Fine-Tuning
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
Mitsuhiko Nakamoto
Yuexiang Zhai
Anika Singh
Max Sobol Mark
Yi-An Ma
Chelsea Finn
Aviral Kumar
Sergey Levine
OffRLOnRL
188
125
0
09 Mar 2023
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Taisuke Kobayashi
114
3
0
08 Mar 2023
A Strategy-Oriented Bayesian Soft Actor-Critic Model
A Strategy-Oriented Bayesian Soft Actor-Critic Model
Qin Yang
Ramviyas Parasuraman
73
8
0
07 Mar 2023
Diminishing Return of Value Expansion Methods in Model-Based
  Reinforcement Learning
Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning
Daniel Palenicek
M. Lutter
João Carvalho
Jan Peters
79
4
0
07 Mar 2023
MAP-Elites with Descriptor-Conditioned Gradients and Archive
  Distillation into a Single Policy
MAP-Elites with Descriptor-Conditioned Gradients and Archive Distillation into a Single Policy
Maxence Faldor
Félix Chalumeau
Manon Flageat
Antoine Cully
92
19
0
07 Mar 2023
Evolutionary Reinforcement Learning: A Survey
Evolutionary Reinforcement Learning: A Survey
Hui Bai
Ran Cheng
Yaochu Jin
OffRL
142
56
0
07 Mar 2023
Dexterous In-hand Manipulation by Guiding Exploration with Simple
  Sub-skill Controllers
Dexterous In-hand Manipulation by Guiding Exploration with Simple Sub-skill Controllers
Gagan Khandate
C. Mehlman
Xingsheng Wei
M. Ciocarlie
57
3
0
06 Mar 2023
Learning to Backdoor Federated Learning
Learning to Backdoor Federated Learning
Henger Li
Chen Wu
Senchun Zhu
Zizhan Zheng
FedML
82
10
0
06 Mar 2023
Sparsity-Aware Intelligent Massive Random Access Control in Open RAN: A
  Reinforcement Learning Based Approach
Sparsity-Aware Intelligent Massive Random Access Control in Open RAN: A Reinforcement Learning Based Approach
Xiaorui Tang
Sicong Liu
Xiaojiang Du
Mohsen Guizani
57
0
0
05 Mar 2023
Swim: A General-Purpose, High-Performing, and Efficient Activation
  Function for Locomotion Control Tasks
Swim: A General-Purpose, High-Performing, and Efficient Activation Function for Locomotion Control Tasks
Maryam Abdool
Tony Dear
34
1
0
05 Mar 2023
Ensemble Reinforcement Learning: A Survey
Ensemble Reinforcement Learning: A Survey
Yanjie Song
Ponnuthurai Nagaratnam Suganthan
Witold Pedrycz
Junwei Ou
Yongming He
Y. Chen
Yutong Wu
OffRL
91
41
0
05 Mar 2023
CFlowNets: Continuous Control with Generative Flow Networks
CFlowNets: Continuous Control with Generative Flow Networks
Yinchuan Li
Shuang Luo
Haozhi Wang
Jianye Hao
132
23
0
04 Mar 2023
Decision Transformer under Random Frame Dropping
Decision Transformer under Random Frame Dropping
Kaizhe Hu
Rachel Zheng
Yang Gao
Huazhe Xu
OffRL
172
13
0
03 Mar 2023
Subgoal-Driven Navigation in Dynamic Environments Using Attention-Based
  Deep Reinforcement Learning
Subgoal-Driven Navigation in Dynamic Environments Using Attention-Based Deep Reinforcement Learning
Jorge de Heuvel
Weixian Shi
Xiangyu Zeng
Maren Bennewitz
95
1
0
02 Mar 2023
The Ladder in Chaos: A Simple and Effective Improvement to General DRL
  Algorithms by Policy Path Trimming and Boosting
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting
Hongyao Tang
Hao Fei
Jianye Hao
69
1
0
02 Mar 2023
Hallucinated Adversarial Control for Conservative Offline Policy
  Evaluation
Hallucinated Adversarial Control for Conservative Offline Policy Evaluation
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
OffRL
83
10
0
02 Mar 2023
A Variational Approach to Mutual Information-Based Coordination for
  Multi-Agent Reinforcement Learning
A Variational Approach to Mutual Information-Based Coordination for Multi-Agent Reinforcement Learning
Woojun Kim
Whiyoung Jung
Myungsik Cho
Young-Jin Sung
53
7
0
01 Mar 2023
Human-Inspired Framework to Accelerate Reinforcement Learning
Human-Inspired Framework to Accelerate Reinforcement Learning
Ali Beikmohammadi
Sindri Magnússon
OffRL
86
4
0
28 Feb 2023
Policy Dispersion in Non-Markovian Environment
B. Qu
Xiaofeng Cao
Jielong Yang
Hechang Chen
Chang Yi
Ivor W.Tsang
Yew-Soon Ong
63
0
0
28 Feb 2023
The In-Sample Softmax for Offline Reinforcement Learning
The In-Sample Softmax for Offline Reinforcement Learning
Chenjun Xiao
Han Wang
Yangchen Pan
Adam White
Martha White
OffRL
85
26
0
28 Feb 2023
Taylor TD-learning
Taylor TD-learning
Michele Garibbo
Maxime Robeyns
Laurence Aitchison
OffRL
58
1
0
27 Feb 2023
High-Precise Robot Arm Manipulation based on Online Iterative Learning
  and Forward Simulation with Positioning Error Below End-Effector Physical
  Minimum Displacement
High-Precise Robot Arm Manipulation based on Online Iterative Learning and Forward Simulation with Positioning Error Below End-Effector Physical Minimum Displacement
Weiming Qu
Tianlin Liu
D. Luo
75
2
0
26 Feb 2023
Previous
123...181920...424344
Next