ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Toward Causal-Aware RL: State-Wise Action-Refined Temporal Difference
Toward Causal-Aware RL: State-Wise Action-Refined Temporal Difference
Hao Sun
Taiyi A. Wang
CML
73
6
0
02 Jan 2022
Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement
  Learning
Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning
Ziyang Tang
Yihao Feng
Qiang Liu
OffRL
43
1
0
01 Jan 2022
A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement
  Learning
A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning
Yuxing Wang
Tiantian Zhang
Yongzhe Chang
Bin Liang
Xueqian Wang
Bo Yuan
88
17
0
01 Jan 2022
Double Critic Deep Reinforcement Learning for Mapless 3D Navigation of
  Unmanned Aerial Vehicles
Double Critic Deep Reinforcement Learning for Mapless 3D Navigation of Unmanned Aerial Vehicles
Ricardo B. Grando
J. C. Jesus
V. A. Kich
A. H. Kolling
P. Drews
92
35
0
27 Dec 2021
A Survey on Interpretable Reinforcement Learning
A Survey on Interpretable Reinforcement Learning
Claire Glanois
Paul Weng
Matthieu Zimmer
Dong Li
Tianpei Yang
Jianye Hao
Wulong Liu
OffRL
108
105
0
24 Dec 2021
Missing Velocity in Dynamic Obstacle Avoidance based on Deep
  Reinforcement Learning
Missing Velocity in Dynamic Obstacle Avoidance based on Deep Reinforcement Learning
Fabian Hart
Martin Waltz
Ostap Okhrin
32
0
0
23 Dec 2021
Direct Behavior Specification via Constrained Reinforcement Learning
Direct Behavior Specification via Constrained Reinforcement Learning
Julien Roy
Roger Girgis
Joshua Romoff
Pierre-Luc Bacon
C. Pal
114
36
0
22 Dec 2021
Evaluating the Robustness of Deep Reinforcement Learning for Autonomous
  Policies in a Multi-agent Urban Driving Environment
Evaluating the Robustness of Deep Reinforcement Learning for Autonomous Policies in a Multi-agent Urban Driving Environment
Aizaz Sharif
D. Marijan
33
5
0
22 Dec 2021
Newsvendor Model with Deep Reinforcement Learning
Newsvendor Model with Deep Reinforcement Learning
Dylan K. Goetting
25
0
0
22 Dec 2021
Value Activation for Bias Alleviation: Generalized-activated Deep Double
  Deterministic Policy Gradients
Value Activation for Bias Alleviation: Generalized-activated Deep Double Deterministic Policy Gradients
Jiafei Lyu
Yu Yang
Jiangpeng Yan
Xiu Li
OffRLAI4CE
73
7
0
21 Dec 2021
Soft Actor-Critic with Cross-Entropy Policy Optimization
Soft Actor-Critic with Cross-Entropy Policy Optimization
Zhenyang Shi
Surya Pal Singh
46
5
0
21 Dec 2021
Learning Robust Policy against Disturbance in Transition Dynamics via
  State-Conservative Policy Optimization
Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization
Yufei Kuang
Miao Lu
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
56
22
0
20 Dec 2021
Interpretable Preference-based Reinforcement Learning with
  Tree-Structured Reward Functions
Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions
Tom Bewley
Freddy Lecue
OffRL
58
12
0
20 Dec 2021
Variational Quantum Soft Actor-Critic
Variational Quantum Soft Actor-Critic
Qingfeng Lan
55
21
0
20 Dec 2021
Sample-Efficient Reinforcement Learning via Conservative Model-Based
  Actor-Critic
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
71
34
0
16 Dec 2021
Learning from Guided Play: A Scheduled Hierarchical Approach for
  Improving Exploration in Adversarial Imitation Learning
Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning
Trevor Ablett
Bryan Chan
Jonathan Kelly
68
4
0
16 Dec 2021
OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical
  Locomotion
OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical Locomotion
V. Barbera
Fabio Pardo
Yuval Tassa
M. Daley
C. Richards
Petar Kormushev
J. Hutchinson
56
12
0
11 Dec 2021
Deterministic and Discriminative Imitation (D2-Imitation): Revisiting
  Adversarial Imitation for Sample Efficiency
Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency
Mingfei Sun
Sam Devlin
Katja Hofmann
Shimon Whiteson
30
4
0
11 Dec 2021
ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep
  Reinforcement Learning
ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning
Xiao-Yang Liu
Zechu Li
Zhuoran Yang
Jiahao Zheng
Zhaoran Wang
A. Walid
Jian Guo
Michael I. Jordan
74
25
0
11 Dec 2021
Faster Deep Reinforcement Learning with Slower Online Network
Faster Deep Reinforcement Learning with Slower Online Network
Kavosh Asadi
Rasool Fakoor
Omer Gottesman
Taesup Kim
Michael L. Littman
Alexander J. Smola
OnRL
68
7
0
10 Dec 2021
Reward-Based Environment States for Robot Manipulation Policy Learning
Reward-Based Environment States for Robot Manipulation Policy Learning
Cédérick Mouliets
Isabelle Ferrané
Heriberto Cuayáhuitl
43
0
0
10 Dec 2021
A Validation Tool for Designing Reinforcement Learning Environments
A Validation Tool for Designing Reinforcement Learning Environments
Ruiyang Xu
Zhengxing Chen
OffRL
33
0
0
10 Dec 2021
An Experimental Design Perspective on Model-Based Reinforcement Learning
An Experimental Design Perspective on Model-Based Reinforcement Learning
Viraj Mehta
Biswajit Paria
J. Schneider
Stefano Ermon
Willie Neiswanger
OffRL
84
22
0
09 Dec 2021
ShinRL: A Library for Evaluating RL Algorithms from Theoretical and
  Practical Perspectives
ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives
Toshinori Kitamura
Ryo Yonetani
OffRL
146
4
0
08 Dec 2021
PTR-PPO: Proximal Policy Optimization with Prioritized Trajectory Replay
PTR-PPO: Proximal Policy Optimization with Prioritized Trajectory Replay
Xingxing Liang
Yang Ma
Yanghe Feng
Zhong Liu
61
10
0
07 Dec 2021
Functional Regularization for Reinforcement Learning via Learned Fourier
  Features
Functional Regularization for Reinforcement Learning via Learned Fourier Features
Alexander C. Li
Deepak Pathak
87
18
0
06 Dec 2021
Flexible Option Learning
Flexible Option Learning
Martin Klissarov
Doina Precup
OffRL
77
26
0
06 Dec 2021
Deep Policy Iteration with Integer Programming for Inventory Management
Deep Policy Iteration with Integer Programming for Inventory Management
Pavithra Harsha
A. Jagmohan
Jayant Kalagnanam
Brian Quanz
Divya Singhvi
46
1
0
04 Dec 2021
Coupling Vision and Proprioception for Navigation of Legged Robots
Coupling Vision and Proprioception for Navigation of Legged Robots
Zipeng Fu
Ashish Kumar
Ananye Agarwal
Haozhi Qi
Jitendra Malik
Deepak Pathak
72
78
0
03 Dec 2021
Learning a Robust Multiagent Driving Policy for Traffic Congestion
  Reduction
Learning a Robust Multiagent Driving Policy for Traffic Congestion Reduction
Yulin Zhang
William Macke
Jiaxun Cui
Daniel Urieli
Peter Stone
82
8
0
03 Dec 2021
Episodic Policy Gradient Training
Episodic Policy Gradient Training
Hung Le
Majid Abdolshah
Thommen George Karimpanal
Kien Do
D. Nguyen
Svetha Venkatesh
BDLOffRL
68
6
0
03 Dec 2021
Homotopy Based Reinforcement Learning with Maximum Entropy for
  Autonomous Air Combat
Homotopy Based Reinforcement Learning with Maximum Entropy for Autonomous Air Combat
Yiwen Zhu
Zhou Fang
Yuan Zheng
Wenya Wei
39
2
0
01 Dec 2021
Continuous Control With Ensemble Deep Deterministic Policy Gradients
Continuous Control With Ensemble Deep Deterministic Policy Gradients
Piotr Januszewski
Mateusz Olko
M. Królikowski
J. Swiatkowski
Marcin Andrychowicz
Lukasz Kuciñski
Piotr Milo's
OffRL
29
10
0
30 Nov 2021
SAGCI-System: Towards Sample-Efficient, Generalizable, Compositional,
  and Incremental Robot Learning
SAGCI-System: Towards Sample-Efficient, Generalizable, Compositional, and Incremental Robot Learning
Jun Lv
Qiaojun Yu
Lin Shao
Wenhai Liu
Wenqiang Xu
Cewu Lu
71
26
0
29 Nov 2021
Learning Long-Term Reward Redistribution via Randomized Return
  Decomposition
Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Zhizhou Ren
Ruihan Guo
Yuanshuo Zhou
Jian-wei Peng
127
38
0
26 Nov 2021
Robot Skill Adaptation via Soft Actor-Critic Gaussian Mixture Models
Robot Skill Adaptation via Soft Actor-Critic Gaussian Mixture Models
Iman Nematollahi
Erick Rosete-Beas
Adrian Rofer
Tim Welschehold
Abhinav Valada
Wolfram Burgard
69
16
0
25 Nov 2021
Learn Zero-Constraint-Violation Policy in Model-Free Constrained
  Reinforcement Learning
Learn Zero-Constraint-Violation Policy in Model-Free Constrained Reinforcement Learning
Haitong Ma
Changliu Liu
Shengbo Eben Li
Sifa Zheng
Wen Sun
Jianyu Chen
81
11
0
25 Nov 2021
Off-Policy Correction For Multi-Agent Reinforcement Learning
Off-Policy Correction For Multi-Agent Reinforcement Learning
Michał Zawalski
Bla.zej Osiñski
Henryk Michalewski
Piotr Milo's
OffRL
72
2
0
22 Nov 2021
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement
  Learning with Actor Rectification
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification
L. Pan
Longbo Huang
Tengyu Ma
Huazhe Xu
OffRLOnRL
117
55
0
22 Nov 2021
Renewable energy integration and microgrid energy trading using
  multi-agent deep reinforcement learning
Renewable energy integration and microgrid energy trading using multi-agent deep reinforcement learning
Daniel J. B. Harrold
Jun Cao
Zhongbo Fan
41
67
0
21 Nov 2021
Generalized Decision Transformer for Offline Hindsight Information
  Matching
Generalized Decision Transformer for Offline Hindsight Information Matching
Hiroki Furuta
Y. Matsuo
S. Gu
OffRL
116
104
0
19 Nov 2021
Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement
  Learning
Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning
Tong Sang
Hongyao Tang
Jianye Hao
Yan Zheng
Zhaopeng Meng
38
2
0
19 Nov 2021
Aggressive Q-Learning with Ensembles: Achieving Both High Sample
  Efficiency and High Asymptotic Performance
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
George Andriopoulos
OffRL
65
9
0
17 Nov 2021
CleanRL: High-quality Single-file Implementations of Deep Reinforcement
  Learning Algorithms
CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms
Shengyi Huang
Rousslan Fernand Julien Dossa
Chang Ye
Jeff Braga
OffRL
18
0
0
16 Nov 2021
GRI: General Reinforced Imitation and its Application to Vision-Based
  Autonomous Driving
GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving
Raphael Chekroun
Marin Toromanoff
Sascha Hornauer
Fabien Moutarde
92
61
0
16 Nov 2021
Improving Learning from Demonstrations by Learning from Experience
Improving Learning from Demonstrations by Learning from Experience
Hao-Kang Liu
Yiwen Chen
Jiayi Tan
M. Ang
OffRL
112
1
0
16 Nov 2021
Joint Synthesis of Safety Certificate and Safe Control Policy using
  Constrained Reinforcement Learning
Joint Synthesis of Safety Certificate and Safe Control Policy using Constrained Reinforcement Learning
Haitong Ma
Changliu Liu
Shengbo Eben Li
Sifa Zheng
Jianyu Chen
82
43
0
15 Nov 2021
Deep Reinforcement Learning with Shallow Controllers: An Experimental
  Application to PID Tuning
Deep Reinforcement Learning with Shallow Controllers: An Experimental Application to PID Tuning
Nathan P. Lawrence
M. Forbes
Philip D. Loewen
Daniel G. McClement
Johan U. Backstrom
R. Bhushan Gopaluni
OffRL
41
77
0
13 Nov 2021
Cooperative multi-agent reinforcement learning for high-dimensional
  nonequilibrium control
Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control
Shriram Chennakesavalu
Grant M. Rotskoff
18
1
0
12 Nov 2021
AWD3: Dynamic Reduction of the Estimation Bias
AWD3: Dynamic Reduction of the Estimation Bias
Dogan C. Cicek
Enes Duran
Baturay Saglam
Kagan Kaya
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
26
7
0
12 Nov 2021
Previous
123...282930...424344
Next