ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Generalizable Episodic Memory for Deep Reinforcement Learning
Generalizable Episodic Memory for Deep Reinforcement Learning
Haotian Hu
Jianing Ye
Guangxiang Zhu
Zhizhou Ren
Chongjie Zhang
OffRL
84
39
0
11 Mar 2021
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Benjamin Eysenbach
Sergey Levine
OOD
128
186
0
10 Mar 2021
Learning to Play Soccer From Scratch: Sample-Efficient Emergent
  Coordination through Curriculum-Learning and Competition
Learning to Play Soccer From Scratch: Sample-Efficient Emergent Coordination through Curriculum-Learning and Competition
Pavan Samtani
Francisco Leiva
Javier Ruiz-del-Solar
40
2
0
09 Mar 2021
Model-free Policy Learning with Reward Gradients
Model-free Policy Learning with Reward Gradients
Qingfeng Lan
Samuele Tosatto
Homayoon Farrahi
Rupam Mahmood
49
6
0
09 Mar 2021
Instabilities of Offline RL with Pre-Trained Neural Representation
Instabilities of Offline RL with Pre-Trained Neural Representation
Ruosong Wang
Yifan Wu
Ruslan Salakhutdinov
Sham Kakade
OffRL
158
42
0
08 Mar 2021
Can You Fix My Neural Network? Real-Time Adaptive Waveform Synthesis for
  Resilient Wireless Signal Classification
Can You Fix My Neural Network? Real-Time Adaptive Waveform Synthesis for Resilient Wireless Signal Classification
Salvatore D’oro
Francesco Restuccia
Tommaso Melodia
44
11
0
05 Mar 2021
Foresee then Evaluate: Decomposing Value Estimation with Latent Future
  Prediction
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Chong Chen
Yaodong Yang
Lu Zhang
Wulong Liu
Zhaopeng Meng
OffRL
138
4
0
03 Mar 2021
Offline Reinforcement Learning with Pseudometric Learning
Offline Reinforcement Learning with Pseudometric Learning
Robert Dadashi
Shideh Rezaeifar
Nino Vieillard
Léonard Hussenot
Olivier Pietquin
Matthieu Geist
OffRL
103
41
0
02 Mar 2021
Decision Making in Monopoly using a Hybrid Deep Reinforcement Learning
  Approach
Decision Making in Monopoly using a Hybrid Deep Reinforcement Learning Approach
Trevor Bonjour
Marina Haliem
A. Alsalem
Shilpa Thomas
Hongyu Li
Vaneet Aggarwal
Mayank Kejriwal
Bharat K. Bhargava
97
15
0
01 Mar 2021
Sim-to-Real Transfer for Robotic Manipulation with Tactile Sensory
Sim-to-Real Transfer for Robotic Manipulation with Tactile Sensory
Zihan Ding
Ya-Yen Tsai
Wang Wei Lee
Bidan Huang
35
28
0
28 Feb 2021
Revisiting Peng's Q($λ$) for Modern Reinforcement Learning
Revisiting Peng's Q(λλλ) for Modern Reinforcement Learning
Tadashi Kozuno
Yunhao Tang
Mark Rowland
Rémi Munos
Steven Kapturowski
Will Dabney
Michal Valko
David Abel
OffRL
55
19
0
27 Feb 2021
Off-Policy Imitation Learning from Observations
Off-Policy Imitation Learning from Observations
Zhuangdi Zhu
Kaixiang Lin
Bo Dai
Jiayu Zhou
OffRL
57
86
0
25 Feb 2021
Deep Reinforcement Learning for Safe Landing Site Selection with
  Concurrent Consideration of Divert Maneuvers
Deep Reinforcement Learning for Safe Landing Site Selection with Concurrent Consideration of Divert Maneuvers
Keidai Iiyama
Kento Tomita
Bhavi Jagatia
Tatsuwaki Nakagawa
K. Ho
65
14
0
24 Feb 2021
Memory-based Deep Reinforcement Learning for POMDPs
Memory-based Deep Reinforcement Learning for POMDPs
Lingheng Meng
R. Gorbet
Dana Kulic
104
100
0
24 Feb 2021
FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with
  Quantization-Aware Training and Adaptive Parallelism
FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with Quantization-Aware Training and Adaptive Parallelism
Jenny Yang
Seongmin Hong
Joo-Young Kim
50
18
0
24 Feb 2021
Honey, I Shrunk The Actor: A Case Study on Preserving Performance with
  Smaller Actors in Actor-Critic RL
Honey, I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL
Siddharth Mysore
B. Mabsout
R. Mancuso
Kate Saenko
OffRL
38
9
0
23 Feb 2021
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Tengyu Xu
Zhuoran Yang
Zhaoran Wang
Yingbin Liang
OffRL
106
25
0
23 Feb 2021
Mixed Policy Gradient: off-policy reinforcement learning driven jointly
  by data and model
Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model
Yang Guan
Jingliang Duan
Shengbo Eben Li
Jie Li
Jianyu Chen
B. Cheng
OffRL
77
12
0
23 Feb 2021
DeepThermal: Combustion Optimization for Thermal Power Generating Units
  Using Offline Reinforcement Learning
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning
Xianyuan Zhan
Haoran Xu
Yueying Zhang
Xiangyu Zhu
Honglei Yin
Yu Zheng
OffRLAI4CE
135
69
0
23 Feb 2021
HALMA: Humanlike Abstraction Learning Meets Affordance in Rapid Problem
  Solving
HALMA: Humanlike Abstraction Learning Meets Affordance in Rapid Problem Solving
Sirui Xie
Xiaojian Ma
Peiyu Yu
Yixin Zhu
Ying Nian Wu
Song-Chun Zhu
94
20
0
22 Feb 2021
Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy
  Reinforcement Learning
Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning
Brett Daley
Cameron Hickert
Chris Amato
OffRL
23
5
0
22 Feb 2021
Reinforcement Learning with Prototypical Representations
Reinforcement Learning with Prototypical Representations
Denis Yarats
Rob Fergus
A. Lazaric
Lerrel Pinto
SSL
81
226
0
22 Feb 2021
Reinforcement Learning of the Prediction Horizon in Model Predictive
  Control
Reinforcement Learning of the Prediction Horizon in Model Predictive Control
Eivind Bøhn
S. Gros
Signe Moe
T. Johansen
44
36
0
22 Feb 2021
Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement
  Learning via Frank-Wolfe Policy Optimization
Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization
Jyun-Li Lin
Wei-Ting Hung
Shangtong Yang
Ping-Chun Hsieh
Xi Liu
110
14
0
22 Feb 2021
Dealing with Non-Stationarity in MARL via Trust-Region Decomposition
Dealing with Non-Stationarity in MARL via Trust-Region Decomposition
Wenhao Li
Xiangfeng Wang
Bo Jin
Junjie Sheng
H. Zha
122
9
0
21 Feb 2021
Decentralized Deterministic Multi-Agent Reinforcement Learning
Decentralized Deterministic Multi-Agent Reinforcement Learning
Antoine Grosnit
D. Cai
L. Wynter
OffRL
115
7
0
19 Feb 2021
Improved Deep Reinforcement Learning with Expert Demonstrations for
  Urban Autonomous Driving
Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving
Haochen Liu
Zhiyu Huang
Jingda Wu
Chen Lv
86
74
0
18 Feb 2021
Continuous Doubly Constrained Batch Reinforcement Learning
Continuous Doubly Constrained Batch Reinforcement Learning
Rasool Fakoor
Jonas W. Mueller
Kavosh Asadi
Pratik Chaudhari
Alex Smola
OffRL
283
27
0
18 Feb 2021
TradeR: Practical Deep Hierarchical Reinforcement Learning for Trade
  Execution
TradeR: Practical Deep Hierarchical Reinforcement Learning for Trade Execution
Karush Suri
Xiaolong Shi
Konstantinos Plataniotis
Y. Lawryshyn
OffRL
42
4
0
16 Feb 2021
COMBO: Conservative Offline Model-Based Policy Optimization
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
302
433
0
16 Feb 2021
Steadily Learn to Drive with Virtual Memory
Steadily Learn to Drive with Virtual Memory
Yuhang Zhang
Yao Mu
Yujie Yang
Yang Guan
Shengbo Eben Li
Qi Sun
Jianyu Chen
40
1
0
16 Feb 2021
Transferring Domain Knowledge with an Adviser in Continuous Tasks
Transferring Domain Knowledge with an Adviser in Continuous Tasks
Rukshan Wijesinghe
Kasun Vithanage
Dumindu Tissera
A. Xavier
Subha Fernando
Jayathu Samarawickrama
CLL
40
0
0
16 Feb 2021
Training Larger Networks for Deep Reinforcement Learning
Training Larger Networks for Deep Reinforcement Learning
Keita Ota
Devesh K. Jha
Asako Kanezaki
OffRL
97
40
0
16 Feb 2021
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous
  Agents via Personalized Simulators
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
Anish Agarwal
Abdullah Alomar
Varkey Alumootil
Devavrat Shah
Dennis Shen
Zhi Xu
Cindy Yang
OffRL
76
18
0
13 Feb 2021
Derivative-Free Reinforcement Learning: A Review
Derivative-Free Reinforcement Learning: A Review
Hong Qian
Yang Yu
OffRL
134
42
0
10 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
55
13
0
09 Feb 2021
Model-Augmented Q-learning
Model-Augmented Q-learning
Youngmin Oh
Jinwoo Shin
Eunho Yang
Sung Ju Hwang
OffRL
43
1
0
07 Feb 2021
Tactical Optimism and Pessimism for Deep Reinforcement Learning
Tactical Optimism and Pessimism for Deep Reinforcement Learning
Theodore H. Moskovitz
Jack Parker-Holder
Aldo Pacchiano
Michael Arbel
Michael I. Jordan
96
59
0
07 Feb 2021
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've
  Learned
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned
Julian Ibarz
Jie Tan
Chelsea Finn
Mrinal Kalakrishnan
P. Pastor
Sergey Levine
OffRL
158
535
0
04 Feb 2021
Learning-based vs Model-free Adaptive Control of a MAV under Wind Gust
Learning-based vs Model-free Adaptive Control of a MAV under Wind Gust
Thomas Chaffre
Julien Moras
Adrien Chan-Hon-Tong
J. Marzat
Karl Sammut
G. Chenadec
Benoit Clement
44
5
0
29 Jan 2021
OffCon$^3$: What is state of the art anyway?
OffCon3^33: What is state of the art anyway?
Philip J. Ball
Stephen J. Roberts
OffRL
82
8
0
27 Jan 2021
Learning Synthetic Environments for Reinforcement Learning with
  Evolution Strategies
Learning Synthetic Environments for Reinforcement Learning with Evolution Strategies
Fabio Ferreira
Thomas Nierhoff
Frank Hutter
57
8
0
24 Jan 2021
GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning
GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning
Juhyoung Lee
Sangyeob Kim
Sangjin Kim
Wooyoung Jo
H. Yoo
OffRL
63
9
0
24 Jan 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient
  Reinforcement Learning
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
William F. Whitney
Michael Bloesch
Jost Tobias Springenberg
A. Abdolmaleki
Kyunghyun Cho
Martin Riedmiller
OffRL
72
16
0
23 Jan 2021
Breaking the Deadly Triad with a Target Network
Breaking the Deadly Triad with a Target Network
Shangtong Zhang
Hengshuai Yao
Shimon Whiteson
AAML
127
45
0
21 Jan 2021
Robust Reinforcement Learning on State Observations with Learned Optimal
  Adversary
Robust Reinforcement Learning on State Observations with Learned Optimal Adversary
Huan Zhang
Hongge Chen
Duane S. Boning
Cho-Jui Hsieh
121
168
0
21 Jan 2021
Learning Kinematic Feasibility for Mobile Manipulation through Deep
  Reinforcement Learning
Learning Kinematic Feasibility for Mobile Manipulation through Deep Reinforcement Learning
Daniel Honerkamp
Tim Welschehold
Abhinav Valada
76
49
0
13 Jan 2021
Evolving Reinforcement Learning Algorithms
Evolving Reinforcement Learning Algorithms
John D. Co-Reyes
Yingjie Miao
Daiyi Peng
Esteban Real
Sergey Levine
Quoc V. Le
Honglak Lee
Aleksandra Faust
131
74
0
08 Jan 2021
Average-Reward Off-Policy Policy Evaluation with Function Approximation
Average-Reward Off-Policy Policy Evaluation with Function Approximation
Shangtong Zhang
Yi Wan
R. Sutton
Shimon Whiteson
OffRL
73
31
0
08 Jan 2021
A Survey of Deep RL and IL for Autonomous Driving Policy Learning
A Survey of Deep RL and IL for Autonomous Driving Policy Learning
Zeyu Zhu
Huijing Zhao
147
159
0
06 Jan 2021
Previous
123...343536...424344
Next