ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Offline RL With Resource Constrained Online Deployment
Offline RL With Resource Constrained Online Deployment
Jayanth Reddy Regatti
A. Deshmukh
Frank Cheng
Young Hun Jung
Abhishek Gupta
Ürün Dogan
OffRL
74
2
0
07 Oct 2021
Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
Benjamin Eysenbach
Alexander Khazatsky
Sergey Levine
Ruslan Salakhutdinov
OffRL
259
46
0
06 Oct 2021
On The Transferability of Deep-Q Networks
On The Transferability of Deep-Q Networks
M. Sabatelli
Pierre Geurts
83
2
0
06 Oct 2021
Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective
Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective
Ting-Han Fan
Peter J. Ramadge
CMLFAttOffRL
68
2
0
06 Oct 2021
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL
Wonjoon Goo
S. Niekum
OffRL
83
8
0
05 Oct 2021
Dropout Q-Functions for Doubly Efficient Reinforcement Learning
Dropout Q-Functions for Doubly Efficient Reinforcement Learning
Takuya Hiraoka
Takahisa Imagawa
Taisei Hashimoto
Takashi Onishi
Yoshimasa Tsuruoka
90
113
0
05 Oct 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified
  Q-Ensemble
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
185
283
0
04 Oct 2021
Large Batch Experience Replay
Large Batch Experience Replay
Thibault Lahire
Matthieu Geist
Emmanuel Rachelson
OffRL
100
13
0
04 Oct 2021
Parallel Actors and Learners: A Framework for Generating Scalable RL
  Implementations
Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations
Chi Zhang
S. Kuppannagari
Viktor Prasanna
OffRL
34
8
0
03 Oct 2021
Exploration of Artificial Intelligence-oriented Power System Dynamic
  Simulators
Exploration of Artificial Intelligence-oriented Power System Dynamic Simulators
Tannan Xiao
Ying-Cong Chen
Jianquan Wang
Shaowei Huang
Weilin Tong
Tirui He
42
15
0
03 Oct 2021
BRAC+: Improved Behavior Regularized Actor Critic for Offline
  Reinforcement Learning
BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning
Chi Zhang
S. Kuppannagari
Viktor Prasanna
OffRL
92
17
0
02 Oct 2021
Explanation-Aware Experience Replay in Rule-Dense Environments
Explanation-Aware Experience Replay in Rule-Dense Environments
Francesco Sovrano
Alex Raymond
Amanda Prorok
49
8
0
29 Sep 2021
On the Estimation Bias in Double Q-Learning
On the Estimation Bias in Double Q-Learning
Zhizhou Ren
Guangxiang Zhu
Haotian Hu
Beining Han
Jian-Hai Chen
Chongjie Zhang
71
17
0
29 Sep 2021
Learning Dynamics Models for Model Predictive Agents
Learning Dynamics Models for Model Predictive Agents
M. Lutter
Leonard Hasenclever
Arunkumar Byravan
Gabriel Dulac-Arnold
Piotr Trochim
N. Heess
J. Merel
Yuval Tassa
AI4CE
114
28
0
29 Sep 2021
Formulation and validation of a car-following model based on deep
  reinforcement learning
Formulation and validation of a car-following model based on deep reinforcement learning
Fabian Hart
Ostap Okhrin
M. Treiber
79
23
0
29 Sep 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative
  Survey
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
132
60
0
28 Sep 2021
A First-Occupancy Representation for Reinforcement Learning
A First-Occupancy Representation for Reinforcement Learning
Theodore H. Moskovitz
S. Wilson
M. Sahani
83
16
0
28 Sep 2021
Exploring More When It Needs in Deep Reinforcement Learning
Exploring More When It Needs in Deep Reinforcement Learning
Youtian Guo
Qitong Gao
29
0
0
28 Sep 2021
Not Only Domain Randomization: Universal Policy with Embedding System
  Identification
Not Only Domain Randomization: Universal Policy with Embedding System Identification
Zihan Ding
12
2
0
28 Sep 2021
Prioritized Experience-based Reinforcement Learning with Human Guidance
  for Autonomous Driving
Prioritized Experience-based Reinforcement Learning with Human Guidance for Autonomous Driving
Jingda Wu
Zhiyu Huang
Wenhui Huang
Chen Lv
105
77
0
26 Sep 2021
Deep Reinforcement Learning for Wireless Scheduling in Distributed Networked Control
Deep Reinforcement Learning for Wireless Scheduling in Distributed Networked Control
Gaoyang Pang
Kang Huang
Daniel E. Quevedo
Branka Vucetic
Yonghui Li
Wanchun Liu
87
18
0
26 Sep 2021
Parameter-free Reduction of the Estimation Bias in Deep Reinforcement
  Learning for Deterministic Policy Gradients
Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients
Baturay Saglam
Furkan B. Mutlu
Dogan C. Cicek
Suleyman S. Kozat
OffRL
53
3
0
24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with
  On-Policy Experience
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
51
33
0
24 Sep 2021
Hierarchies of Planning and Reinforcement Learning for Robot Navigation
Hierarchies of Planning and Reinforcement Learning for Robot Navigation
J. Wöhlke
Felix Schmitt
H. V. Hoof
39
24
0
23 Sep 2021
Real Robot Challenge: A Robotics Competition in the Cloud
Real Robot Challenge: A Robotics Competition in the Cloud
Stefan Bauer
Felix Widmaier
M. Wuthrich
Annika Buchholz
Sebastian Stark
...
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
S. Redmond
Bernhard Schölkopf
61
12
0
22 Sep 2021
A Workflow for Offline Model-Free Robotic Reinforcement Learning
A Workflow for Offline Model-Free Robotic Reinforcement Learning
Aviral Kumar
Anika Singh
Stephen Tian
Chelsea Finn
Sergey Levine
OffRL
215
87
0
22 Sep 2021
Estimation Error Correction in Deep Reinforcement Learning for
  Deterministic Actor-Critic Methods
Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods
Baturay Saglam
Enes Duran
Dogan C. Cicek
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
73
12
0
22 Sep 2021
A Reinforcement Learning Benchmark for Autonomous Driving in
  Intersection Scenarios
A Reinforcement Learning Benchmark for Autonomous Driving in Intersection Scenarios
Yuqi Liu
Qichao Zhang
Dongbin Zhao
OffRL
130
13
0
22 Sep 2021
MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep
  Reinforcement Learning
MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep Reinforcement Learning
Qiang He
Yuxun Qu
Chen Gong
Xinwen Hou
OffRL
85
10
0
22 Sep 2021
A Model-free Deep Reinforcement Learning Approach To Maneuver A
  Quadrotor Despite Single Rotor Failure
A Model-free Deep Reinforcement Learning Approach To Maneuver A Quadrotor Despite Single Rotor Failure
Paras Sharma
Prithvi Poddar
P. B. Sujit
31
5
0
22 Sep 2021
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Michael Wan
Jian-wei Peng
Tanmay Gangwani
107
6
0
18 Sep 2021
Density-based Curriculum for Multi-goal Reinforcement Learning with
  Sparse Rewards
Density-based Curriculum for Multi-goal Reinforcement Learning with Sparse Rewards
Deyu Yang
Hanbo Zhang
Xuguang Lan
Jishiyu Ding
OffRL
77
2
0
18 Sep 2021
Soft Actor-Critic With Integer Actions
Soft Actor-Critic With Integer Actions
Ting-Han Fan
Yubo Wang
69
15
0
17 Sep 2021
AdaLoss: A computationally-efficient and provably convergent adaptive
  gradient method
AdaLoss: A computationally-efficient and provably convergent adaptive gradient method
Xiaoxia Wu
Yuege Xie
S. Du
Rachel A. Ward
ODL
49
7
0
17 Sep 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
91
77
0
16 Sep 2021
Dynamics-Aware Quality-Diversity for Efficient Learning of Skill
  Repertoires
Dynamics-Aware Quality-Diversity for Efficient Learning of Skill Repertoires
Bryan Lim
Luca Grillotti
Lorenzo Bernasconi
Antoine Cully
122
28
0
16 Sep 2021
DCUR: Data Curriculum for Teaching via Samples with Reinforcement
  Learning
DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning
Daniel Seita
Abhinav Gopal
Zhao Mandi
John F. Canny
OffRLOnRL
44
0
0
15 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
86
102
0
14 Sep 2021
Direct Random Search for Fine Tuning of Deep Reinforcement Learning
  Policies
Direct Random Search for Fine Tuning of Deep Reinforcement Learning Policies
Sean Gillen
Asutay Ozmen
Katie Byl
32
0
0
12 Sep 2021
Encoding Distributional Soft Actor-Critic for Autonomous Driving in
  Multi-lane Scenarios
Encoding Distributional Soft Actor-Critic for Autonomous Driving in Multi-lane Scenarios
Jingliang Duan
Yangang Ren
Fawang Zhang
Yang Guan
Dongjie Yu
Shengbo Eben Li
B. Cheng
Lin Zhao
68
8
0
12 Sep 2021
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via
  Hybrid Action Representation
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation
Boyan Li
Hongyao Tang
Yan Zheng
Jianye Hao
Pengyi Li
Zhen Wang
Zhaopeng Meng
Li Wang
84
43
0
12 Sep 2021
Membership Inference Attacks Against Temporally Correlated Data in Deep
  Reinforcement Learning
Membership Inference Attacks Against Temporally Correlated Data in Deep Reinforcement Learning
Maziar Gomrokchi
Susan Amin
Hossein Aboutalebi
Alexander Wong
Doina Precup
MIACVAAML
86
3
0
08 Sep 2021
A Survey of Deep Reinforcement Learning in Recommender Systems: A
  Systematic Review and Future Directions
A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions
Xiaocong Chen
L. Yao
Julian McAuley
Guanglin Zhou
Xianzhi Wang
AI4TS
79
62
0
08 Sep 2021
ADER:Adapting between Exploration and Robustness for Actor-Critic
  Methods
ADER:Adapting between Exploration and Robustness for Actor-Critic Methods
Bo Zhou
Kejiao Li
Hongsheng Zeng
Fan Wang
Hao Tian
OffRL
69
1
0
08 Sep 2021
Optimizing Quantum Variational Circuits with Deep Reinforcement Learning
Optimizing Quantum Variational Circuits with Deep Reinforcement Learning
Owen Lockwood
86
9
0
07 Sep 2021
Optimal Stroke Learning with Policy Gradient Approach for Robotic Table
  Tennis
Optimal Stroke Learning with Policy Gradient Approach for Robotic Table Tennis
Yapeng Gao
Jonas Tebbe
A. Zell
OffRL
94
14
0
07 Sep 2021
Safety-Critical Learning of Robot Control with Temporal Logic
  Specifications
Safety-Critical Learning of Robot Control with Temporal Logic Specifications
Mingyu Cai
C. Vasile
109
4
0
07 Sep 2021
Error Controlled Actor-Critic
Error Controlled Actor-Critic
Xingen Gao
Chia-Wen Lin
Changle Zhou
Zhen Ge
Chih-Min Lin
Longzhi Yang
Xiang Chang
C. Shang
8
3
0
06 Sep 2021
Safe Reinforcement Learning using Formal Verification for Tissue
  Retraction in Autonomous Robotic-Assisted Surgery
Safe Reinforcement Learning using Formal Verification for Tissue Retraction in Autonomous Robotic-Assisted Surgery
Ameya Pore
Davide Corsi
Enrico Marchesini
Diego DallÁlba
A. Casals
Alessandro Farinelli
Paolo Fiorini
81
42
0
06 Sep 2021
Socially-Aware Multi-Agent Following with 2D Laser Scans via Deep
  Reinforcement Learning and Potential Field
Socially-Aware Multi-Agent Following with 2D Laser Scans via Deep Reinforcement Learning and Potential Field
Yuxiang Cui
Xiaolong Huang
Yue Wang
R. Xiong
29
2
0
04 Sep 2021
Previous
123...303132...424344
Next