ResearchTrend.AI
Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
94
22
0
09 Nov 2021
Coordinated Proximal Policy Optimization
Zifan Wu
Chao Yu
Deheng Ye
Junge Zhang
Haiyin Piao
H. Zhuo
85
46
0
07 Nov 2021
FinRL: Deep Reinforcement Learning Framework to Automate Trading in Quantitative Finance
Xiao-Yang Liu
Hongyang Yang
Jiechao Gao
Chris Wang
AIFin, OffRL
121
99
0
07 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL, GP
111
106
0
06 Nov 2021
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Sabela Ramos
Sertan Girgin
Léonard Hussenot
Damien Vincent
Hanna Yakubovich
...
Piotr Stańczyk
Raphaël Marinier
Jeremiah Harmsen
Olivier Pietquin
Nikola Momchev
OffRL
90
24
0
04 Nov 2021
Control of a fly-mimicking flyer in complex flow using deep reinforcement learning
Seungpyo Hong
Sejin Kim
D. You
AI4CE
49
2
0
04 Nov 2021
Confidence Composition for Monitors of Verification Assumptions
I. Ruchkin
Matthew Cleaveland
Radoslav Ivanov
Pengyuan Lu
Taylor J. Carpenter
O. Sokolsky
Insup Lee
103
13
0
03 Nov 2021
Balanced Q-learning: Combining the Influence of Optimistic and Pessimistic Targets
Thommen George Karimpanal
Hung Le
Majid Abdolshah
Santu Rana
Sunil R. Gupta
T. Tran
Svetha Venkatesh
64
5
0
03 Nov 2021
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay
Dogan C. Cicek
Enes Duran
Baturay Saglam
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
35
11
0
02 Nov 2021
A Hybrid Approach for Learning to Shift and Grasp with Elaborate Motion Primitives
Zohar Feldman
Hanna Ziesche
Ngo Anh Vien
Dotan Di Castro
70
16
0
02 Nov 2021
Human-Level Control without Server-Grade Hardware
Brett Daley
Chris Amato
BDL, OffRL
36
0
0
01 Nov 2021
Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method
Kuo Li
Qing-Shan Jia
OffRL
18
2
0
31 Oct 2021
An Actor-Critic Method for Simulation-Based Optimization
Kuo Li
Qing-Shan Jia
Jiaqi Yan
13
2
0
31 Oct 2021
Adjacency constraint for efficient hierarchical reinforcement learning
Tianren Zhang
Shangqi Guo
Tian Tan
Xiao M Hu
Feng Chen
87
17
0
30 Oct 2021
Generalized Proximal Policy Optimization with Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
152
51
0
29 Oct 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
Tung M. Luu
Chang D. Yoo
80
8
0
28 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching
Pierre-Alexandre Kamienny
Jean Tarbouriech
Sylvain Lamprier
A. Lazaric
Ludovic Denoyer
SSL
109
18
0
27 Oct 2021
Multi-Agent Reinforcement Learning for Active Voltage Control on Power Distribution Networks
Jianhong Wang
Wangkun Xu
Yunjie Gu
Wenbin Song
T. Green
86
128
0
27 Oct 2021
A Subgame Perfect Equilibrium Reinforcement Learning Approach to Time-inconsistent Problems
Nixie S. Lesmana
Chi Seng Pun
OffRL
38
4
0
27 Oct 2021
Learning Domain Invariant Representations in Goal-conditioned Block MDPs
Beining Han
Chongyi Zheng
Harris Chan
Keiran Paster
Michael Ruogu Zhang
Jimmy Ba
OOD, AI4CE
106
14
0
27 Oct 2021
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning
Junsup Kim
Younggyo Seo
Jinwoo Shin
121
62
0
26 Oct 2021
Automating Control of Overestimation Bias for Reinforcement Learning
Arsenii Kuznetsov
Alexander Grishin
Artem Tsypin
Arsenii Ashukha
Artur Kadurin
Dmitry Vetrov
OffRL
47
2
0
26 Oct 2021
Recurrent Off-policy Baselines for Memory-based Continuous Control
Zhihan Yang
Hai V. Nguyen
CLL, OffRL
80
24
0
25 Oct 2021
Learning Insertion Primitives with Discrete-Continuous Hybrid Action Space for Robotic Assembly Tasks
Yongyu Wang
Shiyu Jin
Changhao Wang
Xinghao Zhu
Masayoshi Tomizuka
138
42
0
25 Oct 2021
False Correlation Reduction for Offline Reinforcement Learning
Arvindkumar Krishnakumar
Zuyue Fu
Lingxiao Wang
Zhuoran Yang
Chenjia Bai
Tianyi Zhou
Judy Hoffman
Jing Jiang
OffRL
76
9
0
24 Oct 2021
DiffSRL: Learning Dynamical State Representation for Deformable Object Manipulation with Differentiable Simulator
Sirui Chen
Yunhao Liu
Jialong Li
Shang Wen Yao
Tingxiang Fan
Jia Pan
AI4CE
69
10
0
24 Oct 2021
Off-policy Reinforcement Learning with Optimistic Exploration and Distribution Correction
A. Ahmad
Shuo Cheng
D. Saraswat
Aly El Gamal
Wenjie Wang
Gurmukh Johal
OffRL, OnRL
32
1
0
22 Oct 2021
A Versatile and Efficient Reinforcement Learning Framework for Autonomous Driving
Guan-Bo Wang
Haoyi Niu
Desheng Zhu
Jianming Hu
Xianyuan Zhan
Guyue Zhou
OffRL
100
2
0
22 Oct 2021
Is High Variance Unavoidable in RL? A Case Study in Continuous Control
Johan Bjorck
Carla P. Gomes
Kilian Q. Weinberger
92
23
0
21 Oct 2021
Efficient Robotic Manipulation Through Offline-to-Online Reinforcement Learning and Goal-Aware State Information
Jin Li
Xianyuan Zhan
Zixu Xiao
Guyue Zhou
OffRL, OnRL
59
2
0
21 Oct 2021
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
Matthieu Geist
Olivier Pietquin
OffRL
103
23
0
19 Oct 2021
Balancing Value Underestimation and Overestimation with Realistic Actor-Critic
Sicen Li
Qinyun Tang
G. Wang
Xinmeng Ma
Li-quan Wang
OffRL
69
4
0
19 Oct 2021
RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System
Kai Wang
Zhene Zou
Minghao Zhao
Qilin Deng
Yue Shang
Yile Liang
Runze Wu
Xudong Shen
Tangjie Lyu
Changjie Fan
OffRL
43
9
0
18 Oct 2021
Damped Anderson Mixing for Deep Reinforcement Learning: Acceleration, Convergence, and Stabilization
Ke Sun
Yafei Wang
Yi Liu
Yingnan Zhao
Bo Pan
Shangling Jui
Bei Jiang
Linglong Kong
44
11
0
17 Oct 2021
SaLinA: Sequential Learning of Agents
Ludovic Denoyer
Alfredo De la Fuente
S. Duong
Jean-Baptiste Gaya
Pierre-Alexandre Kamienny
Daniel H. Thompson
94
11
0
15 Oct 2021
Offline Reinforcement Learning with Soft Behavior Regularization
Haoran Xu
Xianyuan Zhan
Jianxiong Li
Honglei Yin
OffRL
79
31
0
14 Oct 2021
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
312
931
0
12 Oct 2021
Learning from Ambiguous Demonstrations with Self-Explanation Guided Reinforcement Learning
Yantian Zha
L. Guan
Subbarao Kambhampati
97
6
0
11 Oct 2021
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Tianwei Ni
Benjamin Eysenbach
Ruslan Salakhutdinov
81
110
0
11 Oct 2021
Bid Optimization using Maximum Entropy Reinforcement Learning
Mengjuan Liu
Jinyu Liu
Zhengning Hu
Yuchen Ge
Xuyun Nie
35
5
0
11 Oct 2021
Multi-condition multi-objective optimization using deep reinforcement learning
Sejin Kim
Innyoung Kim
D. You
AI4CE
76
29
0
10 Oct 2021
A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise Datasets
J. E. Grigsby
Yanjun Qi
OffRL
96
5
0
10 Oct 2021
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization
S. Gu
Manfred Diaz
Daniel Freeman
Hiroki Furuta
Seyed Kamyar Seyed Ghasemipour
Anton Raichuk
Byron David
Erik Frey
Erwin Coumans
Olivier Bachem
80
14
0
10 Oct 2021
Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation
Junhong Shen
Lin F. Yang
OffRL
51
17
0
09 Oct 2021
Improving Kinodynamic Planners for Vehicular Navigation with Learned Goal-Reaching Controllers
Aravind Sivaramakrishnan
Edgar Granados
Seth Karten
T. McMahon
Kostas E. Bekris
49
7
0
08 Oct 2021
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters
Vladislav Kurenkov
Sergey Kolesnikov
OffRL
99
24
0
08 Oct 2021
Learning to Centralize Dual-Arm Assembly
Marvin Alles
Elie Aljalbout
67
18
0
08 Oct 2021
Active Extrinsic Contact Sensing: Application to General Peg-in-Hole Insertion
Sangwoon Kim
Alberto Rodriguez
98
70
0
07 Oct 2021
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
OffRL
83
17
0
07 Oct 2021
Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations
Sindre Benjamin Remman
A. Lekkas
55
14
0
07 Oct 2021