ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under
  Data Augmentation
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
108
136
0
01 Jul 2021
Offline-to-Online Reinforcement Learning via Balanced Replay and
  Pessimistic Q-Ensemble
Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble
Seunghyun Lee
Younggyo Seo
Kimin Lee
Pieter Abbeel
Jinwoo Shin
OffRLOnRL
76
192
0
01 Jul 2021
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Elliot Chane-Sane
Cordelia Schmid
Ivan Laptev
87
144
0
01 Jul 2021
MHER: Model-based Hindsight Experience Replay
MHER: Model-based Hindsight Experience Replay
Rui Yang
Meng Fang
Lei Han
Yali Du
Feng Luo
Xiu Li
OffRL
95
17
0
01 Jul 2021
SA-MATD3:Self-attention-based multi-agent continuous control method in
  cooperative environments
SA-MATD3:Self-attention-based multi-agent continuous control method in cooperative environments
Kai Liu
Yuyang Zhao
Gang Wang
Bei Peng
35
22
0
01 Jul 2021
The Values Encoded in Machine Learning Research
The Values Encoded in Machine Learning Research
Abeba Birhane
Pratyusha Kalluri
Dallas Card
William Agnew
Ravit Dotan
Michelle Bao
89
295
0
29 Jun 2021
VAT-Mart: Learning Visual Action Trajectory Proposals for Manipulating
  3D ARTiculated Objects
VAT-Mart: Learning Visual Action Trajectory Proposals for Manipulating 3D ARTiculated Objects
Ruihai Wu
Yan Zhao
Kaichun Mo
Zizheng Guo
Yian Wang
Tianhao Wu
Qingnan Fan
Xuelin Chen
Leonidas Guibas
Hao Dong
122
94
0
28 Jun 2021
Compositional Reinforcement Learning from Logical Specifications
Compositional Reinforcement Learning from Logical Specifications
Kishor Jothimurugan
Suguman Bansal
Osbert Bastani
Rajeev Alur
CoGe
135
81
0
25 Jun 2021
panda-gym: Open-source goal-conditioned environments for robotic
  learning
panda-gym: Open-source goal-conditioned environments for robotic learning
Quentin Gallouedec
Nicolas Cazin
Emmanuel Dellandrea
Liming Chen
OffRL
71
80
0
25 Jun 2021
Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic
  Manipulation via Discretisation
Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation
Stephen James
Kentaro Wada
Tristan Laidlow
Andrew J. Davison
100
134
0
23 Jun 2021
Local policy search with Bayesian optimization
Local policy search with Bayesian optimization
Sarah Müller
Alexander von Rohr
Sebastian Trimpe
BDL
89
42
0
22 Jun 2021
Off-Policy Reinforcement Learning with Delayed Rewards
Off-Policy Reinforcement Learning with Delayed Rewards
Beining Han
Zhizhou Ren
Zuofan Wu
Yuanshuo Zhou
Jian-wei Peng
OffRL
71
33
0
22 Jun 2021
OptiDICE: Offline Policy Optimization via Stationary Distribution
  Correction Estimation
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Jongmin Lee
Wonseok Jeon
Byung-Jun Lee
J. Pineau
Kee-Eung Kim
OffRL
176
101
0
21 Jun 2021
A Max-Min Entropy Framework for Reinforcement Learning
A Max-Min Entropy Framework for Reinforcement Learning
Seungyul Han
Y. Sung
98
23
0
19 Jun 2021
Behavioral Priors and Dynamics Models: Improving Performance and Domain
  Transfer in Offline RL
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL
Catherine Cang
Aravind Rajeswaran
Pieter Abbeel
Michael Laskin
OffRL
70
30
0
16 Jun 2021
Towards Automatic Actor-Critic Solutions to Continuous Control
Towards Automatic Actor-Critic Solutions to Continuous Control
J. E. Grigsby
Jinsu Yoo
Yanjun Qi
OffRL
78
6
0
16 Jun 2021
Offline RL Without Off-Policy Evaluation
Offline RL Without Off-Policy Evaluation
David Brandfonbrener
William F. Whitney
Rajesh Ranganath
Joan Bruna
OffRL
108
170
0
16 Jun 2021
Solving Continuous Control with Episodic Memory
Solving Continuous Control with Episodic Memory
Igor Kuznetsov
Andrey Filchenkov
CLLOffRL
48
19
0
16 Jun 2021
Deep Reinforcement Learning for Conservation Decisions
Deep Reinforcement Learning for Conservation Decisions
Marcus Lapeyrolerie
Melissa S. Chapman
Kari E. A. Norman
C. Boettiger
OffRL
124
18
0
15 Jun 2021
Population-coding and Dynamic-neurons improved Spiking Actor Network for
  Reinforcement Learning
Population-coding and Dynamic-neurons improved Spiking Actor Network for Reinforcement Learning
Duzhen Zhang
Tielin Zhang
Shuncheng Jia
Xiang Cheng
Bo Xu
AI4CE
73
1
0
15 Jun 2021
Analysis of a Target-Based Actor-Critic Algorithm with Linear Function
  Approximation
Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation
Anas Barakat
Pascal Bianchi
Julien Lehmann
91
9
0
14 Jun 2021
Characterizing the Gap Between Actor-Critic and Policy Gradient
Characterizing the Gap Between Actor-Critic and Policy Gradient
Junfeng Wen
Saurabh Kumar
Ramki Gummadi
Dale Schuurmans
92
15
0
13 Jun 2021
A Minimalist Approach to Offline Reinforcement Learning
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto
S. Gu
OffRL
143
831
0
12 Jun 2021
A Deep Reinforcement Learning Approach to Marginalized Importance
  Sampling with the Successor Representation
A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
Scott Fujimoto
David Meger
Doina Precup
76
17
0
12 Jun 2021
Recomposing the Reinforcement Learning Building Blocks with
  Hypernetworks
Recomposing the Reinforcement Learning Building Blocks with Hypernetworks
Shai Keynan
Elad Sarafian
Sarit Kraus
OffRL
93
30
0
12 Jun 2021
Lvio-Fusion: A Self-adaptive Multi-sensor Fusion SLAM Framework Using
  Actor-critic Method
Lvio-Fusion: A Self-adaptive Multi-sensor Fusion SLAM Framework Using Actor-critic Method
Yupeng Jia
Haiyong Luo
Fang Zhao
Guanlin Jiang
Yuhang Li
Jiaquan Yan
Zhuqing Jiang
Zitian Wang
OffRL
61
39
0
12 Jun 2021
Offline Reinforcement Learning as Anti-Exploration
Offline Reinforcement Learning as Anti-Exploration
Shideh Rezaeifar
Robert Dadashi
Nino Vieillard
Léonard Hussenot
Olivier Bachem
Olivier Pietquin
Matthieu Geist
OffRL
109
52
0
11 Jun 2021
Taylor Expansion of Discount Factors
Taylor Expansion of Discount Factors
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
63
5
0
11 Jun 2021
Dynamic Sparse Training for Deep Reinforcement Learning
Dynamic Sparse Training for Deep Reinforcement Learning
Ghada Sokar
Elena Mocanu
Decebal Constantin Mocanu
Mykola Pechenizkiy
Peter Stone
108
59
0
08 Jun 2021
Offline Policy Comparison under Limited Historical Agent-Environment
  Interactions
Offline Policy Comparison under Limited Historical Agent-Environment Interactions
Anton Dereventsov
Joseph Daws
Clayton Webster
OffRL
55
3
0
07 Jun 2021
XIRL: Cross-embodiment Inverse Reinforcement Learning
XIRL: Cross-embodiment Inverse Reinforcement Learning
Kevin Zakka
Andy Zeng
Peter R. Florence
Jonathan Tompson
Jeannette Bohg
Debidatta Dwibedi
SSL
99
122
0
07 Jun 2021
Control-Oriented Model-Based Reinforcement Learning with Implicit
  Differentiation
Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation
Evgenii Nikishin
Romina Abachi
Rishabh Agarwal
Pierre-Luc Bacon
OffRL
96
38
0
06 Jun 2021
ScheduleNet: Learn to solve multi-agent scheduling problems with
  reinforcement learning
ScheduleNet: Learn to solve multi-agent scheduling problems with reinforcement learning
Junyoung Park
Sanjar Bakhtiyar
Jinkyoo Park
70
39
0
06 Jun 2021
Efficient Continuous Control with Double Actors and Regularized Critics
Efficient Continuous Control with Double Actors and Regularized Critics
Jiafei Lyu
Xiaoteng Ma
Jiangpeng Yan
Xiu Li
OffRL
47
49
0
06 Jun 2021
Learning Routines for Effective Off-Policy Reinforcement Learning
Learning Routines for Effective Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
26
1
0
05 Jun 2021
Towards Learning to Play Piano with Dexterous Hands and Touch
Towards Learning to Play Piano with Dexterous Hands and Touch
Huazhe Xu
Yuping Luo
Shaoxiong Wang
Trevor Darrell
Roberto Calandra
170
30
0
03 Jun 2021
On the Convergence Rate of Off-Policy Policy Optimization Methods with
  Density-Ratio Correction
On the Convergence Rate of Off-Policy Policy Optimization Methods with Density-Ratio Correction
Jiawei Huang
Nan Jiang
98
5
0
02 Jun 2021
What Matters for Adversarial Imitation Learning?
What Matters for Adversarial Imitation Learning?
Manu Orsini
Anton Raichuk
Léonard Hussenot
Damien Vincent
Robert Dadashi
Sertan Girgin
Matthieu Geist
Olivier Bachem
Olivier Pietquin
Marcin Andrychowicz
114
78
0
01 Jun 2021
Q-attention: Enabling Efficient Learning for Vision-based Robotic
  Manipulation
Q-attention: Enabling Efficient Learning for Vision-based Robotic Manipulation
Stephen James
Andrew J. Davison
85
129
0
31 May 2021
Adversarial Intrinsic Motivation for Reinforcement Learning
Adversarial Intrinsic Motivation for Reinforcement Learning
Ishan Durugkar
Mauricio Tec
S. Niekum
Peter Stone
OOD
127
41
0
27 May 2021
Optimistic Reinforcement Learning by Forward Kullback-Leibler Divergence
  Optimization
Optimistic Reinforcement Learning by Forward Kullback-Leibler Divergence Optimization
Taisuke Kobayashi
61
14
0
27 May 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear
  Function Approximation
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation
Zaiwei Chen
S. Khodadadian
S. T. Maguluri
OffRL
103
31
0
26 May 2021
Safe Model-based Off-policy Reinforcement Learning for Eco-Driving in
  Connected and Automated Hybrid Electric Vehicles
Safe Model-based Off-policy Reinforcement Learning for Eco-Driving in Connected and Automated Hybrid Electric Vehicles
Zhaoxuan Zhu
Nicola Pivaro
Shobhit Gupta
Abhishek Gupta
Marcello Canova
OffRL
65
36
0
25 May 2021
Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring
  Statewise Safety
Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety
Haitong Ma
Yang Guan
Shegnbo Eben Li
Xiangteng Zhang
Sifa Zheng
Jianyu Chen
95
37
0
22 May 2021
Techniques Toward Optimizing Viewability in RTB Ad Campaigns Using
  Reinforcement Learning
Techniques Toward Optimizing Viewability in RTB Ad Campaigns Using Reinforcement Learning
M. Tashman
John Hoffman
Jiayi Xie
Feng Ye
Atefeh Morsali
Lee Winikor
Rouzbeh Gerami
OffRL
41
0
0
21 May 2021
Towards a Sample Efficient Reinforcement Learning Pipeline for Vision
  Based Robotics
Towards a Sample Efficient Reinforcement Learning Pipeline for Vision Based Robotics
Maxence Mahe
Pierre Belamri
Jesús Bujalance Martín
40
0
0
20 May 2021
A Stochastic Composite Augmented Lagrangian Method For Reinforcement
  Learning
A Stochastic Composite Augmented Lagrangian Method For Reinforcement Learning
Yongfeng Li
Mingming Zhao
Weijie Chen
Zaiwen Wen
42
5
0
20 May 2021
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Yue Wu
Shuangfei Zhai
Nitish Srivastava
J. Susskind
Jian Zhang
Ruslan Salakhutdinov
Hanlin Goh
EDLOffRLOnRL
80
190
0
17 May 2021
APPL: Adaptive Planner Parameter Learning
APPL: Adaptive Planner Parameter Learning
Xuesu Xiao
Zizhao Wang
Zifan Xu
Bo Liu
Garrett A. Warnell
Gauraang Dhamankar
Anirudh Nair
Peter Stone
72
52
0
17 May 2021
Regret Minimization Experience Replay in Off-Policy Reinforcement
  Learning
Regret Minimization Experience Replay in Off-Policy Reinforcement Learning
Xu-Hui Liu
Zhenghai Xue
Jing-Cheng Pang
Shengyi Jiang
Feng Xu
Yang Yu
OffRL
73
37
0
15 May 2021
Previous
123...323334...424344
Next