ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Transfer Learning Across Simulated Robots With Different Sensors
Transfer Learning Across Simulated Robots With Different Sensors
Hélène Plisnier
Denis Steckelmacher
D. Roijers
A. Nowé
OffRL
127
1
0
18 Jul 2019
Learning Self-Correctable Policies and Value Functions from
  Demonstrations with Negative Sampling
Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling
Yuping Luo
Huazhe Xu
Tengyu Ma
SSL
78
13
0
12 Jul 2019
Deep Active Inference as Variational Policy Gradients
Deep Active Inference as Variational Policy Gradients
Beren Millidge
BDL
89
103
0
08 Jul 2019
On-Policy Robot Imitation Learning from a Converging Supervisor
On-Policy Robot Imitation Learning from a Converging Supervisor
Ashwin Balakrishna
Brijen Thananjeyan
Jonathan Lee
Felix Li
Arsh Zahed
Joseph E. Gonzalez
Ken Goldberg
141
17
0
08 Jul 2019
Benchmarking Model-Based Reinforcement Learning
Benchmarking Model-Based Reinforcement Learning
Tingwu Wang
Xuchan Bao
I. Clavera
Jerrick Hoang
Yeming Wen
Eric D. Langlois
Matthew Shunshi Zhang
Guodong Zhang
Pieter Abbeel
Jimmy Ba
OffRL
79
365
0
03 Jul 2019
Modified Actor-Critics
Modified Actor-Critics
Erinc Merdivan
S. Hanke
Matthieu Geist
43
2
0
02 Jul 2019
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a
  Latent Variable Model
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
Alex X. Lee
Anusha Nagabandi
Pieter Abbeel
Sergey Levine
OffRLBDL
93
382
0
01 Jul 2019
FiDi-RL: Incorporating Deep Reinforcement Learning with Finite-Difference Policy Search for Efficient Learning of Continuous Control
Longxiang Shi
Shijian Li
LongBing Cao
Long Yang
Gang Zheng
Gang Pan
26
5
0
01 Jul 2019
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human
  Preferences in Dialog
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Natasha Jaques
Asma Ghandeharioun
J. Shen
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
152
343
0
30 Jun 2019
Policy Optimization with Stochastic Mirror Descent
Policy Optimization with Stochastic Mirror Descent
Long Yang
Yu Zhang
Gang Zheng
Qian Zheng
Pengfei Li
Jianhang Huang
Jun Wen
Gang Pan
128
34
0
25 Jun 2019
Proximal Distilled Evolutionary Reinforcement Learning
Proximal Distilled Evolutionary Reinforcement Learning
Cristian Bodnar
Ben Day
Pietro Lio
101
76
0
24 Jun 2019
Exploring Model-based Planning with Policy Networks
Exploring Model-based Planning with Policy Networks
Tingwu Wang
Jimmy Ba
114
150
0
20 Jun 2019
Reward Prediction Error as an Exploration Objective in Deep RL
Reward Prediction Error as an Exploration Objective in Deep RL
Riley Simmons-Edler
Ben Eisner
Daniel Yang
Anthony Bisulco
E. Mitchell
Sebastian Seung
Daniel D. Lee
51
5
0
19 Jun 2019
Evolutionary Reinforcement Learning for Sample-Efficient Multiagent
  Coordination
Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination
Shauharda Khadka
Somdeb Majumdar
Santiago Miret
Stephen McAleer
Kagan Tumer
68
60
0
18 Jun 2019
Is the Policy Gradient a Gradient?
Is the Policy Gradient a Gradient?
Chris Nota
Philip S. Thomas
94
58
0
17 Jun 2019
Deep Reinforcement Learning for Industrial Insertion Tasks with Visual
  Inputs and Natural Rewards
Deep Reinforcement Learning for Industrial Insertion Tasks with Visual Inputs and Natural Rewards
Gerrit Schoettler
Ashvin Nair
Jianlan Luo
Shikhar Bahl
J. A. Ojea
Eugen Solowjow
Sergey Levine
OffRL
59
192
0
13 Jun 2019
Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning
Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning
Georgios Papoudakis
Filippos Christianos
Arrasy Rahman
Stefano V. Albrecht
90
191
0
11 Jun 2019
Boosting Soft Actor-Critic: Emphasizing Recent Experience without
  Forgetting the Past
Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past
Che Wang
George Andriopoulos
51
45
0
10 Jun 2019
Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
Mahmoud Assran
Joshua Romoff
Nicolas Ballas
Joelle Pineau
Michael G. Rabbat
64
33
0
09 Jun 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRLOnRL
150
1,068
0
03 Jun 2019
Safety Augmented Value Estimation from Demonstrations (SAVED): Safe Deep
  Model-Based RL for Sparse Cost Robotic Tasks
Safety Augmented Value Estimation from Demonstrations (SAVED): Safe Deep Model-Based RL for Sparse Cost Robotic Tasks
Brijen Thananjeyan
Ashwin Balakrishna
Ugo Rosolia
Felix Li
R. McAllister
Joseph E. Gonzalez
Sergey Levine
Francesco Borrelli
Ken Goldberg
OffRL
92
4
0
31 May 2019
Disentangling Dynamics and Returns: Value Function Decomposition with
  Future Prediction
Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Zhaopeng Meng
Yaodong Yang
Li Wang
33
2
0
27 May 2019
Learning to Discretize: Solving 1D Scalar Conservation Laws via Deep
  Reinforcement Learning
Learning to Discretize: Solving 1D Scalar Conservation Laws via Deep Reinforcement Learning
Yufei Wang
Ziju Shen
Zichao Long
Bin Dong
AI4CEPINN
77
40
0
27 May 2019
Composing Task-Agnostic Policies with Deep Reinforcement Learning
Composing Task-Agnostic Policies with Deep Reinforcement Learning
A. H. Qureshi
Jacob J. Johnson
Yuzhe Qin
Taylor Henderson
Byron Boots
Michael C. Yip
OffRL
76
30
0
25 May 2019
Mechatronic Design of a Dribbling System for RoboCup Small Size Robot
Mechatronic Design of a Dribbling System for RoboCup Small Size Robot
Zheyuan Huang
Yunkai Wang
Lingyun Chen
Jiacheng Li
Zexi Chen
R. Xiong
19
1
0
24 May 2019
Distributional Policy Optimization: An Alternative Approach for
  Continuous Control
Distributional Policy Optimization: An Alternative Approach for Continuous Control
Chen Tessler
Guy Tennenholtz
Shie Mannor
OffRL
49
44
0
23 May 2019
REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic
  Learning
REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning
Brian Yang
Jesse Zhang
Vitchyr H. Pong
Sergey Levine
Dinesh Jayaraman
77
37
0
17 May 2019
Leveraging exploration in off-policy algorithms via normalizing flows
Leveraging exploration in off-policy algorithms via normalizing flows
Bogdan Mazoure
T. Doan
A. Durand
R. Devon Hjelm
Joelle Pineau
OnRL
66
62
0
16 May 2019
Dimension-Wise Importance Sampling Weight Clipping for Sample-Efficient
  Reinforcement Learning
Dimension-Wise Importance Sampling Weight Clipping for Sample-Efficient Reinforcement Learning
Seungyul Han
Y. Sung
OffRL
60
20
0
07 May 2019
P3O: Policy-on Policy-off Policy Optimization
P3O: Policy-on Policy-off Policy Optimization
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
84
56
0
05 May 2019
Deep Residual Reinforcement Learning
Deep Residual Reinforcement Learning
Shangtong Zhang
Wendelin Bohmer
Shimon Whiteson
89
32
0
03 May 2019
Collaborative Evolutionary Reinforcement Learning
Collaborative Evolutionary Reinforcement Learning
Shauharda Khadka
Somdeb Majumdar
Tarek Nassar
Zach Dwiel
E. Tumer
Santiago Miret
Yinyin Liu
Kagan Tumer
64
100
0
02 May 2019
RL-GAN-Net: A Reinforcement Learning Agent Controlled GAN Network for
  Real-Time Point Cloud Shape Completion
RL-GAN-Net: A Reinforcement Learning Agent Controlled GAN Network for Real-Time Point Cloud Shape Completion
M. Sarmad
H. J. Lee
Y. Kim
3DPC
90
181
0
28 Apr 2019
Model-free Deep Reinforcement Learning for Urban Autonomous Driving
Model-free Deep Reinforcement Learning for Urban Autonomous Driving
Jianyu Chen
Bodi Yuan
Masayoshi Tomizuka
75
267
0
20 Apr 2019
A Hitchhiker's Guide to Statistical Comparisons of Reinforcement
  Learning Algorithms
A Hitchhiker's Guide to Statistical Comparisons of Reinforcement Learning Algorithms
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
77
64
0
15 Apr 2019
Active Domain Randomization
Active Domain Randomization
Bhairav Mehta
Manfred Diaz
Florian Golemo
C. Pal
Liam Paull
84
265
0
09 Apr 2019
Regularizing Trajectory Optimization with Denoising Autoencoders
Regularizing Trajectory Optimization with Denoising Autoencoders
Rinu Boney
Norman Di Palo
Mathias Berglund
Alexander Ilin
Arno Solin
Antti Rasmus
Harri Valpola
56
10
0
28 Mar 2019
How to pick the domain randomization parameters for sim-to-real transfer
  of reinforcement learning policies?
How to pick the domain randomization parameters for sim-to-real transfer of reinforcement learning policies?
Q. Vuong
Sharad Vikram
H. Su
Sicun Gao
Henrik I. Christensen
OOD
79
49
0
28 Mar 2019
Generalized Off-Policy Actor-Critic
Generalized Off-Policy Actor-Critic
Shangtong Zhang
Wendelin Bohmer
Shimon Whiteson
OffRLCML
132
43
0
27 Mar 2019
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Riley Simmons-Edler
Ben Eisner
E. Mitchell
Sebastian Seung
Daniel D. Lee
97
29
0
25 Mar 2019
Towards Characterizing Divergence in Deep Q-Learning
Towards Characterizing Divergence in Deep Q-Learning
Joshua Achiam
Ethan Knight
Pieter Abbeel
55
98
0
21 Mar 2019
Truly Proximal Policy Optimization
Truly Proximal Policy Optimization
Yuhui Wang
Hao He
Chao Wen
Xiaoyang Tan
78
126
0
19 Mar 2019
Trajectory Optimization for Unknown Constrained Systems using
  Reinforcement Learning
Trajectory Optimization for Unknown Constrained Systems using Reinforcement Learning
Keita Ota
Devesh K. Jha
Tomoaki Oiki
Mamoru Miura
Takashi Nammoto
D. Nikovski
T. Mariyama
60
27
0
13 Mar 2019
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy
  Critics
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
67
17
0
11 Mar 2019
Skew-Fit: State-Covering Self-Supervised Reinforcement Learning
Skew-Fit: State-Covering Self-Supervised Reinforcement Learning
Vitchyr H. Pong
Murtaza Dalal
Steven Lin
Ashvin Nair
Shikhar Bahl
Sergey Levine
OffRLSSL
129
277
0
08 Mar 2019
Learning Hierarchical Teaching Policies for Cooperative Agents
Learning Hierarchical Teaching Policies for Cooperative Agents
Dong-Ki Kim
Miao Liu
Shayegan Omidshafiei
Sebastian Lopez-Cot
Matthew D Riemer
Golnaz Habibi
Gerald Tesauro
Sami Mourad
Murray Campbell
Jonathan P. How
65
7
0
07 Mar 2019
The AI Driving Olympics at NeurIPS 2018
The AI Driving Olympics at NeurIPS 2018
J. Zilly
J. Tani
Breandan Considine
Bhairav Mehta
Andrea F. Daniele
...
R. Hristov
S. Mallya
Emilio Frazzoli
A. Censi
Liam Paull
85
14
0
06 Mar 2019
A Regularized Approach to Sparse Optimal Policy in Reinforcement
  Learning
A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning
Xiang Li
Wenhao Yang
Zhihua Zhang
29
2
0
02 Mar 2019
Catalyst.RL: A Distributed Framework for Reproducible RL Research
Catalyst.RL: A Distributed Framework for Reproducible RL Research
Sergey Kolesnikov
Oleksii Hrinchuk
OffRL
42
8
0
28 Feb 2019
Diagnosing Bottlenecks in Deep Q-learning Algorithms
Diagnosing Bottlenecks in Deep Q-learning Algorithms
Justin Fu
Aviral Kumar
Matthew Soh
Sergey Levine
OffRL
85
142
0
26 Feb 2019
Previous
123...424344
Next