ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Lipschitzness Is All You Need To Tame Off-policy Generative Adversarial
  Imitation Learning
Lipschitzness Is All You Need To Tame Off-policy Generative Adversarial Imitation Learning
Lionel Blondé
Pablo Strasser
Alexandros Kalousis
90
22
0
28 Jun 2020
DDPG++: Striving for Simplicity in Continuous-control Off-Policy
  Reinforcement Learning
DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
39
4
0
26 Jun 2020
SOAC: The Soft Option Actor-Critic Architecture
SOAC: The Soft Option Actor-Critic Architecture
Chenghao Li
Xiaoteng Ma
Chongjie Zhang
Jun Yang
L. Xia
Qianchuan Zhao
43
6
0
25 Jun 2020
Some approaches used to overcome overestimation in Deep Reinforcement
  Learning algorithms
Some approaches used to overcome overestimation in Deep Reinforcement Learning algorithms
Rafael Stekolshchik
OffRL
8
2
0
25 Jun 2020
Preventing Value Function Collapse in Ensemble {Q}-Learning by
  Maximizing Representation Diversity
Preventing Value Function Collapse in Ensemble {Q}-Learning by Maximizing Representation Diversity
Hassam Sheikh
Ladislau Bölöni
13
0
0
24 Jun 2020
Experience Replay with Likelihood-free Importance Weights
Experience Replay with Likelihood-free Importance Weights
Samarth Sinha
Jiaming Song
Animesh Garg
Stefano Ermon
OffRL
102
58
0
23 Jun 2020
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement
  Learning
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng Meng
R. Gorbet
Dana Kulic
OffRL
76
27
0
23 Jun 2020
Towards Tractable Optimism in Model-Based Reinforcement Learning
Towards Tractable Optimism in Model-Based Reinforcement Learning
Aldo Pacchiano
Philip J. Ball
Jack Parker-Holder
K. Choromanski
Stephen J. Roberts
OffRL
55
12
0
21 Jun 2020
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement
  Learning
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning
Tianren Zhang
Shangqi Guo
Tian Tan
Xiaolin Hu
Feng Chen
103
87
0
20 Jun 2020
Band-limited Soft Actor Critic Model
Band-limited Soft Actor Critic Model
Miguel Campo
Zhengxing Chen
Luke Kung
Kittipat Virochsiri
Jianyu Wang
32
6
0
19 Jun 2020
A Reinforcement Learning Approach for Transient Control of Liquid Rocket
  Engines
A Reinforcement Learning Approach for Transient Control of Liquid Rocket Engines
Günther Waxenegger-Wilfing
Kai Dresia
J. Deeken
M. Oschwald
45
17
0
19 Jun 2020
WD3: Taming the Estimation Bias in Deep Reinforcement Learning
WD3: Taming the Estimation Bias in Deep Reinforcement Learning
Qiang He
Xinwen Hou
OffRL
72
29
0
18 Jun 2020
Neural Ordinary Differential Equation Control of Dynamics on Graphs
Neural Ordinary Differential Equation Control of Dynamics on Graphs
Thomas Asikis
Lucas Böttcher
Nino Antulov-Fantulin
76
44
0
17 Jun 2020
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRLOnRL
146
619
0
16 Jun 2020
Model Embedding Model-Based Reinforcement Learning
Model Embedding Model-Based Reinforcement Learning
Xiao Tan
Chao Qu
Junwu Xiong
James Y. Zhang
OffRL
32
0
0
16 Jun 2020
Parameter-Based Value Functions
Parameter-Based Value Functions
Francesco Faccio
Louis Kirsch
Jürgen Schmidhuber
OffRL
93
26
0
16 Jun 2020
RL-CycleGAN: Reinforcement Learning Aware Simulation-To-Real
RL-CycleGAN: Reinforcement Learning Aware Simulation-To-Real
Kanishka Rao
Chris Harris
A. Irpan
Sergey Levine
Julian Ibarz
Mohi Khansari
130
191
0
16 Jun 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy
  Search and Planning
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
130
85
0
15 Jun 2020
Diversity Policy Gradient for Sample Efficient Quality-Diversity
  Optimization
Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization
Thomas Pierrot
Valentin Macé
Félix Chalumeau
Arthur Flajolet
Geoffrey Cideron
Karim Beguir
Antoine Cully
Olivier Sigaud
Nicolas Perrin-Gilbert
110
62
0
15 Jun 2020
Optimistic Distributionally Robust Policy Optimization
Optimistic Distributionally Robust Policy Optimization
Jun Song
Chaoyue Zhao
44
12
0
14 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative
  Exploration
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
75
19
0
14 Jun 2020
Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary
  Strategies
Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
Yunhao Tang
K. Choromanski
OffRL
38
14
0
13 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
108
24
0
12 Jun 2020
Zeroth-order Deterministic Policy Gradient
Zeroth-order Deterministic Policy Gradient
Harshat Kumar
Dionysios S. Kalogerias
George J. Pappas
Alejandro Ribeiro
OffRL
33
14
0
12 Jun 2020
Decorrelated Double Q-learning
Decorrelated Double Q-learning
Gang Chen
33
2
0
12 Jun 2020
From proprioception to long-horizon planning in novel environments: A
  hierarchical RL model
From proprioception to long-horizon planning in novel environments: A hierarchical RL model
Nishad Gothoskar
Miguel Lázaro-Gredilla
Dileep George
33
0
0
11 Jun 2020
Zeroth-Order Supervised Policy Improvement
Zeroth-Order Supervised Policy Improvement
Hao Sun
Ziping Xu
Yuhang Song
Meng Fang
Jiechao Xiong
Bo Dai
Bolei Zhou
OffRL
59
9
0
11 Jun 2020
AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation
AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation
Jae Hyun Lim
Aaron Courville
C. Pal
Chin-Wei Huang
DRL
71
23
0
09 Jun 2020
Conservative Q-Learning for Offline Reinforcement Learning
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRLOnRL
150
1,838
0
08 Jun 2020
Primal Wasserstein Imitation Learning
Primal Wasserstein Imitation Learning
Robert Dadashi
Léonard Hussenot
Matthieu Geist
Olivier Pietquin
102
129
0
08 Jun 2020
Refined Continuous Control of DDPG Actors via Parametrised Activation
Refined Continuous Control of DDPG Actors via Parametrised Activation
M. Hossny
Julie Iskander
Mohammed Attia
Khaled Saleh
39
7
0
04 Jun 2020
A Maximum Mutual Information Framework for Multi-Agent Reinforcement
  Learning
A Maximum Mutual Information Framework for Multi-Agent Reinforcement Learning
Woojun Kim
Whiyoung Jung
Myungsik Cho
Y. Sung
50
13
0
04 Jun 2020
Diversity Actor-Critic: Sample-Aware Entropy Regularization for
  Sample-Efficient Exploration
Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration
Seungyul Han
Y. Sung
45
26
0
02 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
143
226
0
01 Jun 2020
Distributed Voltage Regulation of Active Distribution System Based on
  Enhanced Multi-agent Deep Reinforcement Learning
Distributed Voltage Regulation of Active Distribution System Based on Enhanced Multi-agent Deep Reinforcement Learning
Di Cao
Junbo Zhao
Weihao Hu
F. Ding
Qi Huang
Zhe Chen
38
11
0
31 May 2020
MM-KTD: Multiple Model Kalman Temporal Differences for Reinforcement
  Learning
MM-KTD: Multiple Model Kalman Temporal Differences for Reinforcement Learning
Parvin Malekzadeh
Mohammad Salimibeni
Arash Mohammadi
A. Assa
Konstantinos N. Plataniotis
OffRL
37
12
0
30 May 2020
Sim2Real for Peg-Hole Insertion with Eye-in-Hand Camera
Sim2Real for Peg-Hole Insertion with Eye-in-Hand Camera
Damian Bogunowicz
A. Rybnikov
Komal Vendidandi
Fedor Chervinskii
133
7
0
29 May 2020
MOPO: Model-based Offline Policy Optimization
MOPO: Model-based Offline Policy Optimization
Tianhe Yu
G. Thomas
Lantao Yu
Stefano Ermon
James Zou
Sergey Levine
Chelsea Finn
Tengyu Ma
OffRL
98
775
0
27 May 2020
A reinforcement learning approach to rare trajectory sampling
A reinforcement learning approach to rare trajectory sampling
Dominic C. Rose
Jamie F. Mair
J. P. Garrahan
78
52
0
26 May 2020
Gradient Monitored Reinforcement Learning
Gradient Monitored Reinforcement Learning
Mohammed Sharafath Abdul Hameed
Gavneet Singh Chadha
Andreas Schwung
S. Ding
90
11
0
25 May 2020
LEAF: Latent Exploration Along the Frontier
LEAF: Latent Exploration Along the Frontier
Homanga Bharadhwaj
Animesh Garg
Florian Shkurti
55
1
0
21 May 2020
Novel Policy Seeking with Constrained Optimization
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
134
13
0
21 May 2020
Mirror Descent Policy Optimization
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
136
87
0
20 May 2020
Entropy-Augmented Entropy-Regularized Reinforcement Learning and a
  Continuous Path from Policy Gradient to Q-Learning
Entropy-Augmented Entropy-Regularized Reinforcement Learning and a Continuous Path from Policy Gradient to Q-Learning
Donghoon Lee
9
0
0
18 May 2020
Model-Augmented Actor-Critic: Backpropagating through Paths
Model-Augmented Actor-Critic: Backpropagating through Paths
I. Clavera
Yao Fu
Pieter Abbeel
91
88
0
16 May 2020
MOReL : Model-Based Offline Reinforcement Learning
MOReL : Model-Based Offline Reinforcement Learning
Rahul Kidambi
Aravind Rajeswaran
Praneeth Netrapalli
Thorsten Joachims
OffRL
117
678
0
12 May 2020
Smooth Exploration for Robotic Reinforcement Learning
Smooth Exploration for Robotic Reinforcement Learning
Antonin Raffin
Jens Kober
F. Stulp
81
58
0
12 May 2020
Controlling Overestimation Bias with Truncated Mixture of Continuous
  Distributional Quantile Critics
Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics
Arsenii Kuznetsov
Pavel Shvechikov
Alexander Grishin
Dmitry Vetrov
246
197
0
08 May 2020
Off-Policy Adversarial Inverse Reinforcement Learning
Off-Policy Adversarial Inverse Reinforcement Learning
Samin Yeasar Arnob
31
12
0
03 May 2020
Deep Reinforcement Learning for Intelligent Transportation Systems: A
  Survey
Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey
Ammar Haydari
Y. Yilmaz
AI4TS
107
468
0
02 May 2020
Previous
123...383940...424344
Next