ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Resilient Load Restoration in Microgrids Considering Mobile Energy
  Storage Fleets: A Deep Reinforcement Learning Approach
Resilient Load Restoration in Microgrids Considering Mobile Energy Storage Fleets: A Deep Reinforcement Learning Approach
Shuhan Yao
Jiuxiang Gu
Peng Wang
Tianyang Zhao
Huajun Zhang
Xiaochuan Liu
26
38
0
06 Nov 2019
Learning to Manipulate Deformable Objects without Demonstrations
Learning to Manipulate Deformable Objects without Demonstrations
Yilin Wu
Wilson Yan
Thanard Kurutach
Lerrel Pinto
Pieter Abbeel
OffRL
74
202
0
29 Oct 2019
Neural Architecture Evolution in Deep Reinforcement Learning for
  Continuous Control
Neural Architecture Evolution in Deep Reinforcement Learning for Continuous Control
Jörg Franke
Gregor Koehler
Noor H. Awad
Frank Hutter
139
7
0
28 Oct 2019
Better Exploration with Optimistic Actor-Critic
Better Exploration with Optimistic Actor-Critic
K. Ciosek
Q. Vuong
R. Loftin
Katja Hofmann
77
156
0
28 Oct 2019
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement
  Learning
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen
Zijian Zhou
Ziyi Wang
Che Wang
Yanqiu Wu
George Andriopoulos
OffRL
107
124
0
27 Oct 2019
Case Study: Verifying the Safety of an Autonomous Racing Car with a
  Neural Network Controller
Case Study: Verifying the Safety of an Autonomous Racing Car with a Neural Network Controller
Radoslav Ivanov
Taylor J. Carpenter
James Weimer
Rajeev Alur
George J. Pappas
Insup Lee
78
82
0
24 Oct 2019
Collision Avoidance in Pedestrian-Rich Environments with Deep
  Reinforcement Learning
Collision Avoidance in Pedestrian-Rich Environments with Deep Reinforcement Learning
Michael Everett
Yu Fan Chen
Jonathan P. How
OffRL
131
176
0
24 Oct 2019
Contextual Imagined Goals for Self-Supervised Robotic Learning
Contextual Imagined Goals for Self-Supervised Robotic Learning
Ashvin Nair
Shikhar Bahl
Alexander Khazatsky
Vitchyr H. Pong
Glen Berseth
Sergey Levine
SSL
81
68
0
23 Oct 2019
Teach Biped Robots to Walk via Gait Principles and Reinforcement
  Learning with Adversarial Critics
Teach Biped Robots to Walk via Gait Principles and Reinforcement Learning with Adversarial Critics
Kuangen Zhang
Zhimin Hou
Clarence W. de Silva
Haoyong Yu
Chenglong Fu
31
8
0
22 Oct 2019
Regularization Matters in Policy Optimization
Regularization Matters in Policy Optimization
Zhuang Liu
Xuanlin Li
Bingyi Kang
Trevor Darrell
OffRL
72
33
0
21 Oct 2019
Soft Actor-Critic for Discrete Action Settings
Soft Actor-Critic for Discrete Action Settings
Petros Christodoulou
OffRL
166
304
0
16 Oct 2019
Regularizing Model-Based Planning with Energy-Based Models
Regularizing Model-Based Planning with Energy-Based Models
Rinu Boney
Arno Solin
Alexander Ilin
76
18
0
12 Oct 2019
Orchestrating the Development Lifecycle of Machine Learning-Based IoT
  Applications: A Taxonomy and Survey
Orchestrating the Development Lifecycle of Machine Learning-Based IoT Applications: A Taxonomy and Survey
Bin Qian
Jie Su
Z. Wen
D. N. Jha
Yinhao Li
...
Albert Y. Zomaya
Omer F. Rana
Lizhe Wang
Maciej Koutny
R. Ranjan
56
4
0
11 Oct 2019
Improving Generalization in Meta Reinforcement Learning using Learned
  Objectives
Improving Generalization in Meta Reinforcement Learning using Learned Objectives
Louis Kirsch
Sjoerd van Steenkiste
Jürgen Schmidhuber
OffRL
95
119
0
09 Oct 2019
Policy Optimization Through Approximate Importance Sampling
Policy Optimization Through Approximate Importance Sampling
Marcin Tomczak
Dongho Kim
Peter Vrancx
Kyungmin Kim
25
17
0
09 Oct 2019
Ctrl-Z: Recovering from Instability in Reinforcement Learning
Ctrl-Z: Recovering from Instability in Reinforcement Learning
Vibhavari Dasagi
Jake Bruce
T. Peynot
Jurgen Leitner
53
10
0
09 Oct 2019
Striving for Simplicity and Performance in Off-Policy DRL: Output
  Normalization and Non-Uniform Sampling
Striving for Simplicity and Performance in Off-Policy DRL: Output Normalization and Non-Uniform Sampling
Che Wang
Yanqiu Wu
Q. Vuong
George Andriopoulos
36
6
0
05 Oct 2019
Benchmarking Batch Deep Reinforcement Learning Algorithms
Benchmarking Batch Deep Reinforcement Learning Algorithms
Shih-Han Chou
Wen-Yen Chang
W. Hsu
Jianlong Fu
OffRL
74
185
0
03 Oct 2019
Reducing Overestimation Bias in Multi-Agent Domains Using Double
  Centralized Critics
Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics
J. Ackermann
Pau Cebrian
Antonio Espinosa
Masashi Sugiyama
OffRL
64
122
0
03 Oct 2019
Never Worse, Mostly Better: Stable Policy Improvement in Deep
  Reinforcement Learning
Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement Learning
P. Khanna
Guy Tennenholtz
Nadav Merlis
Shie Mannor
Chen Tessler
OffRL
26
1
0
02 Oct 2019
Improving Sample Efficiency in Model-Free Reinforcement Learning from
  Images
Improving Sample Efficiency in Model-Free Reinforcement Learning from Images
Denis Yarats
Amy Zhang
Ilya Kostrikov
Brandon Amos
Joelle Pineau
Rob Fergus
DRL
120
449
0
02 Oct 2019
Advantage-Weighted Regression: Simple and Scalable Off-Policy
  Reinforcement Learning
Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Xue Bin Peng
Aviral Kumar
Grace Zhang
Sergey Levine
OffRL
165
570
0
01 Oct 2019
Meta-Q-Learning
Meta-Q-Learning
Rasool Fakoor
Pratik Chaudhari
Stefano Soatto
Alex Smola
OffRL
95
148
0
30 Sep 2019
Dynamic Interaction-Aware Scene Understanding for Reinforcement Learning
  in Autonomous Driving
Dynamic Interaction-Aware Scene Understanding for Reinforcement Learning in Autonomous Driving
M. Huegle
Gabriel Kalweit
M. Werling
Joschka Boedecker
3DPC
55
37
0
30 Sep 2019
Composite Q-learning: Multi-scale Q-function Decomposition and Separable
  Optimization
Composite Q-learning: Multi-scale Q-function Decomposition and Separable Optimization
Gabriel Kalweit
M. Huegle
Joschka Boedecker
OffRL
36
5
0
30 Sep 2019
CAQL: Continuous Action Q-Learning
CAQL: Continuous Action Q-Learning
Moonkyung Ryu
Yinlam Chow
Ross Anderson
Christian Tjandraatmadja
Craig Boutilier
280
43
0
26 Sep 2019
RLBench: The Robot Learning Benchmark & Learning Environment
RLBench: The Robot Learning Benchmark & Learning Environment
Stephen James
Z. Ma
David Rovick Arrojo
Andrew J. Davison
SSLVLMOffRL
120
563
0
26 Sep 2019
Model Imitation for Model-Based Reinforcement Learning
Model Imitation for Model-Based Reinforcement Learning
Yueh-hua Wu
Ting-Han Fan
Peter J. Ramadge
H. Su
OffRL
46
16
0
25 Sep 2019
Multi-task Batch Reinforcement Learning with Metric Learning
Multi-task Batch Reinforcement Learning with Metric Learning
Jiachen Li
Q. Vuong
Shuang Liu
Minghua Liu
K. Ciosek
George Andriopoulos
Henrik I. Christensen
H. Su
OffRL
65
2
0
25 Sep 2019
Residual Reactive Navigation: Combining Classical and Learned Navigation
  Strategies For Deployment in Unknown Environments
Residual Reactive Navigation: Combining Classical and Learned Navigation Strategies For Deployment in Unknown Environments
Krishan Rana
Ben Talbot
Vibhavari Dasagi
Michael Milford
Niko Sünderhauf
90
22
0
24 Sep 2019
Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning?
Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning?
Ofir Nachum
Haoran Tang
Xingyu Lu
S. Gu
Honglak Lee
Sergey Levine
71
102
0
23 Sep 2019
MDP Playground: An Analysis and Debug Testbed for Reinforcement Learning
MDP Playground: An Analysis and Debug Testbed for Reinforcement Learning
Raghunandan Rajan
Jessica Lizeth Borja Diaz
Suresh Guttikonda
Fabio Ferreira
André Biedenkapp
Jan Ole von Hartz
Frank Hutter
141
4
0
17 Sep 2019
Biased Estimates of Advantages over Path Ensembles
Biased Estimates of Advantages over Path Ensembles
Lanxin Lei
Zhizhong Li
Dahua Lin
OffRL
21
0
0
15 Sep 2019
Deep Learned Path Planning via Randomized Reward-Linked-Goals and
  Potential Space Applications
Deep Learned Path Planning via Randomized Reward-Linked-Goals and Potential Space Applications
Tamir Blum
William Jones
Kazuya Yoshida
34
8
0
13 Sep 2019
Mutual-Information Regularization in Markov Decision Processes and
  Actor-Critic Learning
Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning
Felix Leibfried
Jordi Grau-Moya
74
22
0
11 Sep 2019
AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an
  Ensemble of Suboptimal Teachers
AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers
Andrey Kurenkov
Ajay Mandlekar
R. M. Martin
Silvio Savarese
Animesh Garg
61
48
0
09 Sep 2019
Bi-level Actor-Critic for Multi-agent Coordination
Bi-level Actor-Critic for Multi-agent Coordination
Haifeng Zhang
Weizhe Chen
Zeren Huang
Minne Li
Yaodong Yang
Weinan Zhang
Jun Wang
185
93
0
08 Sep 2019
Regularized Anderson Acceleration for Off-Policy Deep Reinforcement
  Learning
Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning
Wenjie Shi
Shiji Song
Hui Wu
Yachu Hsu
Cheng Wu
Gao Huang
31
26
0
07 Sep 2019
rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch
rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch
Adam Stooke
Pieter Abbeel
OffRL
77
98
0
03 Sep 2019
Approximating two value functions instead of one: towards characterizing
  a new family of Deep Reinforcement Learning algorithms
Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning algorithms
M. Sabatelli
Gilles Louppe
Pierre Geurts
M. Wiering
OffRL
8
0
0
01 Sep 2019
Reinforcement learning with world model
Reinforcement learning with world model
Jingbin Liu
Xinyang Gu
Shuai Liu
31
0
0
30 Aug 2019
Dynamics-aware Embeddings
Dynamics-aware Embeddings
William F. Whitney
Rajat Agarwal
Kyunghyun Cho
Abhinav Gupta
SSL
78
53
0
25 Aug 2019
A Comparison of Action Spaces for Learning Manipulation Tasks
A Comparison of Action Spaces for Learning Manipulation Tasks
Patrick Varin
Lev Grossman
S. Kuindersma
62
34
0
23 Aug 2019
From Crystallized Adaptivity to Fluid Adaptivity in Deep Reinforcement
  Learning -- Insights from Biological Systems on Adaptive Flexibility
From Crystallized Adaptivity to Fluid Adaptivity in Deep Reinforcement Learning -- Insights from Biological Systems on Adaptive Flexibility
M. Schilling
Helge J. Ritter
F. Ohl
AI4CE
41
4
0
13 Aug 2019
A review on Deep Reinforcement Learning for Fluid Mechanics
A review on Deep Reinforcement Learning for Fluid Mechanics
Paul Garnier
J. Viquerat
Jean Rabault
A. Larcher
A. Kuhnle
E. Hachem
AI4CE
73
264
0
12 Aug 2019
A Review of Cooperative Multi-Agent Deep Reinforcement Learning
A Review of Cooperative Multi-Agent Deep Reinforcement Learning
Afshin Oroojlooyjadid
Davood Hajinezhad
120
436
0
11 Aug 2019
A Unified Bellman Optimality Principle Combining Reward Maximization and
  Empowerment
A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment
Felix Leibfried
Sergio Pascual-Diaz
Jordi Grau-Moya
110
29
0
26 Jul 2019
Dynamic Input for Deep Reinforcement Learning in Autonomous Driving
Dynamic Input for Deep Reinforcement Learning in Autonomous Driving
Maria Hügle
Gabriel Kalweit
Branka Mirchevska
M. Werling
Joschka Boedecker
58
60
0
25 Jul 2019
Deep Reinforcement Learning for Autonomous Internet of Things: Model,
  Applications and Challenges
Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges
Lei Lei
Yue Tan
Kan Zheng
Shiwen Liu
K. Zheng
Xuemin Shen
Shen
OffRL
81
204
0
22 Jul 2019
Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill
  Discovery
Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery
Kristian Hartikainen
Xinyang Geng
Tuomas Haarnoja
Sergey Levine
SSL
99
82
0
18 Jul 2019
Previous
123...41424344
Next