ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Feature-Based Interpretable Reinforcement Learning based on
  State-Transition Models
Feature-Based Interpretable Reinforcement Learning based on State-Transition Models
Omid Davoodi
Majid Komeili
FAttOffRL
61
6
0
14 May 2021
Principled Exploration via Optimistic Bootstrapping and Backward
  Induction
Principled Exploration via Optimistic Bootstrapping and Backward Induction
Chenjia Bai
Lingxiao Wang
Lei Han
Jianye Hao
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
65
39
0
13 May 2021
Composable Energy Policies for Reactive Motion Generation and
  Reinforcement Learning
Composable Energy Policies for Reactive Motion Generation and Reinforcement Learning
Julen Urain
Anqi Li
Puze Liu
Carlo DÉramo
Jan Peters
91
26
0
11 May 2021
Efficient Self-Supervised Data Collection for Offline Robot Learning
Efficient Self-Supervised Data Collection for Offline Robot Learning
Shadi Endrawis
Gal Leibovich
Guy Jacob
Gal Novik
Aviv Tamar
SSLOffRL
73
9
0
10 May 2021
AoI-Aware Resource Allocation for Platoon-Based C-V2X Networks via
  Multi-Agent Multi-Task Reinforcement Learning
AoI-Aware Resource Allocation for Platoon-Based C-V2X Networks via Multi-Agent Multi-Task Reinforcement Learning
Mohammad Parvini
M. Javan
Nader Mokari
B. Abbasi
Eduard Axel Jorswieck
OffRL
75
56
0
10 May 2021
Generative Actor-Critic: An Off-policy Algorithm Using the Push-forward Model
Lingwei Peng
Hui Qian
Zhebang Shen
Chao Zhang
Fei Li
68
2
0
08 May 2021
Scalable, Decentralized Multi-Agent Reinforcement Learning Methods
  Inspired by Stigmergy and Ant Colonies
Scalable, Decentralized Multi-Agent Reinforcement Learning Methods Inspired by Stigmergy and Ant Colonies
Austin Nguyen
35
1
0
08 May 2021
Benchmarking Structured Policies and Policy Optimization for Real-World
  Dexterous Object Manipulation
Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation
Niklas Funk
Charles B. Schaff
Rishabh Madan
Takuma Yoneda
Julen Urain De Jesus
...
Stefan Bauer
S. Srinivasa
Tapomayukh Bhattacharjee
Matthew R. Walter
Jan Peters
103
35
0
05 May 2021
Robotic Surgery With Lean Reinforcement Learning
Robotic Surgery With Lean Reinforcement Learning
Yotam Barnoy
Molly O'Brien
Wenjie Wang
Gregory D. Hager
OffRL
74
21
0
03 May 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency
  Inference
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
48
8
0
03 May 2021
Action Candidate Based Clipped Double Q-learning for Discrete and
  Continuous Action Tasks
Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Action Tasks
Qianliang Wu
Jin Xie
Jian Yang
OffRL
13
15
0
03 May 2021
Controlling earthquake-like instabilities using artificial intelligence
Controlling earthquake-like instabilities using artificial intelligence
E. Papachristos
I. Stefanou
AI4CE
25
6
0
27 Apr 2021
Computational Performance of Deep Reinforcement Learning to find Nash
  Equilibria
Computational Performance of Deep Reinforcement Learning to find Nash Equilibria
C. Graf
Viktor Zobernig
Johannes Schmidt
Claude Klöckl
23
5
0
26 Apr 2021
Efficient Hyperparameter Optimization for Physics-based Character
  Animation
Efficient Hyperparameter Optimization for Physics-based Character Animation
Zeshi Yang
Zhiqi Yin
AI4CE
89
9
0
26 Apr 2021
MetricOpt: Learning to Optimize Black-Box Evaluation Metrics
MetricOpt: Learning to Optimize Black-Box Evaluation Metrics
Chen Huang
Shuangfei Zhai
Pengsheng Guo
J. Susskind
97
12
0
21 Apr 2021
Outcome-Driven Reinforcement Learning via Variational Inference
Outcome-Driven Reinforcement Learning via Variational Inference
Tim G. J. Rudner
Vitchyr H. Pong
R. McAllister
Y. Gal
Sergey Levine
93
21
0
20 Apr 2021
Model-predictive control and reinforcement learning in multi-energy
  system case studies
Model-predictive control and reinforcement learning in multi-energy system case studies
Glenn Ceusters
Román Cantú Rodríguez
A. García
R. Franke
Geert Deconinck
L. Helsen
Ann Nowé
M. Messagie
L. R. Camargo
55
90
0
20 Apr 2021
Reinforced Neighborhood Selection Guided Multi-Relational Graph Neural
  Networks
Reinforced Neighborhood Selection Guided Multi-Relational Graph Neural Networks
Hao Peng
Ruitong Zhang
Yingtong Dou
Renyu Yang
Jingyi Zhang
Philip S. Yu
140
118
0
16 Apr 2021
Human-in-the-Loop Deep Reinforcement Learning with Application to
  Autonomous Driving
Human-in-the-Loop Deep Reinforcement Learning with Application to Autonomous Driving
Jingda Wu
Zhiyu Huang
Chao Huang
Zhongxu Hu
Peng Hang
Yang Xing
Chen Lv
92
42
0
15 Apr 2021
Safe Continuous Control with Constrained Model-Based Policy Optimization
Safe Continuous Control with Constrained Model-Based Policy Optimization
Moritz A. Zanger
Karam Daaboul
J. Marius Zöllner
80
19
0
14 Apr 2021
Reward function shape exploration in adversarial imitation learning: an
  empirical study
Reward function shape exploration in adversarial imitation learning: an empirical study
Yawei Wang
Xiu Li
31
4
0
14 Apr 2021
Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent
  Reinforcement Learning
Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement Learning
Yuan Pu
Shaochen Wang
Rui Yang
Xin Yao
Bin Li
80
19
0
14 Apr 2021
GAN-Based Interactive Reinforcement Learning from Demonstration and
  Human Evaluative Feedback
GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative Feedback
Jie Huang
Rongshun Juan
R. Gomez
Keisuke Nakamura
Q. Sha
Bo He
Guangliang Li
73
10
0
14 Apr 2021
TAAC: Temporally Abstract Actor-Critic for Continuous Control
TAAC: Temporally Abstract Actor-Critic for Continuous Control
Haonan Yu
Wei Xu
Haichao Zhang
OffRL
56
21
0
13 Apr 2021
Muesli: Combining Improvements in Policy Optimization
Muesli: Combining Improvements in Policy Optimization
Matteo Hessel
Ivo Danihelka
Fabio Viola
A. Guez
Simon Schmitt
Laurent Sifre
T. Weber
David Silver
H. V. Hasselt
105
67
0
13 Apr 2021
Behavior-Guided Actor-Critic: Improving Exploration via Learning Policy
  Behavior Representation for Deep Reinforcement Learning
Behavior-Guided Actor-Critic: Improving Exploration via Learning Policy Behavior Representation for Deep Reinforcement Learning
Ammar Fayad
M. Ibrahim
BDL
47
3
0
09 Apr 2021
Learning to Reweight Imaginary Transitions for Model-Based Reinforcement
  Learning
Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning
Wenzhen Huang
Qiyue Yin
Junge Zhang
Kaiqi Huang
48
3
0
09 Apr 2021
PlasticineLab: A Soft-Body Manipulation Benchmark with Differentiable
  Physics
PlasticineLab: A Soft-Body Manipulation Benchmark with Differentiable Physics
Zhiao Huang
Yuanming Hu
Tao Du
Siyuan Zhou
Hao Su
J. Tenenbaum
Chuang Gan
AI4CE
85
133
0
07 Apr 2021
Tactile-RL for Insertion: Generalization to Objects of Unknown Geometry
Tactile-RL for Insertion: Generalization to Objects of Unknown Geometry
Siyuan Dong
Devesh K. Jha
Diego Romeres
Sangwoon Kim
D. Nikovski
Alberto Rodriguez
82
124
0
02 Apr 2021
SkiffOS: Minimal Cross-compiled Linux for Embedded Containers
SkiffOS: Minimal Cross-compiled Linux for Embedded Containers
Christian Stewart
198
54
0
31 Mar 2021
Co-Adaptation of Algorithmic and Implementational Innovations in
  Inference-based Deep Reinforcement Learning
Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning
Hiroki Furuta
Tadashi Kozuno
T. Matsushima
Y. Matsuo
S. Gu
120
14
0
31 Mar 2021
Benchmarks for Deep Off-Policy Evaluation
Benchmarks for Deep Off-Policy Evaluation
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELMOffRL
90
104
0
30 Mar 2021
Autonomous Overtaking in Gran Turismo Sport Using Curriculum
  Reinforcement Learning
Autonomous Overtaking in Gran Turismo Sport Using Curriculum Reinforcement Learning
Yunlong Song
HaoChih Lin
Elia Kaufmann
P. Duerr
Davide Scaramuzza
62
68
0
26 Mar 2021
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
John Mcleod
Hrvoje Stojić
Vincent Adam
Dongho Kim
Jordi Grau-Moya
Peter Vrancx
Felix Leibfried
OffRL
55
2
0
26 Mar 2021
A Meta-Reinforcement Learning Approach to Process Control
A Meta-Reinforcement Learning Approach to Process Control
Daniel G. McClement
Nathan P. Lawrence
Philip D. Loewen
M. Forbes
Johan U. Backstrom
R. Bhushan Gopaluni
40
8
0
25 Mar 2021
Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with
  Deep Reinforcement Learning
Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with Deep Reinforcement Learning
A. S. Morgan
Daljeet Nandha
Georgia Chalvatzaki
Carlo DÉramo
A. Dollar
Jan Peters
96
44
0
25 Mar 2021
Deep Reinforcement Learning for Mapless Navigation of a Hybrid Aerial
  Underwater Vehicle with Medium Transition
Deep Reinforcement Learning for Mapless Navigation of a Hybrid Aerial Underwater Vehicle with Medium Transition
Ricardo B. Grando
J. C. Jesus
V. A. Kich
A. H. Kolling
Nicolas P. Bortoluzzi
P. Pinheiro
A. A. Neto
Paulo L. J. Drews-Jr
69
39
0
23 Mar 2021
Policy Information Capacity: Information-Theoretic Measure for Task
  Complexity in Deep Reinforcement Learning
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning
Hiroki Furuta
T. Matsushima
Tadashi Kozuno
Y. Matsuo
Sergey Levine
Ofir Nachum
S. Gu
OffRL
55
14
0
23 Mar 2021
Replacing Rewards with Examples: Example-Based Policy Search via
  Recursive Classification
Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
133
50
0
23 Mar 2021
Regularized Softmax Deep Multi-Agent $Q$-Learning
Regularized Softmax Deep Multi-Agent QQQ-Learning
L. Pan
Tabish Rashid
Bei Peng
Longbo Huang
Shimon Whiteson
111
36
0
22 Mar 2021
A Self-adaptive SAC-PID Control Approach based on Reinforcement Learning
  for Mobile Robots
A Self-adaptive SAC-PID Control Approach based on Reinforcement Learning for Mobile Robots
Xinyi Yu
Yu Fan
Siyu Xu
L. Ou
55
32
0
19 Mar 2021
Maximum Entropy Reinforcement Learning with Mixture Policies
Maximum Entropy Reinforcement Learning with Mixture Policies
Nir Baram
Guy Tennenholtz
Shie Mannor
31
5
0
18 Mar 2021
Reward Signal Design for Autonomous Racing
Reward Signal Design for Autonomous Racing
Benjamin Evans
H. Engelbrecht
H. W. Jordaan
33
6
0
18 Mar 2021
Regularized Behavior Value Estimation
Regularized Behavior Value Estimation
Çağlar Gülçehre
Sergio Gomez Colmenarejo
Ziyun Wang
Jakub Sygnowski
T. Paine
Konrad Zolna
Yutian Chen
Matthew W. Hoffman
Razvan Pascanu
Nando de Freitas
OffRL
75
38
0
17 Mar 2021
Inclined Quadrotor Landing using Deep Reinforcement Learning
Inclined Quadrotor Landing using Deep Reinforcement Learning
Jacob E. Kooi
Robert Babuška
54
30
0
16 Mar 2021
Hierarchical Reinforcement Learning Framework for Stochastic Spaceflight
  Campaign Design
Hierarchical Reinforcement Learning Framework for Stochastic Spaceflight Campaign Design
Yuji Takubo
Hao Chen
K. Ho
27
13
0
16 Mar 2021
Goal-Driven Autonomous Exploration Through Deep Reinforcement Learning
Goal-Driven Autonomous Exploration Through Deep Reinforcement Learning
Reinis Cimurs
I. Suh
Jin Han Lee
94
88
0
12 Mar 2021
Discovering Diverse Solutions in Deep Reinforcement Learning by
  Maximizing State-Action-Based Mutual Information
Discovering Diverse Solutions in Deep Reinforcement Learning by Maximizing State-Action-Based Mutual Information
Takayuki Osa
Voot Tangkaratt
Masashi Sugiyama
73
33
0
12 Mar 2021
A Quadratic Actor Network for Model-Free Reinforcement Learning
A Quadratic Actor Network for Model-Free Reinforcement Learning
Matthias Weissenbacher
Yoshinobu Kawahara
25
0
0
11 Mar 2021
Robust High-speed Running for Quadruped Robots via Deep Reinforcement
  Learning
Robust High-speed Running for Quadruped Robots via Deep Reinforcement Learning
Guillaume Bellegarda
Yiyu Chen
Zhuochen Liu
Quan Nguyen
83
47
0
11 Mar 2021
Previous
123...333435...424344
Next