ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXivPDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 851 papers shown
Title
Adversarially Trained Actor Critic for Offline Reinforcement Learning
Adversarially Trained Actor Critic for Offline Reinforcement Learning
Ching-An Cheng
Tengyang Xie
Nan Jiang
Alekh Agarwal
OffRL
18
127
0
05 Feb 2022
A Temporal-Difference Approach to Policy Gradient Estimation
A Temporal-Difference Approach to Policy Gradient Estimation
Samuele Tosatto
Andrew Patterson
Martha White
A. R. Mahmood
OffRL
27
2
0
04 Feb 2022
Learning Interpretable, High-Performing Policies for Autonomous Driving
Learning Interpretable, High-Performing Policies for Autonomous Driving
Rohan R. Paleja
Yaru Niu
Andrew Silva
Chace Ritchie
Sugju Choi
Matthew C. Gombolay
32
16
0
04 Feb 2022
Federated Reinforcement Learning for Collective Navigation of Robotic
  Swarms
Federated Reinforcement Learning for Collective Navigation of Robotic Swarms
Seongin Na
Tomáš Rouček
Jiří Ulrich
Jan Pikman
T. Krajník
Barry Lennox
F. Arvin
34
34
0
02 Feb 2022
Tutorial on amortized optimization
Tutorial on amortized optimization
Brandon Amos
OffRL
78
43
0
01 Feb 2022
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal
  Point Processes
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes
Chao Qu
Jue Chen
Siqiao Xue
Xiaoming Shi
James Y. Zhang
Hongyuan Mei
OffRL
30
17
0
29 Jan 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement
  for Value Error
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
David Meger
Doina Precup
Ofir Nachum
S. Gu
30
32
0
28 Jan 2022
Can Wikipedia Help Offline Reinforcement Learning?
Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid
Yutaro Yamada
S. Gu
3DV
RALM
OffRL
140
95
0
28 Jan 2022
Generative Planning for Temporally Coordinated Exploration in
  Reinforcement Learning
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning
Haichao Zhang
Wei Xu
Haonan Yu
38
10
0
24 Jan 2022
Reinforcement Learning for Personalized Drug Discovery and Design for
  Complex Diseases: A Systems Pharmacology Perspective
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective
Ryan K. Tan
Yang Liu
Lei Xie
49
2
0
21 Jan 2022
A Prescriptive Dirichlet Power Allocation Policy with Deep Reinforcement
  Learning
A Prescriptive Dirichlet Power Allocation Policy with Deep Reinforcement Learning
Yuan Tian
Minghao Han
Chetan S. Kulkarni
Olga Fink
25
13
0
20 Jan 2022
Reinforcement Learning based Air Combat Maneuver Generation
Reinforcement Learning based Air Combat Maneuver Generation
Muhammed Murat Özbek
E. Koyuncu
11
4
0
14 Jan 2022
Offline Reinforcement Learning for Road Traffic Control
Offline Reinforcement Learning for Road Traffic Control
Mayuresh Kunjir
Sanjay Chawla
OffRL
32
4
0
07 Jan 2022
A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement
  Learning
A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning
Yuxing Wang
Tiantian Zhang
Yongzhe Chang
Bin Liang
Xueqian Wang
Bo Yuan
29
15
0
01 Jan 2022
Double Critic Deep Reinforcement Learning for Mapless 3D Navigation of
  Unmanned Aerial Vehicles
Double Critic Deep Reinforcement Learning for Mapless 3D Navigation of Unmanned Aerial Vehicles
Ricardo B. Grando
J. C. Jesus
V. A. Kich
A. H. Kolling
P. Drews
33
34
0
27 Dec 2021
Newsvendor Model with Deep Reinforcement Learning
Newsvendor Model with Deep Reinforcement Learning
Dylan K. Goetting
14
0
0
22 Dec 2021
Value Activation for Bias Alleviation: Generalized-activated Deep Double
  Deterministic Policy Gradients
Value Activation for Bias Alleviation: Generalized-activated Deep Double Deterministic Policy Gradients
Jiafei Lyu
Yu Yang
Jiangpeng Yan
Xiu Li
OffRL
AI4CE
39
5
0
21 Dec 2021
Variational Quantum Soft Actor-Critic
Variational Quantum Soft Actor-Critic
Qingfeng Lan
27
20
0
20 Dec 2021
Sample-Efficient Reinforcement Learning via Conservative Model-Based
  Actor-Critic
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
27
30
0
16 Dec 2021
Learning from Guided Play: A Scheduled Hierarchical Approach for
  Improving Exploration in Adversarial Imitation Learning
Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning
Trevor Ablett
Bryan Chan
Jonathan Kelly
37
4
0
16 Dec 2021
Flexible Option Learning
Flexible Option Learning
Martin Klissarov
Doina Precup
OffRL
41
26
0
06 Dec 2021
Coupling Vision and Proprioception for Navigation of Legged Robots
Coupling Vision and Proprioception for Navigation of Legged Robots
Zipeng Fu
Ashish Kumar
Ananye Agarwal
Haozhi Qi
Jitendra Malik
Deepak Pathak
21
73
0
03 Dec 2021
Learning a Robust Multiagent Driving Policy for Traffic Congestion
  Reduction
Learning a Robust Multiagent Driving Policy for Traffic Congestion Reduction
Yulin Zhang
William Macke
Jiaxun Cui
Daniel Urieli
Peter Stone
31
8
0
03 Dec 2021
SAGCI-System: Towards Sample-Efficient, Generalizable, Compositional,
  and Incremental Robot Learning
SAGCI-System: Towards Sample-Efficient, Generalizable, Compositional, and Incremental Robot Learning
Jun Lv
Qiaojun Yu
Lin Shao
Wenhai Liu
Wenqiang Xu
Cewu Lu
33
24
0
29 Nov 2021
Robot Skill Adaptation via Soft Actor-Critic Gaussian Mixture Models
Robot Skill Adaptation via Soft Actor-Critic Gaussian Mixture Models
Iman Nematollahi
Erick Rosete-Beas
Adrian Rofer
Tim Welschehold
Abhinav Valada
Wolfram Burgard
19
15
0
25 Nov 2021
Renewable energy integration and microgrid energy trading using
  multi-agent deep reinforcement learning
Renewable energy integration and microgrid energy trading using multi-agent deep reinforcement learning
Daniel J. B. Harrold
Jun Cao
Zhongbo Fan
39
61
0
21 Nov 2021
Aggressive Q-Learning with Ensembles: Achieving Both High Sample
  Efficiency and High Asymptotic Performance
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
Keith Ross
OffRL
17
9
0
17 Nov 2021
CleanRL: High-quality Single-file Implementations of Deep Reinforcement
  Learning Algorithms
CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms
Shengyi Huang
Rousslan Fernand Julien Dossa
Chang Ye
Jeff Braga
OffRL
16
0
0
16 Nov 2021
GRI: General Reinforced Imitation and its Application to Vision-Based
  Autonomous Driving
GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving
Raphael Chekroun
Marin Toromanoff
Sascha Hornauer
Fabien Moutarde
39
60
0
16 Nov 2021
Deep Reinforcement Learning with Shallow Controllers: An Experimental
  Application to PID Tuning
Deep Reinforcement Learning with Shallow Controllers: An Experimental Application to PID Tuning
Nathan P. Lawrence
M. Forbes
Philip D. Loewen
Daniel G. McClement
Johan U. Backstrom
R. Bhushan Gopaluni
OffRL
30
71
0
13 Nov 2021
Cooperative multi-agent reinforcement learning for high-dimensional
  nonequilibrium control
Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control
Shriram Chennakesavalu
Grant M. Rotskoff
16
1
0
12 Nov 2021
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
21
21
0
09 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL
GP
65
100
0
06 Nov 2021
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement
  Learning
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Sabela Ramos
Sertan Girgin
Léonard Hussenot
Damien Vincent
Hanna Yakubovich
...
Piotr Stańczyk
Raphaël Marinier
Jeremiah Harmsen
Olivier Pietquin
Nikola Momchev
OffRL
38
24
0
04 Nov 2021
Confidence Composition for Monitors of Verification Assumptions
Confidence Composition for Monitors of Verification Assumptions
I. Ruchkin
Matthew Cleaveland
Radoslav Ivanov
Pengyuan Lu
Taylor J. Carpenter
O. Sokolsky
Insup Lee
49
13
0
03 Nov 2021
Balanced Q-learning: Combining the Influence of Optimistic and
  Pessimistic Targets
Balanced Q-learning: Combining the Influence of Optimistic and Pessimistic Targets
Thommen George Karimpanal
Hung Le
Majid Abdolshah
Santu Rana
Sunil R. Gupta
T. Tran
Svetha Venkatesh
25
5
0
03 Nov 2021
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms
  via Batch Prioritized Experience Replay
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay
Dogan C. Cicek
Enes Duran
Baturay Saglam
Furkan B. Mutlu
Suleyman Serdar Kozat
OffRL
33
11
0
02 Nov 2021
Generalized Proximal Policy Optimization with Sample Reuse
Generalized Proximal Policy Optimization with Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
42
47
0
29 Oct 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
Tung M. Luu
Chang D. Yoo
23
8
0
28 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State
  Covering and Goal Reaching
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching
Pierre-Alexandre Kamienny
Jean Tarbouriech
Sylvain Lamprier
A. Lazaric
Ludovic Denoyer
SSL
45
18
0
27 Oct 2021
A Subgame Perfect Equilibrium Reinforcement Learning Approach to
  Time-inconsistent Problems
A Subgame Perfect Equilibrium Reinforcement Learning Approach to Time-inconsistent Problems
Nixie S. Lesmana
Chi Seng Pun
OffRL
29
4
0
27 Oct 2021
Learning Domain Invariant Representations in Goal-conditioned Block MDPs
Learning Domain Invariant Representations in Goal-conditioned Block MDPs
Beining Han
Chongyi Zheng
Harris Chan
Keiran Paster
Michael Ruogu Zhang
Jimmy Ba
OOD
AI4CE
20
13
0
27 Oct 2021
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement
  Learning
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning
Junsup Kim
Younggyo Seo
Jinwoo Shin
22
58
0
26 Oct 2021
Automating Control of Overestimation Bias for Reinforcement Learning
Automating Control of Overestimation Bias for Reinforcement Learning
Arsenii Kuznetsov
Alexander Grishin
Artem Tsypin
Arsenii Ashukha
Artur Kadurin
Dmitry Vetrov
OffRL
8
2
0
26 Oct 2021
Learning Insertion Primitives with Discrete-Continuous Hybrid Action
  Space for Robotic Assembly Tasks
Learning Insertion Primitives with Discrete-Continuous Hybrid Action Space for Robotic Assembly Tasks
Yongyu Wang
Shiyu Jin
Changhao Wang
Xinghao Zhu
Masayoshi Tomizuka
34
42
0
25 Oct 2021
False Correlation Reduction for Offline Reinforcement Learning
False Correlation Reduction for Offline Reinforcement Learning
Arvindkumar Krishnakumar
Zuyue Fu
Lingxiao Wang
Zhuoran Yang
Chenjia Bai
Tianyi Zhou
Judy Hoffman
Jing Jiang
OffRL
39
9
0
24 Oct 2021
Efficient Robotic Manipulation Through Offline-to-Online Reinforcement
  Learning and Goal-Aware State Information
Efficient Robotic Manipulation Through Offline-to-Online Reinforcement Learning and Goal-Aware State Information
Jin Li
Xianyuan Zhan
Zixu Xiao
Guyue Zhou
OffRL
OnRL
29
2
0
21 Oct 2021
Continuous Control with Action Quantization from Demonstrations
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
M. Geist
Olivier Pietquin
OffRL
33
23
0
19 Oct 2021
Offline Reinforcement Learning with Soft Behavior Regularization
Offline Reinforcement Learning with Soft Behavior Regularization
Haoran Xu
Xianyuan Zhan
Jianxiong Li
Honglei Yin
OffRL
31
31
0
14 Oct 2021
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
852
0
12 Oct 2021
Previous
123...101112...161718
Next