ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
Qingfeng Lan
Yangchen Pan
Alona Fyshe
Martha White
73
180
0
16 Feb 2020
Universal Value Density Estimation for Imitation Learning and
  Goal-Conditioned Reinforcement Learning
Universal Value Density Estimation for Imitation Learning and Goal-Conditioned Reinforcement Learning
Yannick Schroecker
Charles Isbell
OffRL
88
13
0
15 Feb 2020
Robust Reinforcement Learning via Adversarial training with Langevin
  Dynamics
Robust Reinforcement Learning via Adversarial training with Langevin Dynamics
Parameswaran Kamalaruban
Yu-ting Huang
Ya-Ping Hsieh
Paul Rolland
C. Shi
Volkan Cevher
103
61
0
14 Feb 2020
Discrete Action On-Policy Learning with Action-Value Critic
Discrete Action On-Policy Learning with Action-Value Critic
Yuguang Yue
Yunhao Tang
Mingzhang Yin
Mingyuan Yin
OffRL
78
5
0
10 Feb 2020
Reward Tweaking: Maximizing the Total Reward While Planning for Short
  Horizons
Reward Tweaking: Maximizing the Total Reward While Planning for Short Horizons
Chen Tessler
Shie Mannor
59
2
0
09 Feb 2020
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic
  with Advantage Weighted Mixture Policy(SAC-AWMP)
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)
Zhimin Hou
Kuangen Zhang
Yi Wan
Dongyu Li
Chenglong Fu
Haoyong Yu
103
15
0
07 Feb 2020
Deep Radial-Basis Value Functions for Continuous Control
Deep Radial-Basis Value Functions for Continuous Control
Kavosh Asadi
Neev Parikh
Ronald E. Parr
George Konidaris
Michael L. Littman
37
4
0
05 Feb 2020
Effective Diversity in Population Based Reinforcement Learning
Effective Diversity in Population Based Reinforcement Learning
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
128
165
0
03 Feb 2020
Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV
  based Random Access IoT Networks with NOMA
Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV based Random Access IoT Networks with NOMA
Sami Khairy
Prasanna Balaprakash
L. Cai
Y. Cheng
31
72
0
31 Jan 2020
GradientDICE: Rethinking Generalized Offline Estimation of Stationary
  Values
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Shangtong Zhang
Bo Liu
Shimon Whiteson
OffRL
104
103
0
29 Jan 2020
Interpretable End-to-end Urban Autonomous Driving with Latent Deep
  Reinforcement Learning
Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning
Jianyu Chen
Shengbo Eben Li
Masayoshi Tomizuka
139
243
0
23 Jan 2020
Augmenting GAIL with BC for sample efficient imitation learning
Augmenting GAIL with BC for sample efficient imitation learning
Rohit Jena
Changliu Liu
Katia Sycara
76
5
0
21 Jan 2020
Discriminator Soft Actor Critic without Extrinsic Rewards
Discriminator Soft Actor Critic without Extrinsic Rewards
Daichi Nishio
Daiki Kuyoshi
Toi Tsuneda
S. Yamane
OffRL
27
6
0
19 Jan 2020
Continuous-action Reinforcement Learning for Playing Racing Games:
  Comparing SPG to PPO
Continuous-action Reinforcement Learning for Playing Racing Games: Comparing SPG to PPO
Mario S. Holubar
M. Wiering
50
10
0
15 Jan 2020
Population-Guided Parallel Policy Search for Reinforcement Learning
Population-Guided Parallel Policy Search for Reinforcement Learning
Whiyoung Jung
Giseung Park
Y. Sung
OffRL
67
38
0
09 Jan 2020
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for
  Addressing Value Estimation Errors
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
82
183
0
09 Jan 2020
Self-guided Approximate Linear Programs
Self-guided Approximate Linear Programs
Parshan Pakiman
Selvaprabu Nadarajah
Negar Soheili
Qihang Lin
16
3
0
09 Jan 2020
Reinforcement Learning with Goal-Distance Gradient
Reinforcement Learning with Goal-Distance Gradient
Kai Jiang
X. Qin
21
0
0
01 Jan 2020
Reward-Conditioned Policies
Reward-Conditioned Policies
Aviral Kumar
Xue Bin Peng
Sergey Levine
71
96
0
31 Dec 2019
Learning to Combat Compounding-Error in Model-Based Reinforcement
  Learning
Learning to Combat Compounding-Error in Model-Based Reinforcement Learning
Chenjun Xiao
Yifan Wu
Chen Ma
Dale Schuurmans
Martin Müller
OffRL
78
44
0
24 Dec 2019
Direct and indirect reinforcement learning
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
58
34
0
23 Dec 2019
Soft Q Network
Soft Q Network
Jingbin Liu
Shuai Liu
Xinyang Gu
OffRL
51
2
0
20 Dec 2019
Coordination in Adversarial Sequential Team Games via Multi-Agent Deep
  Reinforcement Learning
Coordination in Adversarial Sequential Team Games via Multi-Agent Deep Reinforcement Learning
A. Celli
Marco Ciccone
Raffaele Bongo
N. Gatti
61
12
0
16 Dec 2019
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Shuai Lu
Shuai Han
Wenbo Zhou
Junwei Zhang
72
26
0
13 Dec 2019
Marginalized State Distribution Entropy Regularization in Policy
  Optimization
Marginalized State Distribution Entropy Regularization in Policy Optimization
Riashat Islam
Zafarali Ahmed
Doina Precup
51
17
0
11 Dec 2019
Entropy Regularization with Discounted Future State Distribution in
  Policy Gradient Methods
Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Riashat Islam
Raihan Seraj
Pierre-Luc Bacon
Doina Precup
50
8
0
11 Dec 2019
Measuring the Reliability of Reinforcement Learning Algorithms
Measuring the Reliability of Reinforcement Learning Algorithms
Stephanie C. Y. Chan
Sam Fishman
John F. Canny
Anoop Korattikara Balan
S. Guadarrama
74
84
0
10 Dec 2019
Efficient and Robust Reinforcement Learning with Uncertainty-based Value
  Expansion
Efficient and Robust Reinforcement Learning with Uncertainty-based Value Expansion
Bo Zhou
Hongsheng Zeng
Fan Wang
Yunxiang Li
Hao Tian
59
18
0
10 Dec 2019
Inter-Level Cooperation in Hierarchical Reinforcement Learning
Inter-Level Cooperation in Hierarchical Reinforcement Learning
Abdul Rahman Kreidieh
Yiling You
Nathan Lichtlé
Samyak Parajuli
Rayyan Nasr
Alexandre M. Bayen
113
14
0
05 Dec 2019
AlgaeDICE: Policy Gradient from Arbitrary Experience
AlgaeDICE: Policy Gradient from Arbitrary Experience
Ofir Nachum
Bo Dai
Ilya Kostrikov
Yinlam Chow
Lihong Li
Dale Schuurmans
OffRL
166
245
0
04 Dec 2019
Adaptive Online Planning for Continual Lifelong Learning
Adaptive Online Planning for Continual Lifelong Learning
Kevin Lu
Igor Mordatch
Pieter Abbeel
OffRLOnRLCLL
73
15
0
03 Dec 2019
Optimization for Reinforcement Learning: From Single Agent to
  Cooperative Agents
Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents
Dong-hwan Lee
Niao He
Parameswaran Kamalaruban
Volkan Cevher
55
89
0
01 Dec 2019
Distributed Soft Actor-Critic with Multivariate Reward Representation
  and Knowledge Distillation
Distributed Soft Actor-Critic with Multivariate Reward Representation and Knowledge Distillation
Dmitry Akimov
29
10
0
29 Nov 2019
A selected review on reinforcement learning based control for autonomous
  underwater vehicles
A selected review on reinforcement learning based control for autonomous underwater vehicles
Yachu Hsu
Hui Wu
Keyou You
Shiji Song
25
3
0
27 Nov 2019
Multi-Vehicle Mixed-Reality Reinforcement Learning for Autonomous
  Multi-Lane Driving
Multi-Vehicle Mixed-Reality Reinforcement Learning for Autonomous Multi-Lane Driving
Rupert Mitchell
Jenny Fletcher
Jacopo Panerati
Amanda Prorok
89
17
0
26 Nov 2019
The problem with DDPG: understanding failures in deterministic
  environments with sparse rewards
The problem with DDPG: understanding failures in deterministic environments with sparse rewards
Guillaume Matheron
Nicolas Perrin
Olivier Sigaud
52
67
0
26 Nov 2019
Behavior Regularized Offline Reinforcement Learning
Behavior Regularized Offline Reinforcement Learning
Yifan Wu
George Tucker
Ofir Nachum
OffRL
120
691
0
26 Nov 2019
A Deep Reinforcement Learning Architecture for Multi-stage Optimal
  Control
A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control
Yuguang Yang
21
1
0
25 Nov 2019
Merging Deterministic Policy Gradient Estimations with Varied
  Bias-Variance Tradeoff for Effective Deep Reinforcement Learning
Merging Deterministic Policy Gradient Estimations with Varied Bias-Variance Tradeoff for Effective Deep Reinforcement Learning
Gang Chen
58
4
0
24 Nov 2019
Which Channel to Ask My Question? Personalized Customer Service Request
  Stream Routing using Deep Reinforcement Learning
Which Channel to Ask My Question? Personalized Customer Service Request Stream Routing using Deep Reinforcement Learning
Zining Liu
Chong Long
Xiaolu Lu
Zehong Hu
Jie Zhang
Yafang Wang
30
9
0
24 Nov 2019
Accelerating Reinforcement Learning with Suboptimal Guidance
Accelerating Reinforcement Learning with Suboptimal Guidance
Eivind Bøhn
Signe Moe
T. Johansen
OnRL
36
0
0
21 Nov 2019
Evaluating task-agnostic exploration for fixed-batch learning of
  arbitrary future tasks
Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks
Vibhavari Dasagi
Robert Lee
Jake Bruce
Jurgen Leitner
OffRL
56
2
0
20 Nov 2019
Planning with Goal-Conditioned Policies
Planning with Goal-Conditioned Policies
Soroush Nasiriany
Vitchyr H. Pong
Steven Lin
Sergey Levine
OffRL
152
219
0
19 Nov 2019
Off-Policy Policy Gradient Algorithms by Constraining the State
  Distribution Shift
Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift
Riashat Islam
Komal K. Teru
Deepak Sharma
Joelle Pineau
OffRL
80
8
0
16 Nov 2019
Adaptive Leader-Follower Formation Control and Obstacle Avoidance via
  Deep Reinforcement Learning
Adaptive Leader-Follower Formation Control and Obstacle Avoidance via Deep Reinforcement Learning
Yanlin Zhou
F. Lu
George Pu
Xiyao Ma
Runhan Sun
Hsi-Yuan Chen
Xiaolin Li
D. Wu
102
19
0
15 Nov 2019
Learning Representations in Reinforcement Learning:An Information
  Bottleneck Approach
Learning Representations in Reinforcement Learning:An Information Bottleneck Approach
Yingjun Pei
Xinwen Hou
SSL
76
10
0
12 Nov 2019
Real-Time Reinforcement Learning
Real-Time Reinforcement Learning
Simon Ramstedt
C. Pal
AI4CE
96
63
0
11 Nov 2019
Multi-Path Policy Optimization
Multi-Path Policy Optimization
L. Pan
Qingpeng Cai
Longbo Huang
58
2
0
11 Nov 2019
Context-aware Active Multi-Step Reinforcement Learning
Context-aware Active Multi-Step Reinforcement Learning
Gang Chen
Dingcheng Li
Ran Xu
24
0
0
11 Nov 2019
MBCAL: Sample Efficient and Variance Reduced Reinforcement Learning for
  Recommender Systems
MBCAL: Sample Efficient and Variance Reduced Reinforcement Learning for Recommender Systems
Fan Wang
Xiaomin Fang
Lihang Liu
Hao Tian
Zhiming Peng
OffRL
35
0
0
06 Nov 2019
Previous
123...4041424344
Next