Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
Qingfeng Lan
Yangchen Pan
Alona Fyshe
Martha White
73
180
0
16 Feb 2020
Universal Value Density Estimation for Imitation Learning and Goal-Conditioned Reinforcement Learning
Yannick Schroecker
Charles Isbell
OffRL
88
13
0
15 Feb 2020
Robust Reinforcement Learning via Adversarial training with Langevin Dynamics
Parameswaran Kamalaruban
Yu-ting Huang
Ya-Ping Hsieh
Paul Rolland
C. Shi
Volkan Cevher
103
61
0
14 Feb 2020
Discrete Action On-Policy Learning with Action-Value Critic
Yuguang Yue
Yunhao Tang
Mingzhang Yin
Mingyuan Yin
OffRL
78
5
0
10 Feb 2020
Reward Tweaking: Maximizing the Total Reward While Planning for Short Horizons
Chen Tessler
Shie Mannor
59
2
0
09 Feb 2020
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)
Zhimin Hou
Kuangen Zhang
Yi Wan
Dongyu Li
Chenglong Fu
Haoyong Yu
103
15
0
07 Feb 2020
Deep Radial-Basis Value Functions for Continuous Control
Kavosh Asadi
Neev Parikh
Ronald E. Parr
George Konidaris
Michael L. Littman
37
4
0
05 Feb 2020
Effective Diversity in Population Based Reinforcement Learning
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
128
165
0
03 Feb 2020
Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV based Random Access IoT Networks with NOMA
Sami Khairy
Prasanna Balaprakash
L. Cai
Y. Cheng
31
72
0
31 Jan 2020
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Shangtong Zhang
Bo Liu
Shimon Whiteson
OffRL
104
103
0
29 Jan 2020
Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning
Jianyu Chen
Shengbo Eben Li
Masayoshi Tomizuka
139
243
0
23 Jan 2020
Augmenting GAIL with BC for sample efficient imitation learning
Rohit Jena
Changliu Liu
Katia Sycara
76
5
0
21 Jan 2020
Discriminator Soft Actor Critic without Extrinsic Rewards
Daichi Nishio
Daiki Kuyoshi
Toi Tsuneda
S. Yamane
OffRL
27
6
0
19 Jan 2020
Continuous-action Reinforcement Learning for Playing Racing Games: Comparing SPG to PPO
Mario S. Holubar
M. Wiering
50
10
0
15 Jan 2020
Population-Guided Parallel Policy Search for Reinforcement Learning
Whiyoung Jung
Giseung Park
Y. Sung
OffRL
67
38
0
09 Jan 2020
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
82
183
0
09 Jan 2020
Self-guided Approximate Linear Programs
Parshan Pakiman
Selvaprabu Nadarajah
Negar Soheili
Qihang Lin
16
3
0
09 Jan 2020
Reinforcement Learning with Goal-Distance Gradient
Kai Jiang
X. Qin
21
0
0
01 Jan 2020
Reward-Conditioned Policies
Aviral Kumar
Xue Bin Peng
Sergey Levine
71
96
0
31 Dec 2019
Learning to Combat Compounding-Error in Model-Based Reinforcement Learning
Chenjun Xiao
Yifan Wu
Chen Ma
Dale Schuurmans
Martin Müller
OffRL
78
44
0
24 Dec 2019
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
58
34
0
23 Dec 2019
Soft Q Network
Jingbin Liu
Shuai Liu
Xinyang Gu
OffRL
51
2
0
20 Dec 2019
Coordination in Adversarial Sequential Team Games via Multi-Agent Deep Reinforcement Learning
A. Celli
Marco Ciccone
Raffaele Bongo
N. Gatti
61
12
0
16 Dec 2019
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Shuai Lu
Shuai Han
Wenbo Zhou
Junwei Zhang
72
26
0
13 Dec 2019
Marginalized State Distribution Entropy Regularization in Policy Optimization
Riashat Islam
Zafarali Ahmed
Doina Precup
51
17
0
11 Dec 2019
Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Riashat Islam
Raihan Seraj
Pierre-Luc Bacon
Doina Precup
50
8
0
11 Dec 2019
Measuring the Reliability of Reinforcement Learning Algorithms
Stephanie C. Y. Chan
Sam Fishman
John F. Canny
Anoop Korattikara Balan
S. Guadarrama
74
84
0
10 Dec 2019
Efficient and Robust Reinforcement Learning with Uncertainty-based Value Expansion
Bo Zhou
Hongsheng Zeng
Fan Wang
Yunxiang Li
Hao Tian
59
18
0
10 Dec 2019
Inter-Level Cooperation in Hierarchical Reinforcement Learning
Abdul Rahman Kreidieh
Yiling You
Nathan Lichtlé
Samyak Parajuli
Rayyan Nasr
Alexandre M. Bayen
113
14
0
05 Dec 2019
AlgaeDICE: Policy Gradient from Arbitrary Experience
Ofir Nachum
Bo Dai
Ilya Kostrikov
Yinlam Chow
Lihong Li
Dale Schuurmans
OffRL
166
245
0
04 Dec 2019
Adaptive Online Planning for Continual Lifelong Learning
Kevin Lu
Igor Mordatch
Pieter Abbeel
OffRL
OnRL
CLL
73
15
0
03 Dec 2019
Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents
Dong-hwan Lee
Niao He
Parameswaran Kamalaruban
Volkan Cevher
55
89
0
01 Dec 2019
Distributed Soft Actor-Critic with Multivariate Reward Representation and Knowledge Distillation
Dmitry Akimov
29
10
0
29 Nov 2019
A selected review on reinforcement learning based control for autonomous underwater vehicles
Yachu Hsu
Hui Wu
Keyou You
Shiji Song
25
3
0
27 Nov 2019
Multi-Vehicle Mixed-Reality Reinforcement Learning for Autonomous Multi-Lane Driving
Rupert Mitchell
Jenny Fletcher
Jacopo Panerati
Amanda Prorok
89
17
0
26 Nov 2019
The problem with DDPG: understanding failures in deterministic environments with sparse rewards
Guillaume Matheron
Nicolas Perrin
Olivier Sigaud
52
67
0
26 Nov 2019
Behavior Regularized Offline Reinforcement Learning
Yifan Wu
George Tucker
Ofir Nachum
OffRL
120
691
0
26 Nov 2019
A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control
Yuguang Yang
21
1
0
25 Nov 2019
Merging Deterministic Policy Gradient Estimations with Varied Bias-Variance Tradeoff for Effective Deep Reinforcement Learning
Gang Chen
58
4
0
24 Nov 2019
Which Channel to Ask My Question? Personalized Customer Service Request Stream Routing using Deep Reinforcement Learning
Zining Liu
Chong Long
Xiaolu Lu
Zehong Hu
Jie Zhang
Yafang Wang
30
9
0
24 Nov 2019
Accelerating Reinforcement Learning with Suboptimal Guidance
Eivind Bøhn
Signe Moe
T. Johansen
OnRL
36
0
0
21 Nov 2019
Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks
Vibhavari Dasagi
Robert Lee
Jake Bruce
Jurgen Leitner
OffRL
56
2
0
20 Nov 2019
Planning with Goal-Conditioned Policies
Soroush Nasiriany
Vitchyr H. Pong
Steven Lin
Sergey Levine
OffRL
152
219
0
19 Nov 2019
Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift
Riashat Islam
Komal K. Teru
Deepak Sharma
Joelle Pineau
OffRL
80
8
0
16 Nov 2019
Adaptive Leader-Follower Formation Control and Obstacle Avoidance via Deep Reinforcement Learning
Yanlin Zhou
F. Lu
George Pu
Xiyao Ma
Runhan Sun
Hsi-Yuan Chen
Xiaolin Li
D. Wu
102
19
0
15 Nov 2019
Learning Representations in Reinforcement Learning:An Information Bottleneck Approach
Yingjun Pei
Xinwen Hou
SSL
76
10
0
12 Nov 2019
Real-Time Reinforcement Learning
Simon Ramstedt
C. Pal
AI4CE
96
63
0
11 Nov 2019
Multi-Path Policy Optimization
L. Pan
Qingpeng Cai
Longbo Huang
58
2
0
11 Nov 2019
Context-aware Active Multi-Step Reinforcement Learning
Gang Chen
Dingcheng Li
Ran Xu
24
0
0
11 Nov 2019
MBCAL: Sample Efficient and Variance Reduced Reinforcement Learning for Recommender Systems
Fan Wang
Xiaomin Fang
Lihang Liu
Hao Tian
Zhiming Peng
OffRL
35
0
0
06 Nov 2019
Previous
1
2
3
...
40
41
42
43
44
Next