Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Generalizable Episodic Memory for Deep Reinforcement Learning
Haotian Hu
Jianing Ye
Guangxiang Zhu
Zhizhou Ren
Chongjie Zhang
OffRL
84
39
0
11 Mar 2021
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Benjamin Eysenbach
Sergey Levine
OOD
128
186
0
10 Mar 2021
Learning to Play Soccer From Scratch: Sample-Efficient Emergent Coordination through Curriculum-Learning and Competition
Pavan Samtani
Francisco Leiva
Javier Ruiz-del-Solar
40
2
0
09 Mar 2021
Model-free Policy Learning with Reward Gradients
Qingfeng Lan
Samuele Tosatto
Homayoon Farrahi
Rupam Mahmood
49
6
0
09 Mar 2021
Instabilities of Offline RL with Pre-Trained Neural Representation
Ruosong Wang
Yifan Wu
Ruslan Salakhutdinov
Sham Kakade
OffRL
158
42
0
08 Mar 2021
Can You Fix My Neural Network? Real-Time Adaptive Waveform Synthesis for Resilient Wireless Signal Classification
Salvatore D’oro
Francesco Restuccia
Tommaso Melodia
44
11
0
05 Mar 2021
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Chong Chen
Yaodong Yang
Lu Zhang
Wulong Liu
Zhaopeng Meng
OffRL
138
4
0
03 Mar 2021
Offline Reinforcement Learning with Pseudometric Learning
Robert Dadashi
Shideh Rezaeifar
Nino Vieillard
Léonard Hussenot
Olivier Pietquin
Matthieu Geist
OffRL
103
41
0
02 Mar 2021
Decision Making in Monopoly using a Hybrid Deep Reinforcement Learning Approach
Trevor Bonjour
Marina Haliem
A. Alsalem
Shilpa Thomas
Hongyu Li
Vaneet Aggarwal
Mayank Kejriwal
Bharat K. Bhargava
97
15
0
01 Mar 2021
Sim-to-Real Transfer for Robotic Manipulation with Tactile Sensory
Zihan Ding
Ya-Yen Tsai
Wang Wei Lee
Bidan Huang
35
28
0
28 Feb 2021
Revisiting Peng's Q(
λ
λ
λ
) for Modern Reinforcement Learning
Tadashi Kozuno
Yunhao Tang
Mark Rowland
Rémi Munos
Steven Kapturowski
Will Dabney
Michal Valko
David Abel
OffRL
55
19
0
27 Feb 2021
Off-Policy Imitation Learning from Observations
Zhuangdi Zhu
Kaixiang Lin
Bo Dai
Jiayu Zhou
OffRL
57
86
0
25 Feb 2021
Deep Reinforcement Learning for Safe Landing Site Selection with Concurrent Consideration of Divert Maneuvers
Keidai Iiyama
Kento Tomita
Bhavi Jagatia
Tatsuwaki Nakagawa
K. Ho
65
14
0
24 Feb 2021
Memory-based Deep Reinforcement Learning for POMDPs
Lingheng Meng
R. Gorbet
Dana Kulic
104
100
0
24 Feb 2021
FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with Quantization-Aware Training and Adaptive Parallelism
Jenny Yang
Seongmin Hong
Joo-Young Kim
50
18
0
24 Feb 2021
Honey, I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL
Siddharth Mysore
B. Mabsout
R. Mancuso
Kate Saenko
OffRL
38
9
0
23 Feb 2021
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Tengyu Xu
Zhuoran Yang
Zhaoran Wang
Yingbin Liang
OffRL
106
25
0
23 Feb 2021
Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model
Yang Guan
Jingliang Duan
Shengbo Eben Li
Jie Li
Jianyu Chen
B. Cheng
OffRL
77
12
0
23 Feb 2021
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning
Xianyuan Zhan
Haoran Xu
Yueying Zhang
Xiangyu Zhu
Honglei Yin
Yu Zheng
OffRL
AI4CE
135
69
0
23 Feb 2021
HALMA: Humanlike Abstraction Learning Meets Affordance in Rapid Problem Solving
Sirui Xie
Xiaojian Ma
Peiyu Yu
Yixin Zhu
Ying Nian Wu
Song-Chun Zhu
94
20
0
22 Feb 2021
Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning
Brett Daley
Cameron Hickert
Chris Amato
OffRL
23
5
0
22 Feb 2021
Reinforcement Learning with Prototypical Representations
Denis Yarats
Rob Fergus
A. Lazaric
Lerrel Pinto
SSL
81
226
0
22 Feb 2021
Reinforcement Learning of the Prediction Horizon in Model Predictive Control
Eivind Bøhn
S. Gros
Signe Moe
T. Johansen
44
36
0
22 Feb 2021
Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization
Jyun-Li Lin
Wei-Ting Hung
Shangtong Yang
Ping-Chun Hsieh
Xi Liu
110
14
0
22 Feb 2021
Dealing with Non-Stationarity in MARL via Trust-Region Decomposition
Wenhao Li
Xiangfeng Wang
Bo Jin
Junjie Sheng
H. Zha
122
9
0
21 Feb 2021
Decentralized Deterministic Multi-Agent Reinforcement Learning
Antoine Grosnit
D. Cai
L. Wynter
OffRL
115
7
0
19 Feb 2021
Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving
Haochen Liu
Zhiyu Huang
Jingda Wu
Chen Lv
86
74
0
18 Feb 2021
Continuous Doubly Constrained Batch Reinforcement Learning
Rasool Fakoor
Jonas W. Mueller
Kavosh Asadi
Pratik Chaudhari
Alex Smola
OffRL
283
27
0
18 Feb 2021
TradeR: Practical Deep Hierarchical Reinforcement Learning for Trade Execution
Karush Suri
Xiaolong Shi
Konstantinos Plataniotis
Y. Lawryshyn
OffRL
42
4
0
16 Feb 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
302
433
0
16 Feb 2021
Steadily Learn to Drive with Virtual Memory
Yuhang Zhang
Yao Mu
Yujie Yang
Yang Guan
Shengbo Eben Li
Qi Sun
Jianyu Chen
40
1
0
16 Feb 2021
Transferring Domain Knowledge with an Adviser in Continuous Tasks
Rukshan Wijesinghe
Kasun Vithanage
Dumindu Tissera
A. Xavier
Subha Fernando
Jayathu Samarawickrama
CLL
40
0
0
16 Feb 2021
Training Larger Networks for Deep Reinforcement Learning
Keita Ota
Devesh K. Jha
Asako Kanezaki
OffRL
97
40
0
16 Feb 2021
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
Anish Agarwal
Abdullah Alomar
Varkey Alumootil
Devavrat Shah
Dennis Shen
Zhi Xu
Cindy Yang
OffRL
76
18
0
13 Feb 2021
Derivative-Free Reinforcement Learning: A Review
Hong Qian
Yang Yu
OffRL
134
42
0
10 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
55
13
0
09 Feb 2021
Model-Augmented Q-learning
Youngmin Oh
Jinwoo Shin
Eunho Yang
Sung Ju Hwang
OffRL
43
1
0
07 Feb 2021
Tactical Optimism and Pessimism for Deep Reinforcement Learning
Theodore H. Moskovitz
Jack Parker-Holder
Aldo Pacchiano
Michael Arbel
Michael I. Jordan
96
59
0
07 Feb 2021
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned
Julian Ibarz
Jie Tan
Chelsea Finn
Mrinal Kalakrishnan
P. Pastor
Sergey Levine
OffRL
158
535
0
04 Feb 2021
Learning-based vs Model-free Adaptive Control of a MAV under Wind Gust
Thomas Chaffre
Julien Moras
Adrien Chan-Hon-Tong
J. Marzat
Karl Sammut
G. Chenadec
Benoit Clement
44
5
0
29 Jan 2021
OffCon
3
^3
3
: What is state of the art anyway?
Philip J. Ball
Stephen J. Roberts
OffRL
82
8
0
27 Jan 2021
Learning Synthetic Environments for Reinforcement Learning with Evolution Strategies
Fabio Ferreira
Thomas Nierhoff
Frank Hutter
57
8
0
24 Jan 2021
GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning
Juhyoung Lee
Sangyeob Kim
Sangjin Kim
Wooyoung Jo
H. Yoo
OffRL
63
9
0
24 Jan 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
William F. Whitney
Michael Bloesch
Jost Tobias Springenberg
A. Abdolmaleki
Kyunghyun Cho
Martin Riedmiller
OffRL
72
16
0
23 Jan 2021
Breaking the Deadly Triad with a Target Network
Shangtong Zhang
Hengshuai Yao
Shimon Whiteson
AAML
127
45
0
21 Jan 2021
Robust Reinforcement Learning on State Observations with Learned Optimal Adversary
Huan Zhang
Hongge Chen
Duane S. Boning
Cho-Jui Hsieh
121
168
0
21 Jan 2021
Learning Kinematic Feasibility for Mobile Manipulation through Deep Reinforcement Learning
Daniel Honerkamp
Tim Welschehold
Abhinav Valada
76
49
0
13 Jan 2021
Evolving Reinforcement Learning Algorithms
John D. Co-Reyes
Yingjie Miao
Daiyi Peng
Esteban Real
Sergey Levine
Quoc V. Le
Honglak Lee
Aleksandra Faust
131
74
0
08 Jan 2021
Average-Reward Off-Policy Policy Evaluation with Function Approximation
Shangtong Zhang
Yi Wan
R. Sutton
Shimon Whiteson
OffRL
73
31
0
08 Jan 2021
A Survey of Deep RL and IL for Autonomous Driving Policy Learning
Zeyu Zhu
Huijing Zhao
147
159
0
06 Jan 2021
Previous
1
2
3
...
34
35
36
...
42
43
44
Next