1802.09477
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Cyclic Policy Distillation: Sample-Efficient Sim-to-Real Reinforcement Learning with Domain Randomization
Y. Kadokawa
Lingwei Zhu
Yoshihisa Tsurumine
Takamitsu Matsubara
53
8
0
29 Jul 2022
Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control
T. Kanazawa
Haiyan Wang
Chetan Gupta
UQCV
95
4
0
27 Jul 2022
Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms
Baturay Saglam
Dogan C. Cicek
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
52
3
0
27 Jul 2022
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy
Xiyao Wang
Wichayaporn Wongkamjan
Furong Huang
84
19
0
25 Jul 2022
Flowsheet synthesis through hierarchical reinforcement learning and graph neural networks
Laura Stops
Roel Leenhouts
Qitong Gao
Artur M. Schweidtmann
AI4CE
59
30
0
25 Jul 2022
Tactile Gym 2.0: Sim-to-real Deep Reinforcement Learning for Comparing Low-cost High-Resolution Robot Touch
Yijiong Lin
John Lloyd
Alex Church
Nathan Lepora
87
48
0
21 Jul 2022
Learning Action Translator for Meta Reinforcement Learning on Sparse-Reward Tasks
Yijie Guo
Qiucheng Wu
Honglak Lee
OffRL
90
7
0
19 Jul 2022
An Enhanced Graph Representation for Machine Learning Based Automatic Intersection Management
Marvin Klimke
Jasper Gerigk
Benjamin Völz
M. Buchholz
55
9
0
18 Jul 2022
Asset Allocation: From Markowitz to Deep Reinforcement Learning
Ricard Durall
47
6
0
14 Jul 2022
Ablation Study of How Run Time Assurance Impacts the Training and Performance of Reinforcement Learning Agents
Nathaniel P. Hamilton
Kyle Dunlap
Taylor T. Johnson
Kerianne L. Hobbs
OffRL
76
8
0
08 Jul 2022
Safe reinforcement learning for multi-energy management systems with known constraint functions
Glenn Ceusters
L. R. Camargo
R. Franke
Ann Nowé
M. Messagie
60
16
0
08 Jul 2022
Vessel-following model for inland waterways based on deep reinforcement learning
Fabian Hart
Ostap Okhrin
M. Treiber
63
12
0
07 Jul 2022
Offline RL Policies Should be Trained to be Adaptive
Dibya Ghosh
Anurag Ajay
Pulkit Agrawal
Sergey Levine
OffRL
94
47
0
05 Jul 2022
An Empirical Study of Implicit Regularization in Deep Offline RL
Çağlar Gülçehre
Srivatsan Srinivasan
Jakub Sygnowski
Georg Ostrovski
Mehrdad Farajtabar
Matt Hoffman
Razvan Pascanu
Arnaud Doucet
OffRL
93
17
0
05 Jul 2022
Deep Reinforcement Learning Approach for Trading Automation in The Stock Market
Taylan Kabbani
E. Duman
AIFin
21
56
0
05 Jul 2022
Goal-Conditioned Generators of Deep Policies
Francesco Faccio
Vincent Herrmann
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
80
9
0
04 Jul 2022
General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States
Francesco Faccio
Aditya A. Ramesh
Vincent Herrmann
J. Harb
Jürgen Schmidhuber
OffRL
111
11
0
04 Jul 2022
Asynchronous Curriculum Experience Replay: A Deep Reinforcement Learning Approach for UAV Autonomous Motion Control in Unknown Dynamic Environments
Zijian Hu
Xiao-guang Gao
Kaifang Wan
Qianglong Wang
Yiwei Zhai
65
10
0
04 Jul 2022
USHER: Unbiased Sampling for Hindsight Experience Replay
Liam Schramm
Yunfu Deng
Edgar Granados
Abdeslam Boularias
42
4
0
03 Jul 2022
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Edoardo Cetin
Philip J. Ball
Steve Roberts
Oya Celiktutan
112
38
0
03 Jul 2022
Learning fast and agile quadrupedal locomotion over complex terrain
Xu Chang
Zhitong Zhang
Honglei An
Hongxu Ma
Qing Wei
55
0
0
02 Jul 2022
On the Learning and Learnability of Quasimetrics
Tongzhou Wang
Phillip Isola
105
9
0
30 Jun 2022
Reinforcement Learning in Medical Image Analysis: Concepts, Applications, Challenges, and Future Directions
Mingzhe Hu
Jiahan Zhang
L. Matkovic
Tian Liu
Xiaofeng Yang
OffRL
66
55
0
28 Jun 2022
Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
68
2
0
28 Jun 2022
Prompting Decision Transformer for Few-Shot Policy Generalization
Mengdi Xu
Songlin Yang
Shun Zhang
Yuchen Lu
Ding Zhao
J. Tenenbaum
Chuang Gan
OffRL
96
150
0
27 Jun 2022
When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
Haoyi Niu
Shubham Sharma
Yiwen Qiu
Ming Li
Guyue Zhou
Jianming Hu
Xianyuan Zhan
OffRL
OnRL
136
52
0
27 Jun 2022
Analysis of Stochastic Processes through Replay Buffers
Shirli Di-Castro Shashua
Shie Mannor
Dotan Di-Castro
94
6
0
26 Jun 2022
Guided Exploration in Reinforcement Learning via Monte Carlo Critic Optimization
Igor Kuznetsov
89
2
0
25 Jun 2022
Value Function Decomposition for Iterative Design of Reinforcement Learning Agents
J. MacGlashan
Evan Archer
A. Devlic
Takuma Seno
Craig Sherstan
Peter R. Wurman
Peter Stone
47
6
0
24 Jun 2022
Reinforcement learning based adaptive metaheuristics
Michele Tessari
Giovanni Iacca
63
5
0
24 Jun 2022
Behavior Transformers: Cloning k modes with one stone
Nur Muhammad (Mahi) Shafiullah
Zichen Jeff Cui
Ariuntuya Altanzaya
Lerrel Pinto
OffRL
78
241
0
22 Jun 2022
Robust Imitation Learning against Variations in Environment Dynamics
Jongseong Chae
Seungyul Han
Whiyoung Jung
Myungsik Cho
Sungho Choi
Young-Jin Sung
OOD
72
20
0
19 Jun 2022
Interactive Visual Reasoning under Uncertainty
Manjie Xu
Guangyuan Jiang
Wei Liang
Song-Chun Zhu
Yixin Zhu
LRM
101
5
0
18 Jun 2022
Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis
Shayegan Omidshafiei
A. Kapishnikov
Yannick Assogba
Lucas Dixon
Been Kim
OffRL
66
5
0
17 Jun 2022
AnyMorph: Learning Transferable Polices By Inferring Agent Morphology
Brandon Trabucco
Mariano Phielipp
Glen Berseth
76
28
0
17 Jun 2022
Fast Population-Based Reinforcement Learning on a Single Machine
Arthur Flajolet
Claire Bizon Monroc
Karim Beguir
Thomas Pierrot
OffRL
74
10
0
17 Jun 2022
CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer
Yao Mu
Shoufa Chen
Mingyu Ding
Jianyu Chen
Runjian Chen
Ping Luo
ViT
77
9
0
17 Jun 2022
SMPL: Simulated Industrial Manufacturing and Process Control Learning Environments
Mohan Zhang
Xiaozhou Wang
Benjamin Decardi-Nelson
Bo Song
A. Zhang
...
Jiayi Cheng
Xiaohong Liu
DengDeng Yu
Matthew Poon
Animesh Garg
69
4
0
17 Jun 2022
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen
Tianhao Wu
Shengjie Wang
Xidong Feng
Jiechuan Jiang
...
Yiran Geng
Hao Dong
Zongqing Lu
Song-Chun Zhu
Yaodong Yang
OffRL
129
116
0
17 Jun 2022
SafeRL-Kit: Evaluating Efficient Reinforcement Learning Methods for Safe Autonomous Driving
Linrui Zhang
Qin Zhang
Li Shen
Bo Yuan
Xueqian Wang
OffRL
98
9
0
17 Jun 2022
Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination
Jiafei Lyu
Xiu Li
Zongqing Lu
OffRL
85
26
0
16 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
115
164
0
15 Jun 2022
Defending Observation Attacks in Deep Reinforcement Learning via Detection and Denoising
Zikang Xiong
Joe Eappen
He Zhu
Suresh Jagannathan
AAML
39
10
0
14 Jun 2022
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning
Shentao Yang
Yihao Feng
Shujian Zhang
Mi Zhou
OffRL
98
12
0
14 Jun 2022
Safe-FinRL: A Low Bias and Variance Deep Reinforcement Learning Implementation for High-Freq Stock Trading
Zitao Song
Xuyang Jin
Chenliang Li
OffRL
AIFin
48
1
0
13 Jun 2022
Reinforcement Learning for Vision-based Object Manipulation with Non-parametric Policy and Action Primitives
Dongwon Son
Myungsin Kim
Jaecheol Sim
Wonsik Shin
46
1
0
12 Jun 2022
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
Cong Lu
Philip J. Ball
Tim G. J. Rudner
Jack Parker-Holder
Michael A. Osborne
Yee Whye Teh
OffRL
132
54
0
09 Jun 2022
Mildly Conservative Q-Learning for Offline Reinforcement Learning
Jiafei Lyu
Xiaoteng Ma
Xiu Li
Zongqing Lu
OffRL
106
113
0
09 Jun 2022
Overcoming the Spectral Bias of Neural Value Approximation
Ge Yang
Anurag Ajay
Pulkit Agrawal
103
26
0
09 Jun 2022
Biologically Inspired Dynamic Thresholds for Spiking Neural Networks
Jianchuan Ding
B. Dong
Felix Heide
Yufei Ding
Yunduo Zhou
Baocai Yin
Xin Yang
57
24
0
09 Jun 2022