Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Efficient Bayesian Policy Reuse with a Scalable Observation Model in Deep Reinforcement Learning
Donghan Xie
Zhi Wang
Chunlin Chen
D. Dong
OffRL
96
2
0
16 Apr 2022
Accelerated Policy Learning with Parallel Differentiable Simulation
Jie Xu
Viktor Makoviychuk
Yashraj S. Narang
Fabio Ramos
Wojciech Matusik
Animesh Garg
Miles Macklin
77
95
0
14 Apr 2022
JORLDY: a fully customizable open source framework for reinforcement learning
Kyushik Min
Hyunho Lee
Kwansu Shin
Tae-woo Lee
Hojoon Lee
Jinwon Choi
Sung-Hyun Son
OnRL
28
0
0
11 Apr 2022
Offline Reinforcement Learning for Safer Blood Glucose Control in People with Type 1 Diabetes
Harry Emerson
Matt Guy
Ryan McConville
OffRL
117
50
0
07 Apr 2022
Off-Policy Evaluation with Online Adaptation for Robot Exploration in Challenging Environments
Yafei Hu
Junyi Geng
Chen Wang
John Keller
Sebastian Scherer
OffRL
102
15
0
07 Apr 2022
Demonstrate Once, Imitate Immediately (DOME): Learning Visual Servoing for One-Shot Imitation Learning
Eugene Valassakis
Georgios Papagiannis
Norman Di Palo
Edward Johns
65
43
0
06 Apr 2022
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization
Zihan Zhou
Wei Fu
Bingliang Zhang
Yi Wu
76
30
0
04 Apr 2022
Hierarchical Reinforcement Learning under Mixed Observability
Hai V. Nguyen
Zhihan Yang
Andrea Baisero
Xiao Ma
Robert Platt
Chris Amato
56
4
0
02 Apr 2022
Building Decision Forest via Deep Reinforcement Learning
Guixuan Wen
Kaigui Wu
41
4
0
01 Apr 2022
DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools
Xingyu Lin
Zhiao Huang
Yunzhu Li
J. Tenenbaum
David Held
Chuang Gan
88
73
0
31 Mar 2022
TrajGen: Generating Realistic and Diverse Trajectories with Reactive and Feasible Agent Behaviors for Autonomous Driving
Qichao Zhang
Yinfeng Gao
Yikang Zhang
Youtian Guo
Dawei Ding
Yunpeng Wang
Peng Sun
Dongbin Zhao
113
35
0
31 Mar 2022
Reducing Learning Difficulties: One-Step Two-Critic Deep Reinforcement Learning for Inverter-based Volt-Var Control
Qiong Liu
Ye Guo
Lirong Deng
Haotian Liu
Dongyu Li
Hongbin Sun
Wenqi Huang
24
4
0
30 Mar 2022
Marginalized Operators for Off-policy Reinforcement Learning
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
61
0
0
30 Mar 2022
Topological Experience Replay
Zhang-Wei Hong
Tao Chen
Yen-Chen Lin
Joni Pajarinen
Pulkit Agrawal
75
16
0
29 Mar 2022
ReIL: A Framework for Reinforced Intervention-based Imitation Learning
Rom N. Parnichkun
M. Dailey
Atsushi Yamashita
42
3
0
29 Mar 2022
A Study of Reinforcement Learning Algorithms for Aggregates of Minimalistic Robots
Josh Bloom
Apratim Mukherjee
Carlo Pinciroli
16
4
0
28 Mar 2022
Learning Personalized Human-Aware Robot Navigation Using Virtual Reality Demonstrations from a User Study
Jorge de Heuvel
Nathan Corral
Lilli Bruckschen
Maren Bennewitz
58
14
0
28 Mar 2022
Aggressive Quadrotor Flight Using Curiosity-Driven Reinforcement Learning
Q. Sun
Jinbao Fang
Weixing Zheng
Yang Tang
47
30
0
26 Mar 2022
Non-Parametric Stochastic Policy Gradient with Strategic Retreat for Non-Stationary Environment
Apan Dastider
Mingjie Lin
56
2
0
24 Mar 2022
Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Yufeng Yuan
Rupam Mahmood
OffRL
110
19
0
23 Mar 2022
An Optical Control Environment for Benchmarking Reinforcement Learning Algorithms
Abulikemu Abuduweili
Changliu Liu
28
1
0
23 Mar 2022
Action Candidate Driven Clipped Double Q-learning for Discrete and Continuous Action Tasks
Qianliang Wu
Jin Xie
Jian Yang
OffRL
23
11
0
22 Mar 2022
A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle
Ziniu Li
Tian Xu
Yang Yu
86
5
0
22 Mar 2022
One After Another: Learning Incremental Skills for a Changing World
Nur Muhammad (Mahi) Shafiullah
Lerrel Pinto
CLL
76
13
0
21 Mar 2022
Optimizing Trajectories for Highway Driving with Offline Reinforcement Learning
Branka Mirchevska
M. Werling
Joschka Boedecker
OffRL
43
6
0
21 Mar 2022
MicroRacer: a didactic environment for Deep Reinforcement Learning
Andrea Asperti
Marco Del Brutto
60
0
0
20 Mar 2022
Reinforcement learning for automatic quadrilateral mesh generation: a soft actor-critic approach
J. Pan
Jingwei Huang
G. Cheng
Yong Zeng
AI4CE
73
41
0
19 Mar 2022
Meta-Reinforcement Learning for the Tuning of PI Controllers: An Offline Approach
Daniel G. McClement
Nathan P. Lawrence
Johan U. Backstrom
Philip D. Loewen
M. Forbes
R. Bhushan Gopaluni
OffRL
64
25
0
17 Mar 2022
GAC: A Deep Reinforcement Learning Model Toward User Incentivization in Unknown Social Networks
Shiqing Wu
Weihua Li
Quan-wei Bai
GNN
63
11
0
17 Mar 2022
Latent-Variable Advantage-Weighted Policy Optimization for Offline RL
Xi Chen
Ali Ghadirzadeh
Tianhe Yu
Yuan Gao
Jianhao Wang
Wenzhe Li
Bin Liang
Chelsea Finn
Chongjie Zhang
OffRL
83
14
0
16 Mar 2022
Vision-Based Manipulators Need to Also See from Their Hands
Kyle Hsu
Moo Jin Kim
Rafael Rafailov
Jiajun Wu
Chelsea Finn
96
49
0
15 Mar 2022
Combining imitation and deep reinforcement learning to accomplish human-level performance on a virtual foraging task
Vittorio Giammarino
Matthew F. Dunne
Kylie N. Moore
Michael Hasselmo
Chantal E. Stern
I. Paschalidis
OffRL
83
5
0
11 Mar 2022
Near-optimal Deep Reinforcement Learning Policies from Data for Zone Temperature Control
L. D. Natale
B. Svetozarevic
Philipp Heer
Colin N. Jones
OffRL
AI4CE
66
6
0
10 Mar 2022
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
99
255
0
09 Mar 2022
Policy-Based Bayesian Experimental Design for Non-Differentiable Implicit Models
Vincent Lim
Ellen R. Novoseller
Jeffrey Ichnowski
Huang Huang
Ken Goldberg
OffRL
71
11
0
08 Mar 2022
Residual Robot Learning for Object-Centric Probabilistic Movement Primitives
João Carvalho
Dorothea Koert
Marek Daniv
Jan Peters
77
9
0
08 Mar 2022
An Analysis of Measure-Valued Derivatives for Policy Gradients
João Carvalho
Jan Peters
OffRL
9
0
0
08 Mar 2022
A Survey on Reinforcement Learning Methods in Character Animation
Ariel Kwiatkowski
Eduardo Alvarado
Vicky Kalogeiton
Chenxi Liu
Julien Pettré
M. van de Panne
Marie-Paule Cani
AI4CE
97
46
0
07 Mar 2022
Knowledge Transfer in Deep Reinforcement Learning for Slice-Aware Mobility Robustness Optimization
Qi Liao
Tianlun Hu
D. Wellington
41
4
0
07 Mar 2022
Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations
Sean Gillen
Katie Byl
AI4CE
77
3
0
06 Mar 2022
Learning Goal-Oriented Non-Prehensile Pushing in Cluttered Scenes
Nils Dengler
D. Grossklaus
Maren Bennewitz
76
18
0
04 Mar 2022
Cloud-Edge Training Architecture for Sim-to-Real Deep Reinforcement Learning
Hongpeng Cao
Mirco Theile
Federico G. Wyrwal
Marco Caccamo
111
8
0
04 Mar 2022
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems
Rafael Figueiredo Prudencio
Marcos R. O. A. Máximo
Esther Luna Colombini
OffRL
111
243
0
02 Mar 2022
Model-free Neural Lyapunov Control for Safe Robot Navigation
Zikang Xiong
Joe Eappen
A. H. Qureshi
Suresh Jagannathan
57
8
0
02 Mar 2022
Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation
Jing Dong
Li Shen
Ying Xu
Baoxiang Wang
88
1
0
28 Feb 2022
Neural-Progressive Hedging: Enforcing Constraints in Reinforcement Learning with Stochastic Programming
Supriyo Ghosh
L. Wynter
Shiau Hong Lim
D. Nguyen
63
0
0
27 Feb 2022
Inter-Cell Slicing Resource Partitioning via Coordinated Multi-Agent Deep Reinforcement Learning
T. Hu
Qi Liao
Qiang Liu
D. Wellington
Georg Carle
29
11
0
25 Feb 2022
Evolutionary Multi-Objective Reinforcement Learning Based Trajectory Control and Task Offloading in UAV-Assisted Mobile Edge Computing
Fuhong Song
Huanlai Xing
Xinhan Wang
Shouxi Luo
Penglin Dai
Zhiwen Xiao
Bowen Zhao
55
59
0
24 Feb 2022
Comparative analysis of machine learning methods for active flow control
F. Pino
Lorenzo Schena
Jean Rabault
M. A. Mendez
107
44
0
23 Feb 2022
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
106
140
0
23 Feb 2022
Previous
1
2
3
...
26
27
28
...
42
43
44
Next