1802.09477
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"
50 / 851 papers shown
Title
Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation
Qingfeng Lan
Yangchen Pan
Jun Luo
A. R. Mahmood
OffRL
36
8
0
22 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
43
8
0
20 May 2022
Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks
Ryan M Sander
Wilko Schwarting
Tim Seyde
Igor Gilitschenski
S. Karaman
Daniela Rus
43
2
0
18 May 2022
Qualitative Differences Between Evolutionary Strategies and Reinforcement Learning Methods for Control of Autonomous Agents
Nicola Milano
S. Nolfi
28
0
0
16 May 2022
Reachability Constrained Reinforcement Learning
Dongjie Yu
Haitong Ma
Sheng Li
Jianyu Chen
63
55
0
16 May 2022
Contact Points Discovery for Soft-Body Manipulations with Differentiable Physics
Sizhe Li
Zhiao Huang
Tao Du
Hao Su
J. Tenenbaum
Chuang Gan
25
26
0
05 May 2022
Cost Effective MLaaS Federation: A Combinatorial Reinforcement Learning Approach
Shuzhao Xie
Yuan Xue
Yifei Zhu
Zhi Wang
FedML
18
12
0
29 Apr 2022
TASAC: a twin-actor reinforcement learning framework with stochastic policy for batch process control
Tanuja Joshi
H. Kodamana
Harikumar Kandath
N. Kaisare
OffRL
27
0
0
22 Apr 2022
Learning to Fold Real Garments with One Arm: A Case Study in Cloud-Based Robotics Research
Ryan Hoque
K. Shivakumar
Shrey Aeron
Gabriel Deza
Aditya Ganapathi
Adrian S. Wong
Johnny Lee
Andy Zeng
Vincent Vanhoucke
Ken Goldberg
31
21
0
21 Apr 2022
Offline Reinforcement Learning for Safer Blood Glucose Control in People with Type 1 Diabetes
Harry Emerson
Matt Guy
Ryan McConville
OffRL
32
46
0
07 Apr 2022
Off-Policy Evaluation with Online Adaptation for Robot Exploration in Challenging Environments
Yafei Hu
Junyi Geng
Chen Wang
John Keller
Sebastian Scherer
OffRL
34
15
0
07 Apr 2022
Demonstrate Once, Imitate Immediately (DOME): Learning Visual Servoing for One-Shot Imitation Learning
Eugene Valassakis
Georgios Papagiannis
Norman Di Palo
Edward Johns
32
41
0
06 Apr 2022
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization
Zihan Zhou
Wei Fu
Bingliang Zhang
Yi Wu
27
28
0
04 Apr 2022
Hierarchical Reinforcement Learning under Mixed Observability
Hai V. Nguyen
Zhihan Yang
Andrea Baisero
Xiao Ma
Robert Platt
Chris Amato
37
4
0
02 Apr 2022
DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools
Xingyu Lin
Zhiao Huang
Yunzhu Li
J. Tenenbaum
David Held
Chuang Gan
35
72
0
31 Mar 2022
TrajGen: Generating Realistic and Diverse Trajectories with Reactive and Feasible Agent Behaviors for Autonomous Driving
Qichao Zhang
Yinfeng Gao
Yikang Zhang
Youtian Guo
Dawei Ding
Yunpeng Wang
Peng Sun
Dongbin Zhao
35
34
0
31 Mar 2022
ReIL: A Framework for Reinforced Intervention-based Imitation Learning
Rom N. Parnichkun
M. Dailey
Atsushi Yamashita
24
2
0
29 Mar 2022
Learning Personalized Human-Aware Robot Navigation Using Virtual Reality Demonstrations from a User Study
Jorge de Heuvel
Nathan Corral
Lilli Bruckschen
Maren Bennewitz
30
14
0
28 Mar 2022
Aggressive Quadrotor Flight Using Curiosity-Driven Reinforcement Learning
Q. Sun
Jinbao Fang
Weixing Zheng
Yang Tang
19
27
0
26 Mar 2022
Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Yufeng Yuan
Rupam Mahmood
OffRL
36
19
0
23 Mar 2022
An Optical Control Environment for Benchmarking Reinforcement Learning Algorithms
Abulikemu Abuduweili
Changliu Liu
24
1
0
23 Mar 2022
MicroRacer: a didactic environment for Deep Reinforcement Learning
Andrea Asperti
Marco Del Brutto
32
0
0
20 Mar 2022
Reinforcement learning for automatic quadrilateral mesh generation: a soft actor-critic approach
J. Pan
Jingwei Huang
G. Cheng
Yong Zeng
AI4CE
24
40
0
19 Mar 2022
Meta-Reinforcement Learning for the Tuning of PI Controllers: An Offline Approach
Daniel G. McClement
Nathan P. Lawrence
Johan U. Backstrom
Philip D. Loewen
M. Forbes
R. Bhushan Gopaluni
OffRL
27
22
0
17 Mar 2022
Vision-Based Manipulators Need to Also See from Their Hands
Kyle Hsu
Moo Jin Kim
Rafael Rafailov
Jiajun Wu
Chelsea Finn
37
45
0
15 Mar 2022
Combining imitation and deep reinforcement learning to accomplish human-level performance on a virtual foraging task
Vittorio Giammarino
Matthew F. Dunne
Kylie N. Moore
Michael Hasselmo
Chantal E. Stern
I. Paschalidis
OffRL
39
5
0
11 Mar 2022
Near-optimal Deep Reinforcement Learning Policies from Data for Zone Temperature Control
L. D. Natale
B. Svetozarevic
Philipp Heer
Colin N. Jones
OffRL
AI4CE
40
6
0
10 Mar 2022
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
41
226
0
09 Mar 2022
Residual Robot Learning for Object-Centric Probabilistic Movement Primitives
João Carvalho
Dorothea Koert
Marek Daniv
Jan Peters
29
8
0
08 Mar 2022
Knowledge Transfer in Deep Reinforcement Learning for Slice-Aware Mobility Robustness Optimization
Qi Liao
Tianlun Hu
D. Wellington
21
3
0
07 Mar 2022
Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations
Sean Gillen
Katie Byl
AI4CE
23
3
0
06 Mar 2022
Learning Goal-Oriented Non-Prehensile Pushing in Cluttered Scenes
Nils Dengler
D. Grossklaus
Maren Bennewitz
19
16
0
04 Mar 2022
Cloud-Edge Training Architecture for Sim-to-Real Deep Reinforcement Learning
Hongpeng Cao
Mirco Theile
Federico G. Wyrwal
Marco Caccamo
43
6
0
04 Mar 2022
Neural-Progressive Hedging: Enforcing Constraints in Reinforcement Learning with Stochastic Programming
Supriyo Ghosh
L. Wynter
Shiau Hong Lim
D. Nguyen
34
0
0
27 Feb 2022
Inter-Cell Slicing Resource Partitioning via Coordinated Multi-Agent Deep Reinforcement Learning
T. Hu
Qi Liao
Qiang Liu
D. Wellington
Georg Carle
17
10
0
25 Feb 2022
Comparative analysis of machine learning methods for active flow control
F. Pino
Lorenzo Schena
Jean Rabault
M. A. Mendez
34
43
0
23 Feb 2022
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
45
132
0
23 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
16
15
0
23 Feb 2022
Reinforcement Learning from Demonstrations by Novel Interactive Expert and Application to Automatic Berthing Control Systems for Unmanned Surface Vessel
Haoran Zhang
Chenkun Yin
Yanxin Zhang
S. Jin
Zhenxuan Li
OffRL
21
3
0
23 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
38
9
0
23 Feb 2022
Multi-task Safe Reinforcement Learning for Navigating Intersections in Dense Traffic
Yuqi Liu
Qichao Zhang
Dongbin Zhao
28
14
0
19 Feb 2022
Soft Actor-Critic Deep Reinforcement Learning for Fault Tolerant Flight Control
Killian Dally
E. Kampen
27
16
0
16 Feb 2022
L2C2: Locally Lipschitz Continuous Constraint towards Stable and Smooth Reinforcement Learning
Taisuke Kobayashi
31
15
0
15 Feb 2022
QuadSim: A Quadcopter Rotational Dynamics Simulation Framework For Reinforcement Learning Algorithms
Burak Han Demirbilek
18
0
0
14 Feb 2022
Supported Policy Optimization for Offline Reinforcement Learning
Jialong Wu
Haixu Wu
Zihan Qiu
Jianmin Wang
Mingsheng Long
OffRL
40
65
0
13 Feb 2022
REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer
Xingyu Liu
Deepak Pathak
Kris Kitani
25
19
0
10 Feb 2022
Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning
Stephen James
Pieter Abbeel
35
9
0
08 Feb 2022
Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning
Bryon Tjanaka
Matthew C. Fontaine
Julian Togelius
Stefanos Nikolaidis
38
50
0
08 Feb 2022
Model-Based Offline Meta-Reinforcement Learning with Regularization
Sen Lin
Jialin Wan
Tengyu Xu
Yingbin Liang
Junshan Zhang
OffRL
38
17
0
07 Feb 2022
Soft Actor-Critic with Inhibitory Networks for Faster Retraining
J. Ide
Daria Mićović
Michael J. Guarino
K. Alcedo
D. Rosenbluth
Adrian P. Pope
18
3
0
07 Feb 2022