Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning
Alex Beeson
Giovanni Montana
OffRL
OnRL
60
24
0
21 Nov 2022
Model-based Trajectory Stitching for Improved Offline Reinforcement Learning
Charles A. Hepburn
Giovanni Montana
OffRL
84
14
0
21 Nov 2022
A Low Latency Adaptive Coding Spiking Framework for Deep Reinforcement Learning
Lang Qin
Rui Yan
Huajin Tang
OffRL
58
6
0
21 Nov 2022
Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
D. Akimov
Vladislav Kurenkov
Alexander Nikulin
Denis Tarasov
Sergey Kolesnikov
OffRL
86
11
0
20 Nov 2022
PIC4rl-gym: a ROS2 modular framework for Robots Autonomous Navigation with Deep Reinforcement Learning
Mauro Martini
Andrea Eirale
Simone Cerrato
Marcello Chiaberge
60
11
0
19 Nov 2022
Offline Reinforcement Learning with Adaptive Behavior Regularization
Yunfan Zhou
Xijun Li
Qingyu Qu
OffRL
78
1
0
15 Nov 2022
Legged Locomotion in Challenging Terrains using Egocentric Vision
Ananye Agarwal
Ashish Kumar
Jitendra Malik
Deepak Pathak
91
216
0
14 Nov 2022
CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards global optimality
Gianluigi Grandesso
Elisa Alboni
G. P. R. Papini
Patrick M. Wensing
Andrea Del Prete
78
16
0
12 Nov 2022
A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges
Yunpeng Qing
Shunyu Liu
Mingli Song
Huiqiong Wang
Mingli Song
XAI
85
1
0
12 Nov 2022
When is Realizability Sufficient for Off-Policy Reinforcement Learning?
Andrea Zanette
OffRL
65
15
0
10 Nov 2022
Detecting and Accommodating Novel Types and Concepts in an Embodied Simulation Environment
Sadaf Ghaffari
Nikhil Krishnaswamy
38
7
0
08 Nov 2022
Progress and summary of reinforcement learning on energy management of MPS-EV
Jincheng Hu
Yang Lin
Liang Chu
Zhuoran Hou
Jihan Li
Jingjing Jiang
Yuanjian Zhang
130
13
0
08 Nov 2022
Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification
Takumi Tanabe
Reimi Sato
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
OffRL
73
11
0
07 Nov 2022
On learning history based policies for controlling Markov decision processes
Gandharv Patil
Aditya Mahajan
Doina Precup
OffRL
94
5
0
06 Nov 2022
Graph Reinforcement Learning Application to Co-operative Decision-Making in Mixed Autonomy Traffic: Framework, Survey, and Challenges
Qi Liu
Xueyuan Li
Zirui Li
Jingda Wu
Guodong Du
Xinlu Gao
Fan Yang
Shihua Yuan
68
8
0
06 Nov 2022
Leveraging Fully Observable Policies for Learning under Partial Observability
Hai V. Nguyen
Andrea Baisero
Dian Wang
Chris Amato
Robert Platt
OffRL
97
20
0
03 Nov 2022
Causal Counterfactuals for Improving the Robustness of Reinforcement Learning
Tom He
Jasmina Gajcin
Ivana Dusparic
CML
71
5
0
02 Nov 2022
Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints
Anika Singh
Aviral Kumar
Q. Vuong
Yevgen Chebotar
Sergey Levine
OffRL
62
14
0
02 Nov 2022
Spatial-temporal recurrent reinforcement learning for autonomous ships
Martin Waltz
Ostap Okhrin
96
9
0
02 Nov 2022
Model-based Reinforcement Learning with a Hamiltonian Canonical ODE Network
Yao Feng
Yuhong Jiang
Hang Su
Dong Yan
Jun Zhu
90
1
0
02 Nov 2022
Reinforcement Learning for Solving Robotic Reaching Tasks in the Neurorobotics Platform
Márton Szep
Leander Lauenburg
Kevin Farkas
Xiyan Su
Chuanlong Zang
59
0
0
31 Oct 2022
LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight Grouping for Multi-Agent Reinforcement Learning
Jenny Yang
Jaeuk Kim
Joo-Young Kim
54
2
0
29 Oct 2022
Meta-Reinforcement Learning Using Model Parameters
G. Hartmann
A. Azaria
73
0
0
27 Oct 2022
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering
Jun Lv
Yunhai Feng
Cheng Zhang
Shu Zhao
Lin Shao
Cewu Lu
84
26
0
27 Oct 2022
Reachability Verification Based Reliability Assessment for Deep Reinforcement Learning Controlled Robotics and Autonomous Systems
Yizhen Dong
Xingyu Zhao
Sen Wang
Xiaowei Huang
106
8
0
26 Oct 2022
Low-Rank Modular Reinforcement Learning via Muscle Synergy
Heng Dong
Tonghan Wang
Jiayuan Liu
Chongjie Zhang
130
18
0
26 Oct 2022
ERL-Re
2
^2
2
: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation
Jianye Hao
Pengyi Li
Hongyao Tang
Yan Zheng
Xian Fu
Zhaopeng Meng
89
26
0
26 Oct 2022
A Bibliometric Analysis and Review on Reinforcement Learning for Transportation Applications
Can Li
Lei Bai
L. Yao
S. Waller
Wei Liu
81
15
0
26 Oct 2022
Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data
John So
Amber Xie
Sunggoo Jung
J. Edlund
Rohan Thakker
Ali Agha-mohammadi
Pieter Abbeel
Stephen James
92
9
0
25 Oct 2022
Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning
Yi Zhao
Rinu Boney
Alexander Ilin
Arno Solin
Joni Pajarinen
OffRL
OnRL
84
40
0
25 Oct 2022
Empirical analysis of PGA-MAP-Elites for Neuroevolution in Uncertain Domains
Manon Flageat
Félix Chalumeau
Antoine Cully
82
26
0
24 Oct 2022
MetaEMS: A Meta Reinforcement Learning-based Control Framework for Building Energy Management System
Huiliang Zhang
Di Wu
Benoit Boulet
78
6
0
23 Oct 2022
Bridging the Gap Between Target Networks and Functional Regularization
Alexandre Piché
Valentin Thomas
Joseph Marino
Rafael Pardiñas
Gian Maria Marconi
C. Pal
Mohammad Emtiyaz Khan
63
2
0
21 Oct 2022
STAP: Sequencing Task-Agnostic Policies
Christopher Agia
Toki Migimatsu
Jiajun Wu
Jeannette Bohg
111
20
0
21 Oct 2022
Learning Robust Dynamics through Variational Sparse Gating
A. Jain
Shivakanth Sujit
S. Joshi
Vincent Michalski
Danijar Hafner
Samira Ebrahimi Kahou
68
9
0
21 Oct 2022
RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control
Yanfei Xiang
Xin Wang
Shu Hu
Bin Zhu
Xiaomeng Huang
Xi Wu
Siwei Lyu
SSL
89
5
0
20 Oct 2022
Task Phasing: Automated Curriculum Learning from Demonstrations
Vaibhav Bajaj
Guni Sharon
Peter Stone
66
8
0
20 Oct 2022
Integrated Decision and Control for High-Level Automated Vehicles by Mixed Policy Gradient and Its Experiment Verification
Yang Guan
Liye Tang
Chuanxiao Li
Shengbo Eben Li
Yangang Ren
Junqing Wei
Bo Zhang
Ke Li
40
0
0
19 Oct 2022
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
Chengqian Gao
Kelvin Xu
Liu Liu
Deheng Ye
P. Zhao
Zhiqiang Xu
OffRL
95
2
0
19 Oct 2022
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Wei Qiu
Xiao Ma
Bo An
S. Obraztsova
Shuicheng Yan
Zhongwen Xu
70
2
0
18 Oct 2022
Deep Black-Box Reinforcement Learning with Movement Primitives
Fabian Otto
Onur Celik
Hongyi Zhou
Hanna Ziesche
Ngo Anh Vien
Gerhard Neumann
OffRL
71
19
0
18 Oct 2022
Planning for Sample Efficient Imitation Learning
Zhao-Heng Yin
Weirui Ye
Qifeng Chen
Yang Gao
OffRL
89
21
0
18 Oct 2022
The Impact of Task Underspecification in Evaluating Deep Reinforcement Learning
Vindula Jayawardana
Catherine Tang
Sirui Li
Da Suo
Cathy Wu
OffRL
108
13
0
16 Oct 2022
When to Update Your Model: Constrained Model-based Reinforcement Learning
Tianying Ji
Yu-Juan Luo
Gang Hua
Mingxuan Jing
Fengxiang He
Wen-bing Huang
76
19
0
15 Oct 2022
A Policy-Guided Imitation Approach for Offline Reinforcement Learning
Haoran Xu
Li Jiang
Jianxiong Li
Xianyuan Zhan
OffRL
153
64
0
15 Oct 2022
A Scalable Reinforcement Learning Approach for Attack Allocation in Swarm to Swarm Engagement Problems
Umut Demir
N. K. Üre
48
1
0
15 Oct 2022
PI-QT-Opt: Predictive Information Improves Multi-Task Robotic Reinforcement Learning at Scale
Kuang-Huei Lee
Ted Xiao
A. Li
Paul Wohlhart
Ian S. Fischer
Yao Lu
120
10
0
15 Oct 2022
DyFEn: Agent-Based Fee Setting in Payment Channel Networks
Kian Asgari
Aida Mohammadian
M. Tefagh
18
7
0
15 Oct 2022
CUP: Critic-Guided Policy Reuse
Jin Zhang
Siyuan Li
Chongjie Zhang
86
8
0
15 Oct 2022
Geometric Reinforcement Learning For Robotic Manipulation
Naseem Alhousani
Matteo Saveriano
Ibrahim Sevinc
Talha Abdulkuddus
Hatice Kose
Fares J. Abu-Dakka
70
6
0
14 Oct 2022
Previous
1
2
3
...
21
22
23
...
42
43
44
Next