1802.09477
Cited By
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"
50 / 849 papers shown
Title
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Qiyang Li
Aviral Kumar
Ilya Kostrikov
Sergey Levine
OffRL
32
31
0
20 Apr 2023
Aiding reinforcement learning for set point control
Ruoqing Zhang
Per Mattsson
T. Wigren
21
3
0
20 Apr 2023
Filter-Aware Model-Predictive Control
Baris Kayalibay
Atanas Mirchev
Ahmed Agha
Patrick van der Smagt
Justin Bayer
44
0
0
20 Apr 2023
Robust Deep Reinforcement Learning Scheduling via Weight Anchoring
Steffen Gracla
Edgar Beck
C. Bockelmann
Armin Dekorsy
OOD
38
3
0
20 Apr 2023
Integration of Reinforcement Learning Based Behavior Planning With Sampling Based Motion Planning for Automated Driving
Marvin Klimke
Benjamin Völz
M. Buchholz
26
5
0
17 Apr 2023
Efficient Quality-Diversity Optimization through Diverse Quality Species
Ryan Wickman
Bibek Poudel
Taylor Michael Villarreal
Xiaofei Zhang
Weizi Li
38
6
0
14 Apr 2023
Model Predictive Control with Self-supervised Representation Learning
Jonas A. Matthies
Muhammad Burhan Hafez
Mostafa Kotb
S. Wermter
SSL
13
0
0
14 Apr 2023
UAV Obstacle Avoidance by Human-in-the-Loop Reinforcement in Arbitrary 3D Environment
Xuyang Li
Jianwu Fang
Kai Du
K. Mei
Jianru Xue
34
6
0
07 Apr 2023
Generative Adversarial Neuroevolution for Control Behaviour Imitation
Maximilien Le Clei
Pierre C. Bellec
21
0
0
03 Apr 2023
Adaptive formation motion planning and control of autonomous underwater vehicles using deep reinforcement learning
Behnaz Hadi
A. Khosravi
Pouria Sarhadi
20
18
0
01 Apr 2023
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Anqi Li
Byron Boots
Ching-An Cheng
OffRL
28
16
0
30 Mar 2023
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Haoran Xu
Li Jiang
Jianxiong Li
Zhuoran Yang
Zhaoran Wang
Victor Chan
Xianyuan Zhan
OffRL
36
73
0
28 Mar 2023
The Quality-Diversity Transformer: Generating Behavior-Conditioned Trajectories with Decision Transformers
Valentin Macé
Raphael Boige
Félix Chalumeau
Thomas Pierrot
Guillaume Richard
Nicolas Perrin-Gilbert
OffRL
38
12
0
27 Mar 2023
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic Environments
Hongyi Chen
Changliu Liu
OffRL
27
14
0
24 Mar 2023
Multi-Task Reinforcement Learning in Continuous Control with Successor Feature-Based Concurrent Composition
Y. Liu
Aamir Ahmad
29
4
0
24 Mar 2023
SACPlanner: Real-World Collision Avoidance with a Soft Actor Critic Local Planner and Polar State Representations
Khaled Nakhleh
Minahil Raza
Mack Tang
M. Andrews
Rinu Boney
I. Hadžić
Jeongran Lee
Atefeh Mohajeri
Karina Palyutina
20
4
0
21 Mar 2023
Towards Real-World Applications of Personalized Anesthesia Using Policy Constraint Q Learning for Propofol Infusion Control
Xiuding Cai
Jiao Chen
Yaoyao Zhu
Beiming Wang
Yu Yao
OffRL
38
5
0
17 Mar 2023
Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
OffRL
AI4TS
AI4MH
43
10
0
16 Mar 2023
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
Han Zheng
Xufang Luo
Pengfei Wei
Xuan Song
Dongsheng Li
Jing Jiang
OffRL
OnRL
18
21
0
14 Mar 2023
Evolving Populations of Diverse RL Agents with MAP-Elites
Thomas Pierrot
Arthur Flajolet
35
8
0
09 Mar 2023
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Taisuke Kobayashi
53
3
0
08 Mar 2023
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting
Hongyao Tang
Hao Fei
Jianye Hao
28
1
0
02 Mar 2023
Hallucinated Adversarial Control for Conservative Offline Policy Evaluation
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
OffRL
21
10
0
02 Mar 2023
A Variational Approach to Mutual Information-Based Coordination for Multi-Agent Reinforcement Learning
Woojun Kim
Whiyoung Jung
Myungsik Cho
Youngchul Sung
32
7
0
01 Mar 2023
Human-Inspired Framework to Accelerate Reinforcement Learning
Ali Beikmohammadi
Sindri Magnússon
OffRL
29
4
0
28 Feb 2023
The In-Sample Softmax for Offline Reinforcement Learning
Chenjun Xiao
Han Wang
Yangchen Pan
Adam White
Martha White
OffRL
29
26
0
28 Feb 2023
High-Precise Robot Arm Manipulation based on Online Iterative Learning and Forward Simulation with Positioning Error Below End-Effector Physical Minimum Displacement
Weiming Qu
Tianlin Liu
D. Luo
21
2
0
26 Feb 2023
Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation
Zhifa Ke
Junyu Zhang
Zaiwen Wen
24
0
0
25 Feb 2023
Neural Laplace Control for Continuous-time Delayed Systems
Samuel Holt
Alihan Huyuk
Zhaozhi Qian
Hao Sun
M. Schaar
OffRL
34
10
0
24 Feb 2023
Why Target Networks Stabilise Temporal Difference Methods
Matt Fellows
Matthew Smith
Shimon Whiteson
OOD
AAML
21
7
0
24 Feb 2023
A Supervisory Learning Control Framework for Autonomous & Real-time Task Planning for an Underactuated Cooperative Robotic task
Sander De Witte
Tom Lefebvre
Thijs Van Hauwermeiren
Guillaume Crevecoeur
21
0
0
22 Feb 2023
Behavior Proximal Policy Optimization
Zifeng Zhuang
Kun Lei
Jinxin Liu
Donglin Wang
Yilang Guo
OffRL
30
34
0
22 Feb 2023
Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement Learning
Jiong Li
Pratik Gajane
37
4
0
21 Feb 2023
Improving Deep Policy Gradients with Value Function Search
Enrico Marchesini
Chris Amato
26
9
0
20 Feb 2023
Price of Anarchy in a Double-Sided Critical Distribution System
David Sychrovský
Jakub Cerny
Sylvain Lichau
M. Loebl
23
1
0
20 Feb 2023
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
Tao Huang
Kai-xiang Chen
Bin Li
Yunhui Liu
Qingxu Dou
37
23
0
20 Feb 2023
Stochastic Generative Flow Networks
L. Pan
Dinghuai Zhang
Moksh Jain
Longbo Huang
Yoshua Bengio
BDL
49
31
0
19 Feb 2023
Swapped goal-conditioned offline reinforcement learning
Wenyan Yang
Huiling Wang
Dingding Cai
Joni Pajarinen
Joni-Kristian Kämäräinen
OffRL
OnRL
36
1
0
17 Feb 2023
Learning a model is paramount for sample efficiency in reinforcement learning control of PDEs
Stefan Werner
Sebastian Peitz
41
9
0
14 Feb 2023
Time-attenuating Twin Delayed DDPG Reinforcement Learning for Trajectory Tracking Control of Quadrotors
Boyuan Deng
Jian Sun
Zhuo Li
G. Wang
35
0
0
13 Feb 2023
Learning Interaction-aware Motion Prediction Model for Decision-making in Autonomous Driving
Zhiyu Huang
Haochen Liu
Jingda Wu
Wenhui Huang
Chen Lv
33
17
0
08 Feb 2023
Efficient Online Reinforcement Learning with Offline Data
Philip J. Ball
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
OnRL
42
163
0
06 Feb 2023
A Strong Baseline for Batch Imitation Learning
Matthew Smith
Lucas Maystre
Zhenwen Dai
K. Ciosek
OffRL
25
4
0
06 Feb 2023
Open Problems and Modern Solutions for Deep Reinforcement Learning
Weiqin Chen
OffRL
23
0
0
05 Feb 2023
Reinforcing User Retention in a Billion Scale Short Video Recommender System
Qingpeng Cai
Shuchang Liu
Xueliang Wang
Tianyou Zuo
Wentao Xie
Bin Yang
Dong Zheng
Peng Jiang
Kun Gai
OffRL
33
41
0
03 Feb 2023
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Haichao Zhang
Weiwen Xu
Haonan Yu
CLL
OffRL
OnRL
42
62
0
02 Feb 2023
Distillation Policy Optimization
Jianfei Ma
OffRL
26
1
0
01 Feb 2023
Anti-Exploration by Random Network Distillation
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Sergey Kolesnikov
38
24
0
31 Jan 2023
PAC-Bayesian Soft Actor-Critic Learning
Bahareh Tasdighi
Abdullah Akgul
Manuel Haussmann
Kenny Kazimirzak Brink
M. Kandemir
34
3
0
30 Jan 2023
Automatic Intersection Management in Mixed Traffic Using Reinforcement Learning and Graph Neural Networks
Marvin Klimke
Benjamin Völz
M. Buchholz
41
13
0
30 Jan 2023