1802.09477
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
DeepCPG Policies for Robot Locomotion
Aditya M. Deshpande
Eric Hurd
A. Minai
Manish Kumar
74
9
0
25 Feb 2023
Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation
Zhifa Ke
Junyu Zhang
Zaiwen Wen
72
0
0
25 Feb 2023
Improving the Data Efficiency of Multi-Objective Quality-Diversity through Gradient Assistance and Crowding Exploration
Hannah Janmohamed
Thomas Pierrot
Antoine Cully
113
6
0
24 Feb 2023
Neural Laplace Control for Continuous-time Delayed Systems
Samuel Holt
Alihan Huyuk
Zhaozhi Qian
Hao Sun
M. Schaar
OffRL
101
10
0
24 Feb 2023
Why Target Networks Stabilise Temporal Difference Methods
Matt Fellows
Matthew Smith
Shimon Whiteson
OOD
AAML
97
8
0
24 Feb 2023
A Supervisory Learning Control Framework for Autonomous & Real-time Task Planning for an Underactuated Cooperative Robotic task
Sander De Witte
Tom Lefebvre
Thijs Van Hauwermeiren
Guillaume Crevecoeur
103
0
0
22 Feb 2023
Behavior Proximal Policy Optimization
Zifeng Zhuang
Kun Lei
Jinxin Liu
Donglin Wang
Yilang Guo
OffRL
102
37
0
22 Feb 2023
Stochastic Approximation Beyond Gradient for Signal Processing and Machine Learning
Hadrien Hendrikx
G. Fort
Eric Moulines
Hoi-To Wai
81
12
0
22 Feb 2023
Adversarial Model for Offline Reinforcement Learning
M. Bhardwaj
Tengyang Xie
Byron Boots
Nan Jiang
Ching-An Cheng
AAML
OffRL
104
29
0
21 Feb 2023
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management
Dhawal Gupta
Yinlam Chow
Aza Tulepbergenov
Mohammad Ghavamzadeh
Craig Boutilier
OffRL
49
3
0
21 Feb 2023
Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement Learning
Jiong Li
Pratik Gajane
113
5
0
21 Feb 2023
Improving Deep Policy Gradients with Value Function Search
Enrico Marchesini
Chris Amato
49
9
0
20 Feb 2023
Price of Anarchy in a Double-Sided Critical Distribution System
David Sychrovský
Jakub Cerny
Sylvain Lichau
Martin Loebl
50
1
0
20 Feb 2023
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
Tao Huang
Kai-xiang Chen
Bin Li
Yunhui Liu
Qingxu Dou
78
23
0
20 Feb 2023
Stochastic Generative Flow Networks
L. Pan
Dinghuai Zhang
Moksh Jain
Longbo Huang
Yoshua Bengio
BDL
126
34
0
19 Feb 2023
Swapped goal-conditioned offline reinforcement learning
Wenyan Yang
Huiling Wang
Dingding Cai
Joni Pajarinen
Joni-Kristian Kämäräinen
OffRL
OnRL
105
2
0
17 Feb 2023
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
Harshit S. Sikchi
Qinqing Zheng
Amy Zhang
S. Niekum
OffRL
99
29
0
16 Feb 2023
Prioritized offline Goal-swapping Experience Replay
Wenyan Yang
Joni Pajarinen
Dingding Cai
Joni Kämäräinen
OffRL
OnRL
48
0
0
15 Feb 2023
Learning a model is paramount for sample efficiency in reinforcement learning control of PDEs
Stefan Werner
Sebastian Peitz
115
9
0
14 Feb 2023
Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning
Bram Grooten
Ghada Sokar
Shibhansh Dohare
Elena Mocanu
Matthew E. Taylor
Mykola Pechenizkiy
Decebal Constantin Mocanu
64
12
0
13 Feb 2023
Time-attenuating Twin Delayed DDPG Reinforcement Learning for Trajectory Tracking Control of Quadrotors
Boyuan Deng
Jian Sun
Zhuo Li
G. Wang
117
0
0
13 Feb 2023
Robust Representation Learning by Clustering with Bisimulation Metrics for Visual Reinforcement Learning with Distractions
Qiyuan Liu
Qi Zhou
Rui Yang
Jie Wang
OffRL
OOD
503
15
0
12 Feb 2023
Digital Twin-Aided Learning for Managing Reconfigurable Intelligent Surface-Assisted, Uplink, User-Centric Cell-Free Systems
Ying-Kai Cui
Tiejun Lv
Wei Ni
Abbas Jamalipour
43
8
0
10 Feb 2023
A SWAT-based Reinforcement Learning Framework for Crop Management
Malvern Madondo
Muneeza Azmat
Kelsey L. DiPietro
R. Horesh
Michael Jacobs
Arun Bawa
Raghavan Srinivasan
Fearghal O'Donncha
14
8
0
10 Feb 2023
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals
Yue Wu
Yewen Fan
Paul Pu Liang
A. Azaria
Yuan-Fang Li
Tom Michael Mitchell
OffRL
91
53
0
09 Feb 2023
Learning Interaction-aware Motion Prediction Model for Decision-making in Autonomous Driving
Zhiyu Huang
Haochen Liu
Jingda Wu
Wenhui Huang
Chen Lv
79
18
0
08 Feb 2023
Robust Subtask Learning for Compositional Generalization
Kishor Jothimurugan
Steve Hsu
Osbert Bastani
Rajeev Alur
OffRL
71
5
0
06 Feb 2023
Efficient Online Reinforcement Learning with Offline Data
Philip J. Ball
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
OnRL
147
184
0
06 Feb 2023
A Strong Baseline for Batch Imitation Learning
Matthew Smith
Lucas Maystre
Zhenwen Dai
K. Ciosek
OffRL
85
5
0
06 Feb 2023
Open Problems and Modern Solutions for Deep Reinforcement Learning
Weiqin Chen
OffRL
112
0
0
05 Feb 2023
Reinforcing User Retention in a Billion Scale Short Video Recommender System
Qingpeng Cai
Shuchang Liu
Xueliang Wang
Tianyou Zuo
Wentao Xie
Bin Yang
Dong Zheng
Peng Jiang
Kun Gai
OffRL
80
41
0
03 Feb 2023
Better Training of GFlowNets with Local Credit and Incomplete Trajectories
L. Pan
Nikolay Malkin
Dinghuai Zhang
Yoshua Bengio
108
72
0
03 Feb 2023
Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Qing-Shan Jia
Ya Zhang
OffRL
83
20
0
03 Feb 2023
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Haichao Zhang
Weiwen Xu
Haonan Yu
CLL
OffRL
OnRL
118
69
0
02 Feb 2023
Distillation Policy Optimization
Jianfei Ma
OffRL
84
1
0
01 Feb 2023
Off-the-Grid MARL: Datasets with Baselines for Offline Multi-Agent Reinforcement Learning
Claude Formanek
Asad Jeewa
Jonathan P. Shock
Arnu Pretorius
OffRL
115
2
0
01 Feb 2023
Learning Cut Selection for Mixed-Integer Linear Programming via Hierarchical Sequence Model
Zhihai Wang
Xijun Li
Jie Wang
Yufei Kuang
Mingxuan Yuan
Jianguo Zeng
Yongdong Zhang
Feng Wu
86
42
0
01 Feb 2023
Anti-Exploration by Random Network Distillation
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Sergey Kolesnikov
83
31
0
31 Jan 2023
Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies
Qiang-qiang Wang
Robert McCarthy
David Córdova Bulens
Francisco Roldan Sanchez
Kevin McGuinness
Noel E. O'Connor
S. Redmond
OffRL
68
5
0
30 Jan 2023
PAC-Bayesian Soft Actor-Critic Learning
Bahareh Tasdighi
Abdullah Akgul
Manuel Haussmann
Kenny Kazimirzak Brink
M. Kandemir
112
4
0
30 Jan 2023
Automatic Intersection Management in Mixed Traffic Using Reinforcement Learning and Graph Neural Networks
Marvin Klimke
Benjamin Völz
M. Buchholz
79
16
0
30 Jan 2023
Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning
Jing Zhang
Chi Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
86
10
0
28 Jan 2023
Improving Behavioural Cloning with Positive Unlabeled Learning
Qiang-qiang Wang
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
Nico Gürtler
Felix Widmaier
Francisco Roldan Sanchez
S. Redmond
OffRL
OnRL
79
8
0
27 Jan 2023
Neural Episodic Control with State Abstraction
Zhuo Li
Derui Zhu
Yujing Hu
Xiaofei Xie
Lei Ma
Yan Zheng
Yan Song
Yingfeng Chen
Jianjun Zhao
OffRL
86
16
0
27 Jan 2023
Model-based Offline Reinforcement Learning with Local Misspecification
Kefan Dong
Yannis Flet-Berliac
Allen Nie
Emma Brunskill
OffRL
68
4
0
26 Jan 2023
Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout
Takuya Hiraoka
Takashi Onishi
Yoshimasa Tsuruoka
OffRL
56
0
0
26 Jan 2023
A Novel Deep Reinforcement Learning-based Approach for Enhancing Spectral Efficiency of IRS-assisted Wireless Systems
Farimehr Zohari
S. Shahabi
M. Ardebilipour
31
2
0
24 Jan 2023
Quasi-optimal Reinforcement Learning with Continuous Actions
Yuhan Li
Wenzhuo Zhou
Ruoqing Zhu
OffRL
83
5
0
21 Jan 2023
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Haoxuan Pan
Deheng Ye
Xiaoming Duan
Qiang Fu
Wei Yang
Jianping He
Mingfei Sun
OffRL
65
2
0
20 Jan 2023
Multi-Agent Interplay in a Competitive Survival Environment
Andrea Fanti
67
0
0
19 Jan 2023