Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement Prioritization
L. N. Alegre
A. Bazzan
D. Roijers
Ann Nowé
Bruno C. da Silva
80
30
0
18 Jan 2023
A reinforcement learning path planning approach for range-only underwater target localization with autonomous vehicles
Ivan Masmitja
Mario Martin
K. Katija
S. Gomáriz
J. Navarro
52
6
0
17 Jan 2023
Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning
Taylor W. Killian
S. Parbhoo
Marzyeh Ghassemi
OffRL
81
7
0
13 Jan 2023
Deep Reinforcement Learning for Autonomous Ground Vehicle Exploration Without A-Priori Maps
Shathushan Sivashangaran
A. Eskandarian
60
4
0
10 Jan 2023
Hint assisted reinforcement learning: an application in radio astronomy
S. Yatawatta
143
1
0
10 Jan 2023
Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework
Zongwei Liu
Yonghong Song
Yuanlin Zhang
OffRL
77
3
0
10 Jan 2023
Network Slicing via Transfer Learning aided Distributed Deep Reinforcement Learning
Tianlun Hu
Qi Liao
Qian Liu
Georg Carle
OffRL
32
8
0
09 Jan 2023
Centralized Cooperative Exploration Policy for Continuous Control Tasks
Chong Li
Chen Gong
Qiang He
Xinwen Hou
Yu Liu
80
1
0
06 Jan 2023
Extreme Q-Learning: MaxEnt RL without Entropy
Divyansh Garg
Joey Hejna
Matthieu Geist
Stefano Ermon
OffRL
87
80
0
05 Jan 2023
Self-Activating Neural Ensembles for Continual Reinforcement Learning
Sam Powers
Eliot Xing
Abhinav Gupta
KELM
CLL
86
5
0
31 Dec 2022
Learning from Guided Play: Improving Exploration for Adversarial Imitation Learning with Simple Auxiliary Tasks
Trevor Ablett
Bryan Chan
Jonathan Kelly
129
10
0
30 Dec 2022
Transformer in Transformer as Backbone for Deep Reinforcement Learning
Hangyu Mao
Rui Zhao
Hao Chen
Jianye Hao
Yiqun Chen
Dong Li
Junge Zhang
Zhen Xiao
OffRL
93
8
0
30 Dec 2022
Offline Policy Optimization in RL with Variance Regularizaton
Riashat Islam
Samarth Sinha
Homanga Bharadhwaj
Samin Yeasar Arnob
Zhuoran Yang
Animesh Garg
Zhaoran Wang
Lihong Li
Doina Precup
OffRL
58
0
0
29 Dec 2022
Tuning Synaptic Connections instead of Weights by Genetic Algorithm in Spiking Policy Network
Duzhen Zhang
Tielin Zhang
Shuncheng Jia
Qingyu Wang
Bo Xu
OffRL
377
5
0
29 Dec 2022
Invariance to Quantile Selection in Distributional Continuous Control
Felix Grün
Muhammad Saif-ur-Rehman
Tobias Glasmachers
Ioannis Iossifidis
37
0
0
29 Dec 2022
Deep Reinforcement Learning for Wind and Energy Storage Coordination in Wholesale Energy and Ancillary Service Markets
Jinhao Li
Changlong Wang
Hao Wang
24
9
0
27 Dec 2022
Novel Reinforcement Learning Algorithm for Suppressing Synchronization in Closed Loop Deep Brain Stimulators
Harshali Agarwal
Heena Rathore
54
3
0
25 Dec 2022
Temporally Layered Architecture for Adaptive, Distributed and Continuous Control
Devdhar Patel
Joshua Russell
Frances Walsh
T. Rahman
Terrance Sejnowski
H. Siegelmann
AI4CE
126
1
0
25 Dec 2022
SHIRO: Soft Hierarchical Reinforcement Learning
Kandai Watanabe
Mathew Strong
Omer Eldar
70
1
0
24 Dec 2022
Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Yiren Lu
Justin Fu
George Tucker
Xinlei Pan
Eli Bronstein
...
Brandyn White
Aleksandra Faust
Shimon Whiteson
Drago Anguelov
Sergey Levine
OffRL
111
97
0
21 Dec 2022
Lifelong Reinforcement Learning with Modulating Masks
Eseoghene Ben-Iwhiwhu
Saptarshi Nath
Praveen K. Pilly
Soheil Kolouri
Andrea Soltoggio
CLL
OffRL
89
23
0
21 Dec 2022
Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning
Zhecheng Yuan
Zhengrong Xue
Bo Yuan
Xueqian Wang
Yi Wu
Yang Gao
Huazhe Xu
SSL
OffRL
110
74
0
17 Dec 2022
Safety Correction from Baseline: Towards the Risk-aware Policy in Robotics via Dual-agent Reinforcement Learning
Linrui Zhang
Zichen Yan
Li Shen
Shoujie Li
Xueqian Wang
Dacheng Tao
OffRL
OnRL
83
3
0
14 Dec 2022
Collision probability reduction method for tracking control in automatic docking / berthing using reinforcement learning
Kouki Wakita
Youhei Akimoto
D. M. Rachman
Yoshiki Miyauchi
Umeda Naoya
A. Maki
39
8
0
13 Dec 2022
Evaluating Model-free Reinforcement Learning toward Safety-critical Tasks
Linrui Zhang
Qin Zhang
Li Shen
Bo Yuan
Xueqian Wang
Dacheng Tao
OffRL
110
28
0
12 Dec 2022
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
Nicklas Hansen
Yixin Lin
H. Su
Xiaolong Wang
Vikash Kumar
Aravind Rajeswaran
OffRL
77
51
0
12 Dec 2022
Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks
Altun Rzayev
Vahid Tavakol Aghaei
OffRL
62
2
0
11 Dec 2022
Reinforcement Learning for Predicting Traffic Accidents
I. Cho
Praveenbalaji Rajendran
Taeyoung Kim
Dongsoo Har
43
6
0
09 Dec 2022
Model-based trajectory stitching for improved behavioural cloning and its applications
Charles A. Hepburn
Giovanni Montana
OffRL
81
7
0
08 Dec 2022
Enhanced method for reinforcement learning based dynamic obstacle avoidance by assessment of collision risk
Fabian Hart
Ostap Okhrin
89
13
0
08 Dec 2022
Tight Performance Guarantees of Imitator Policies with Continuous Actions
Davide Maran
Alberto Maria Metelli
Marcello Restelli
OffRL
79
5
0
07 Dec 2022
Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble
Chong Li
OffRL
67
1
0
07 Dec 2022
Curiosity creates Diversity in Policy Search
Paul-Antoine Le Tolguenec
Emmanuel Rachelson
Yann Besse
Dennis G. Wilson
67
2
0
07 Dec 2022
Scalable Planning and Learning Framework Development for Swarm-to-Swarm Engagement Problems
Umut Demir
A. S. Satir
Gülay Goktas
Cansu Yikilmaz
N. K. Üre
31
1
0
06 Dec 2022
PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement
Wanqi Xue
Qingpeng Cai
Zhenghai Xue
Shuo Sun
Shuchang Liu
Dong Zheng
Peng Jiang
Kun Gai
Bo An
OffRL
62
28
0
06 Dec 2022
Physics-Informed Model-Based Reinforcement Learning
Adithya Ramesh
Balaraman Ravindran
73
10
0
05 Dec 2022
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets
Yuanying Cai
Wei Shen
Li Zhao
Wei Shen
Xuyun Zhang
Lei Song
Jiang Bian
Tao Qin
Tie-Yan Liu
OffRL
54
5
0
05 Dec 2022
Selecting Mechanical Parameters of a Monopode Jumping System with Reinforcement Learning
Andrew S. Albright
J. Vaughan
75
1
0
02 Dec 2022
Launchpad: Learning to Schedule Using Offline and Online RL Methods
V. Venkataswamy
J. E. Grigsby
A. Grimshaw
Yanjun Qi
OffRL
OnRL
63
1
0
01 Dec 2022
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
74
13
0
01 Dec 2022
Funnel-based Reward Shaping for Signal Temporal Logic Tasks in Reinforcement Learning
Naman Saxena
Sandeep Gorantla
Pushpak Jagtap
102
4
0
30 Nov 2022
Real-time Bidding Strategy in Display Advertising: An Empirical Analysis
Mengjuan Liu
Zhengning Hu
Zhi Lai
Daiwei Zheng
Xuyun Nie
31
2
0
30 Nov 2022
Computationally Efficient Reinforcement Learning: Targeted Exploration leveraging Simple Rules
L. D. Natale
B. Svetozarevic
Philipp Heer
Colin N. Jones
64
0
0
30 Nov 2022
Offline Reinforcement Learning with Closed-Form Policy Improvement Operators
Jiachen Li
Edwin Zhang
Ming Yin
Qinxun Bai
Yu Wang
William Yang Wang
OffRL
98
17
0
29 Nov 2022
Is Conditional Generative Modeling all you need for Decision-Making?
Anurag Ajay
Yilun Du
Abhi Gupta
J. Tenenbaum
Tommi Jaakkola
Pulkit Agrawal
DiffM
160
408
0
28 Nov 2022
Hypernetworks for Zero-shot Transfer in Reinforcement Learning
S. Rezaei-Shoshtari
Charlotte Morissette
F. Hogan
Gregory Dudek
David Meger
OffRL
105
15
0
28 Nov 2022
Assessing Quality-Diversity Neuro-Evolution Algorithms Performance in Hard Exploration Problems
Félix Chalumeau
Thomas Pierrot
Valentin Macé
Arthur Flajolet
Karim Beguir
Antoine Cully
Nicolas Perrin-Gilbert
99
7
0
24 Nov 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
108
0
0
23 Nov 2022
Masked Autoencoding for Scalable and Generalizable Decision Making
Fangchen Liu
Hao Liu
Aditya Grover
Pieter Abbeel
OffRL
87
49
0
23 Nov 2022
Efficient Exploration using Model-Based Quality-Diversity with Gradients
Bryan Lim
Manon Flageat
Antoine Cully
61
4
0
22 Nov 2022
Previous
1
2
3
...
20
21
22
...
42
43
44
Next