Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 849 papers shown
Title
Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning
Jing Zhang
Chi Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
35
7
0
28 Jan 2023
Neural Episodic Control with State Abstraction
Zhuo Li
Derui Zhu
Yujing Hu
Xiaofei Xie
Lei Ma
Yan Zheng
Yan Song
Yingfeng Chen
Jianjun Zhao
OffRL
26
14
0
27 Jan 2023
Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout
Takuya Hiraoka
Takashi Onishi
Yoshimasa Tsuruoka
OffRL
29
0
0
26 Jan 2023
Multi-Agent Interplay in a Competitive Survival Environment
Andrea Fanti
23
0
0
19 Jan 2023
A reinforcement learning path planning approach for range-only underwater target localization with autonomous vehicles
Ivan Masmitja
Mario Martin
K. Katija
S. Gomáriz
J. Navarro
24
5
0
17 Jan 2023
Deep Reinforcement Learning for Autonomous Ground Vehicle Exploration Without A-Priori Maps
Shathushan Sivashangaran
A. Eskandarian
32
4
0
10 Jan 2023
Hint assisted reinforcement learning: an application in radio astronomy
S. Yatawatta
30
1
0
10 Jan 2023
Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework
Zongwei Liu
Yonghong Song
Yuanlin Zhang
OffRL
35
2
0
10 Jan 2023
Network Slicing via Transfer Learning aided Distributed Deep Reinforcement Learning
Tianlun Hu
Qi Liao
Qian Liu
Georg Carle
OffRL
30
8
0
09 Jan 2023
Offline Policy Optimization in RL with Variance Regularizaton
Riashat Islam
Samarth Sinha
Homanga Bharadhwaj
Samin Yeasar Arnob
Zhuoran Yang
Animesh Garg
Zhaoran Wang
Lihong Li
Doina Precup
OffRL
28
0
0
29 Dec 2022
Novel Reinforcement Learning Algorithm for Suppressing Synchronization in Closed Loop Deep Brain Stimulators
Harshali Agarwal
Heena Rathore
32
3
0
25 Dec 2022
Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Yiren Lu
Justin Fu
George Tucker
Xinlei Pan
Eli Bronstein
...
Brandyn White
Aleksandra Faust
Shimon Whiteson
Drago Anguelov
Sergey Levine
OffRL
31
93
0
21 Dec 2022
Collision probability reduction method for tracking control in automatic docking / berthing using reinforcement learning
Kouki Wakita
Youhei Akimoto
D. M. Rachman
Yoshiki Miyauchi
Umeda Naoya
A. Maki
21
8
0
13 Dec 2022
Evaluating Model-free Reinforcement Learning toward Safety-critical Tasks
Linrui Zhang
Qin Zhang
Li Shen
Bo Yuan
Xueqian Wang
Dacheng Tao
OffRL
53
26
0
12 Dec 2022
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
Nicklas Hansen
Yixin Lin
H. Su
Xiaolong Wang
Vikash Kumar
Aravind Rajeswaran
OffRL
32
49
0
12 Dec 2022
Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks
Altun Rzayev
Vahid Tavakol Aghaei
OffRL
23
0
0
11 Dec 2022
Tight Performance Guarantees of Imitator Policies with Continuous Actions
Davide Maran
Alberto Maria Metelli
Marcello Restelli
OffRL
28
4
0
07 Dec 2022
Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble
Chong Li
OffRL
32
0
0
07 Dec 2022
Scalable Planning and Learning Framework Development for Swarm-to-Swarm Engagement Problems
Umut Demir
A. S. Satir
Gülay Goktas
Cansu Yikilmaz
N. K. Üre
26
1
0
06 Dec 2022
Selecting Mechanical Parameters of a Monopode Jumping System with Reinforcement Learning
Andrew S. Albright
J. Vaughan
16
1
0
02 Dec 2022
Launchpad: Learning to Schedule Using Offline and Online RL Methods
V. Venkataswamy
J. E. Grigsby
A. Grimshaw
Yanjun Qi
OffRL
OnRL
24
1
0
01 Dec 2022
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
33
13
0
01 Dec 2022
Funnel-based Reward Shaping for Signal Temporal Logic Tasks in Reinforcement Learning
Naman Saxena
Sandeep Gorantla
Pushpak Jagtap
42
4
0
30 Nov 2022
Real-time Bidding Strategy in Display Advertising: An Empirical Analysis
Mengjuan Liu
Zhengning Hu
Zhi Lai
Daiwei Zheng
Xuyun Nie
24
2
0
30 Nov 2022
Is Conditional Generative Modeling all you need for Decision-Making?
Anurag Ajay
Yilun Du
Abhi Gupta
J. Tenenbaum
Tommi Jaakkola
Pulkit Agrawal
DiffM
71
365
0
28 Nov 2022
Hypernetworks for Zero-shot Transfer in Reinforcement Learning
S. Rezaei-Shoshtari
Charlotte Morissette
F. Hogan
Gregory Dudek
David Meger
OffRL
17
14
0
28 Nov 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
41
0
0
23 Nov 2022
Masked Autoencoding for Scalable and Generalizable Decision Making
Fangchen Liu
Hao Liu
Aditya Grover
Pieter Abbeel
OffRL
50
45
0
23 Nov 2022
Efficient Exploration using Model-Based Quality-Diversity with Gradients
Bryan Lim
Manon Flageat
Antoine Cully
23
4
0
22 Nov 2022
Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning
Alex Beeson
Giovanni Montana
OffRL
OnRL
26
23
0
21 Nov 2022
Model-based Trajectory Stitching for Improved Offline Reinforcement Learning
Charles A. Hepburn
Giovanni Montana
OffRL
37
13
0
21 Nov 2022
Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
D. Akimov
Vladislav Kurenkov
Alexander Nikulin
Denis Tarasov
Sergey Kolesnikov
OffRL
24
9
0
20 Nov 2022
Offline Reinforcement Learning with Adaptive Behavior Regularization
Yunfan Zhou
Xijun Li
Qingyu Qu
OffRL
27
1
0
15 Nov 2022
Legged Locomotion in Challenging Terrains using Egocentric Vision
Ananye Agarwal
Ashish Kumar
Jitendra Malik
Deepak Pathak
34
207
0
14 Nov 2022
CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards global optimality
Gianluigi Grandesso
Elisa Alboni
G. P. R. Papini
Patrick M. Wensing
Andrea Del Prete
30
15
0
12 Nov 2022
When is Realizability Sufficient for Off-Policy Reinforcement Learning?
Andrea Zanette
OffRL
24
14
0
10 Nov 2022
Progress and summary of reinforcement learning on energy management of MPS-EV
Jincheng Hu
Yang Lin
Liang Chu
Zhuoran Hou
Jihan Li
Jingjing Jiang
Yuanjian Zhang
23
12
0
08 Nov 2022
Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification
Takumi Tanabe
Reimi Sato
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
OffRL
27
8
0
07 Nov 2022
On learning history based policies for controlling Markov decision processes
Gandharv Patil
Aditya Mahajan
Doina Precup
OffRL
21
5
0
06 Nov 2022
Graph Reinforcement Learning Application to Co-operative Decision-Making in Mixed Autonomy Traffic: Framework, Survey, and Challenges
Qi Liu
Xueyuan Li
Zirui Li
Jingda Wu
Guodong Du
Xinlu Gao
Fan Yang
Shihua Yuan
54
8
0
06 Nov 2022
Leveraging Fully Observable Policies for Learning under Partial Observability
Hai V. Nguyen
Andrea Baisero
Dian Wang
Chris Amato
Robert W. Platt
OffRL
30
19
0
03 Nov 2022
Causal Counterfactuals for Improving the Robustness of Reinforcement Learning
Tom He
Jasmina Gajcin
Ivana Dusparic
CML
13
5
0
02 Nov 2022
Reinforcement Learning for Solving Robotic Reaching Tasks in the Neurorobotics Platform
Márton Szep
Leander Lauenburg
Kevin Farkas
Xiyan Su
Chuanlong Zang
16
0
0
31 Oct 2022
LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight Grouping for Multi-Agent Reinforcement Learning
Jenny Yang
Jaeuk Kim
Joo-Young Kim
31
2
0
29 Oct 2022
Meta-Reinforcement Learning Using Model Parameters
G. Hartmann
A. Azaria
32
0
0
27 Oct 2022
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering
Jun Lv
Yunhai Feng
Cheng Zhang
Shu Zhao
Lin Shao
Cewu Lu
20
24
0
27 Oct 2022
Low-Rank Modular Reinforcement Learning via Muscle Synergy
Heng Dong
Tonghan Wang
Jiayuan Liu
Chongjie Zhang
63
17
0
26 Oct 2022
ERL-Re
2
^2
2
: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation
Jianye Hao
Pengyi Li
Hongyao Tang
Yan Zheng
Xian Fu
Zhaopeng Meng
29
24
0
26 Oct 2022
A Bibliometric Analysis and Review on Reinforcement Learning for Transportation Applications
Can Li
Lei Bai
L. Yao
S. Waller
Wei Liu
40
14
0
26 Oct 2022
Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data
John So
Amber Xie
Sunggoo Jung
J. Edlund
Rohan Thakker
Ali Agha-mohammadi
Pieter Abbeel
Stephen James
31
9
0
25 Oct 2022
Previous
1
2
3
...
6
7
8
...
15
16
17
Next