1802.09477
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
72
13
0
15 Jun 2023
DenseLight: Efficient Control for Large-scale Traffic Signals with Dense Feedback
Junfan Lin
Yuying Zhu
Lingbo Liu
Yang Liu
Guanbin Li
Liang Lin
64
12
0
13 Jun 2023
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning
Siyuan Guo
Yanchao Sun
Jifeng Hu
Sili Huang
Hechang Chen
Haiyin Piao
Lichao Sun
Yi-Ju Chang
OffRL
OnRL
83
7
0
13 Jun 2023
Evolving Testing Scenario Generation Method and Intelligence Evaluation Framework for Automated Vehicles
Yining Ma
Wei Jiang
Lingtong Zhang
Junyi Chen
Hong Wang
Chen Lv
X. Wang
Lu Xiong
76
2
0
12 Jun 2023
High-speed Autonomous Racing using Trajectory-aided Deep Reinforcement Learning
B. D. Evans
H. Engelbrecht
H. W. Jordaan
82
21
0
12 Jun 2023
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
Kai-Wen Zhao
Yi-An Ma
Jianye Hao
Jinyi Liu
Yan Zheng
Zhaopeng Meng
OffRL
OnRL
109
12
0
12 Jun 2023
PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm
Wensong Bai
Chao Zhang
Yichao Fu
Lingwei Peng
Hui Qian
Bin Dai
73
1
0
11 Jun 2023
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
Yuhang Ran
Yi-Chen Li
Fuxiang Zhang
Zongzhang Zhang
Yang Yu
OffRL
89
27
0
11 Jun 2023
HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach
Shixi Lian
Yi-An Ma
Jinyi Liu
Yan Zheng
Zhaopeng Meng
OffRL
57
1
0
10 Jun 2023
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel
Kaixin Wang
Uri Gadot
Navdeep Kumar
Kfir Y. Levy
Shie Mannor
87
4
0
09 Jun 2023
Iteratively Refined Behavior Regularization for Offline Reinforcement Learning
Xiao Hu
Yi-An Ma
Chenjun Xiao
Yan Zheng
Zhaopeng Meng
OffRL
65
4
0
09 Jun 2023
Decoupled Prioritized Resampling for Offline RL
Yang Yue
Bingyi Kang
Xiao Ma
Qisen Yang
Gao Huang
S. Song
Shuicheng Yan
OffRL
88
8
0
08 Jun 2023
Decision S4: Efficient Sequence-Based RL via State Spaces Layers
Shmuel Bar-David
Itamar Zimerman
Eliya Nachmani
Lior Wolf
OffRL
109
28
0
08 Jun 2023
A Transferability Metric Using Scene Similarity and Local Map Observation for DRL Navigation
Shiwei Lian
Feitian Zhang
44
2
0
08 Jun 2023
Goal-conditioned GFlowNets for Controllable Multi-Objective Molecular Design
Julien Roy
Pierre-Luc Bacon
C. Pal
Emmanuel Bengio
AI4CE
75
18
0
07 Jun 2023
Dual policy as self-model for planning
J. Yoo
Fernanda De La Torre
G. R. Yang
32
1
0
07 Jun 2023
Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL
Peng Cheng
Xianyuan Zhan
Zhihao Wu
Wenjia Zhang
Shoucheng Song
Han Wang
Youfang Lin
Li Jiang
OffRL
137
10
0
07 Jun 2023
Multi-Agent Reinforcement Learning for Cooperative Air Transportation Services in City-Wide Autonomous Urban Air Mobility
C. Park
Gyusun Kim
Soohyun Park
Soyi Jung
Joongheon Kim
51
24
0
07 Jun 2023
Reinforcement Learning-Based Control of CrazyFlie 2.X Quadrotor
Arshad Javeed
Valentín López Jiménez
90
1
0
06 Jun 2023
Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning
Jan Kaiser
Chenran Xu
Annika Eichler
Andrea Santamaria Garcia
O. Stein
...
H. Dinter
F. Mayet
T. Vinatier
F. Burkart
H. Schlarb
OffRL
53
4
0
06 Jun 2023
Mildly Constrained Evaluation Policy for Offline Reinforcement Learning
Linjie Xu
Zhengyao Jiang
Jinyu Wang
Lei Song
Jiang Bian
OffRL
94
0
0
06 Jun 2023
RLtools: A Fast, Portable Deep Reinforcement Learning Library for Continuous Control
Jonas Eschmann
Dario Albani
Giuseppe Loianno
OffRL
109
5
0
06 Jun 2023
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic
Tianying Ji
Yuping Luo
Gang Hua
Xianyuan Zhan
Jianwei Zhang
Huazhe Xu
OffRL
OnRL
108
17
0
05 Jun 2023
For SALE: State-Action Representation Learning for Deep Reinforcement Learning
Scott Fujimoto
Wei-Di Chang
Edward James Smith
S. Gu
Doina Precup
David Meger
OffRL
95
55
0
04 Jun 2023
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages
Andrew Jesson
Chris Xiaoxuan Lu
Gunshi Gupta
Angelos Filos
Jakob N. Foerster
Y. Gal
OffRL
77
8
0
02 Jun 2023
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding
Alizée Pace
Hugo Yèche
Bernhard Schölkopf
Gunnar Rätsch
Guy Tennenholtz
OffRL
62
7
0
01 Jun 2023
Improving and Benchmarking Offline Reinforcement Learning Algorithms
Bingyi Kang
Xiao Ma
Yi-Ren Wang
Yang Yue
Shuicheng Yan
OffRL
67
9
0
01 Jun 2023
TorchRL: A data-driven decision-making library for PyTorch
Albert Bou
Matteo Bettini
Sebastian Dittert
Vikash Kumar
Shagun Sodhani
Xiaomeng Yang
Gianni De Fabritiis
Vincent Moens
OffRL
AI4CE
126
41
0
01 Jun 2023
Efficient Diffusion Policies for Offline Reinforcement Learning
Bingyi Kang
Xiao Ma
Chao Du
Tianyu Pang
Shuicheng Yan
OffRL
174
81
0
31 May 2023
Symmetry-Aware Robot Design with Structured Subgroups
Heng Dong
Junyu Zhang
Tonghan Wang
Chongjie Zhang
58
12
0
31 May 2023
NetHack is Hard to Hack
Ulyana Piterbarg
Lerrel Pinto
Rob Fergus
60
7
0
30 May 2023
Subequivariant Graph Reinforcement Learning in 3D Environments
Runfa Chen
Jiaqi Han
Gang Hua
Wen-bing Huang
OffRL
76
11
0
30 May 2023
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration
Zhihan Liu
Miao Lu
Wei Xiong
Han Zhong
Haotian Hu
Shenao Zhang
Sirui Zheng
Zhuoran Yang
Zhaoran Wang
OffRL
124
22
0
29 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
66
9
0
29 May 2023
Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach
Yudi Zhang
Yali Du
Erdun Gao
Ziyan Wang
Jun Wang
Meng Fang
Mykola Pechenizkiy
CML
107
18
0
28 May 2023
Cross-Domain Policy Adaptation via Value-Guided Data Filtering
Kang Xu
Chenjia Bai
Xiaoteng Ma
Dong Wang
Bingyan Zhao
Zhen Wang
Xuelong Li
Wei Li
94
18
0
28 May 2023
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
89
1
0
28 May 2023
Probing reaction channels via reinforcement learning
Senwei Liang
Aditya Singh
Yuanran Zhu
David T. Limmer
Chao Yang
57
6
0
27 May 2023
Learning from Integral Losses in Physics Informed Neural Networks
Ehsan Saleh
Saba Ghaffari
Timothy Bretl
Luke N. Olson
Matthew West
PINN
AI4CE
79
4
0
27 May 2023
Self-Supervised Reinforcement Learning that Transfers using Random Features
Boyuan Chen
Chuning Zhu
Pulkit Agrawal
Kai Zhang
Abhishek Gupta
OffRL
SSL
84
9
0
26 May 2023
Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with GFlowNets
Dinghuai Zhang
H. Dai
Nikolay Malkin
Aaron Courville
Yoshua Bengio
L. Pan
122
37
0
26 May 2023
Emergent Agentic Transformer from Chain of Hindsight Experience
Hao Liu
Pieter Abbeel
OffRL
93
29
0
26 May 2023
Counterfactual Explainer Framework for Deep Reinforcement Learning Models Using Policy Distillation
Amir Samadi
K. Koufos
Kurt Debattista
M. Dianati
OffRL
60
3
0
25 May 2023
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Ya Zhang
OffRL
OnRL
97
19
0
25 May 2023
ChemGymRL: An Interactive Framework for Reinforcement Learning for Digital Chemistry
Chris Beeler
Sriram Ganapathi Subramanian
Kyle Sprague
Nouha Chatti
C. Bellinger
...
Amanuel Dawit
Zihan Yang
Xinkai Li
Mark Crowley
Isaac Tamblyn
OffRL
77
6
0
23 May 2023
Proximal Policy Gradient Arborescence for Quality Diversity Reinforcement Learning
Sumeet Batra
Bryon Tjanaka
Matthew C. Fontaine
Aleksei Petrenko
Stefanos Nikolaidis
Gaurav Sukhatme
OffRL
100
17
0
23 May 2023
Policy Representation via Diffusion Probability Model for Reinforcement Learning
Long Yang
Zhixiong Huang
Fenghao Lei
Yucun Zhong
Yiming Yang
Cong Fang
Shiting Wen
Binbin Zhou
Zhouchen Lin
DiffM
111
53
0
22 May 2023
Off-Policy Average Reward Actor-Critic with Deterministic Policy Search
Naman Saxena
Subhojyoti Khastagir
Shishir Kolathaya
S. Bhatnagar
OffRL
59
8
0
20 May 2023
Attacks on Online Learners: a Teacher-Student Analysis
R. Margiotta
Sebastian Goldt
G. Sanguinetti
AAML
79
1
0
18 May 2023
Deep Metric Tensor Regularized Policy Gradient
Gang Chen
Victoria Huang
78
0
0
18 May 2023