Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Jianye Hao
Zhuoran Yang
Bin Zhao
Zhen Wang
Xuelong Li
OffRL
84
9
0
30 Apr 2024
Evaluating Collaborative Autonomy in Opposed Environments using Maritime Capture-the-Flag Competitions
Jordan Beason
Michael Novitzky
John Kliem
Tyler Errico
Zachary Serlin
...
Michael R. Benjamin
Prithviraj Dasgupta
Peter Crowley
Charles O'Donnell
John James
69
2
0
25 Apr 2024
DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks
Tongzhou Mu
Minghua Liu
Hao Su
OffRL
88
4
0
25 Apr 2024
Offline Reinforcement Learning with Behavioral Supervisor Tuning
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
67
2
0
25 Apr 2024
A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
Bram De Cooman
Johan A. K. Suykens
98
0
0
25 Apr 2024
AFU: Actor-Free critic Updates in off-policy RL for continuous control
Nicolas Perrin-Gilbert
OffRL
108
0
0
24 Apr 2024
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
Lang Qin
Ziming Wang
Runhao Jiang
Rui Yan
Huajin Tang
68
1
0
24 Apr 2024
Cache-Aware Reinforcement Learning in Large-Scale Recommender Systems
Xiaoshuang Chen
Gengrui Zhang
Yao Wang
Yulin Wu
Shuo Su
Kaiqiao Zhan
Ben Wang
OffRL
83
2
0
23 Apr 2024
Towards Multi-Morphology Controllers with Diversity and Knowledge Distillation
Alican Mertan
Nick Cheney
104
0
0
22 Apr 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
104
9
0
22 Apr 2024
Explicit Lipschitz Value Estimation Enhances Policy Robustness Against Perturbation
Xulin Chen
Ruipeng Liu
Garret E. Katz
74
0
0
22 Apr 2024
Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation
Qiang He
Dinesh Manocha
Meng Fang
S. Maghsudi
93
3
0
19 Apr 2024
Learning to Cut via Hierarchical Sequence/Set Model for Efficient Mixed-Integer Programming
Jie Wang
Zhihai Wang
Xijun Li
Yufei Kuang
Zhihao Shi
Fangzhou Zhu
Mingxuan Yuan
Jianguo Zeng
Yongdong Zhang
Feng Wu
83
8
0
19 Apr 2024
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents
Chen Gong
Kecen Li
Jin Yao
Tianhao Wang
OnRL
75
1
0
18 Apr 2024
Trajectory Planning for Autonomous Vehicle Using Iterative Reward Prediction in Reinforcement Learning
Hyunwoo Park
70
0
0
18 Apr 2024
Actor-Critic Reinforcement Learning with Phased Actor
Ruofan Wu
Junmin Zhong
Jennie Si
39
0
0
18 Apr 2024
Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay
Jinmei Liu
Wenbin Li
Xiangyu Yue
Shilin Zhang
Chunlin Chen
Zhi Wang
OffRL
DiffM
75
6
0
16 Apr 2024
Continuous Control Reinforcement Learning: Distributed Distributional DrQ Algorithms
Zehao Zhou
OffRL
31
0
0
16 Apr 2024
Offline Trajectory Generalization for Offline Reinforcement Learning
Ziqi Zhao
Zhaochun Ren
Liu Yang
Fajie Yuan
Pengjie Ren
Zhumin Chen
Jun Ma
Xin Xin
OffRL
75
1
0
16 Apr 2024
Developing An Attention-Based Ensemble Learning Framework for Financial Portfolio Optimisation
Zhenglong Li
Vincent Tam
95
1
0
13 Apr 2024
Enhancing Policy Gradient with the Polyak Step-Size Adaption
Yunxiang Li
Rui Yuan
Chen Fan
Mark Schmidt
Samuel Horváth
Robert Mansel Gower
Martin Takávc
72
0
0
11 Apr 2024
Generative Probabilistic Planning for Optimizing Supply Chain Networks
Hyung-il Ahn
Santiago Olivar
Hershel Mehta
Young Chol Song
61
0
0
11 Apr 2024
UAV-enabled Collaborative Beamforming via Multi-Agent Deep Reinforcement Learning
Saichao Liu
Geng Sun
Jiahui Li
Shuang Liang
Qingqing Wu
Pengfei Wang
Dusit Niyato
81
6
0
11 Apr 2024
Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection
L. Nasvytis
Kai Sandbrink
Jakob N. Foerster
Tim Franzmeyer
Christian Schroeder de Witt
OffRL
OODD
51
9
0
10 Apr 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
121
8
0
09 Apr 2024
Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning
Bo Lin
Yangzheng Zhong
Weiqing Ren
50
0
0
08 Apr 2024
IA2: Leveraging Instance-Aware Index Advisor with Reinforcement Learning for Diverse Workloads
Taiyi Wang
Eiko Yoneki
53
2
0
08 Apr 2024
Model-based Reinforcement Learning for Parameterized Action Spaces
Renhao Zhang
Haotian Fu
Yilin Miao
George Konidaris
75
3
0
03 Apr 2024
AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based Dataset
Dongsu Lee
Chanin Eom
Minhae Kwon
GP
OffRL
43
9
0
03 Apr 2024
Imitation Game: A Model-based and Imitation Learning Deep Reinforcement Learning Hybrid
Eric M. S. P. Veith
Torben Logemann
Aleksandr Berezin
Arlena Wellßow
Stephan Balduin
66
2
0
02 Apr 2024
Learning to Control Camera Exposure via Reinforcement Learning
Kyunghyun Lee
Ukcheol Shin
Byeong-uk Lee
75
4
0
02 Apr 2024
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Ya-Chien Chang
Sicun Gao
123
0
0
02 Apr 2024
Entity-Centric Reinforcement Learning for Object Manipulation from Pixels
Dan Haramati
Tal Daniel
Aviv Tamar
LM&Ro
OffRL
OCL
88
14
0
01 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
90
0
0
31 Mar 2024
Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning
Duzhen Zhang
Qingyu Wang
Tielin Zhang
Bo Xu
324
2
0
29 Mar 2024
Closed-form congestion control via deep symbolic regression
Jean Martins
Igor Almeida
Ricardo Souza
Silvia Lins
27
0
0
28 Mar 2024
Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems
Siyu Wang
Xiaocong Chen
Lina Yao
OffRL
89
2
0
26 Mar 2024
Exploring CausalWorld: Enhancing robotic manipulation via knowledge transfer and curriculum learning
Xinrui Wang
Yan Jin
100
2
0
25 Mar 2024
Parametric PDE Control with Deep Reinforcement Learning and Differentiable L0-Sparse Polynomial Policies
N. Botteghi
Urban Fasel
AI4CE
103
6
0
22 Mar 2024
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation, Transferable Reward Recovery and Algebraic Equilibrium Proof
Yangchun Zhang
Qiang Liu
Weiming Li
Yirui Zhou
76
0
0
21 Mar 2024
POLICEd RL: Learning Closed-Loop Robot Control Policies with Provable Satisfaction of Hard Constraints
Jean-Baptiste Bouvier
Kartik Nagpal
Negar Mehr
87
5
0
20 Mar 2024
Simple Ingredients for Offline Reinforcement Learning
Edoardo Cetin
Andrea Tirinzoni
Matteo Pirotta
A. Lazaric
Yann Ollivier
Ahmed Touati
OffRL
106
2
0
19 Mar 2024
FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal Footstep Planning and Forecasting
Clément Gaspard
G. Passault
Mélodie Daniel
Olivier Ly
42
1
0
19 Mar 2024
Phasic Diversity Optimization for Population-Based Reinforcement Learning
Jingcheng Jiang
Haiyin Piao
Yu Fu
Yihang Hao
Chuanlu Jiang
Ziqi Wei
Xin Yang
60
0
0
17 Mar 2024
Scheduling Drone and Mobile Charger via Hybrid-Action Deep Reinforcement Learning
Jizhe Dou
Haotian Zhang
Guodong Sun
89
0
0
16 Mar 2024
Stimulate the Potential of Robots via Competition
K. Huang
Di Guo
Xinyu Zhang
Xiangyang Ji
Huaping Liu
95
3
0
15 Mar 2024
Offline Goal-Conditioned Reinforcement Learning for Shape Control of Deformable Linear Objects
Rita Laezza
Mohammadreza Shetab-Bushehri
Gabriel Arslan Waltersson
Erol Özgür
Y. Mezouar
Y. Karayiannidis
OffRL
80
1
0
15 Mar 2024
Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics
Luca Grillotti
Maxence Faldor
Borja G. León
Antoine Cully
118
3
0
15 Mar 2024
Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning
Motoki Omura
Takayuki Osa
Yusuke Mukuta
Tatsuya Harada
OffRL
46
0
0
12 Mar 2024
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective
Yunpeng Qing
Shunyu Liu
Jingyuan Cong
Kaixuan Chen
Yihe Zhou
Mingli Song
OffRL
110
5
0
12 Mar 2024
Previous
1
2
3
...
9
10
11
...
42
43
44
Next