1802.09477
Cited By
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
Herke van Hoof
David Meger
OffRL
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 833 papers shown
Title
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks
Yun Qu
Boyuan Wang
Jianzhun Shao
Yuhang Jiang
Chen Chen
...
Qiang Fu
Wei Yang
Guang Yang
Lanxiao Huang
Xiangyang Ji
OffRL
54
9
0
20 Aug 2024
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
95
3
0
20 Aug 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
46
6
0
06 Aug 2024
Faster Model Predictive Control via Self-Supervised Initialization Learning
Zhaoxin Li
Letian Chen
Rohan R. Paleja
S. Nageshrao
Matthew C. Gombolay
178
1
0
06 Aug 2024
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
Seyeon Kim
Joonhun Lee
Namhoon Cho
Sungjun Han
Seungeon Baek
47
0
0
05 Aug 2024
Coordinating Planning and Tracking in Layered Control Policies via Actor-Critic Learning
Fengjun Yang
Nikolai Matni
OffRL
31
0
0
03 Aug 2024
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
SyDa
SSL
OnRL
62
8
0
02 Aug 2024
Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse
Felix Kleuker
Márton Fejér
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
OffRL
AI4CE
42
1
0
26 Jul 2024
The Impact of Quantization and Pruning on Deep Reinforcement Learning Models
Heng Lu
Mehdi Alemi
Reza Rawassizadeh
42
1
0
05 Jul 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
62
16
0
05 Jul 2024
To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning
Tao Ma
Xuzhi Yang
Zoltan Szabo
OffRL
73
0
0
01 Jul 2024
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators
Ori Linial
Guy Tennenholtz
Uri Shalit
OffRL
48
1
0
30 Jun 2024
3D Operation of Autonomous Excavator based on Reinforcement Learning through Independent Reward for Individual Joints
Yoonkyu Yoo
Donghwi Jung
Seong-Woo Kim
29
0
0
28 Jun 2024
Tolerance of Reinforcement Learning Controllers against Deviations in Cyber Physical Systems
Changjian Zhang
Parv Kapoor
Eunsuk Kang
Romulo Meira-Goes
David Garlan
Akila Ganlath
Shatadal Mishra
N. Ammar
42
0
0
24 Jun 2024
Learning Autonomous Race Driving with Action Mapping Reinforcement Learning
Yuanda Wang
Xin Yuan
Changyin Sun
42
1
0
21 Jun 2024
An Imitative Reinforcement Learning Framework for Autonomous Dogfight
Siyuan Li
Rongchang Zuo
Peng Liu
Yingnan Zhao
43
1
0
17 Jun 2024
Robust Deep Reinforcement Learning against Adversarial Behavior Manipulation
Shojiro Yamabe
Kazuto Fukuchi
Jun Sakuma
AAML
65
0
0
06 Jun 2024
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning
Yu Zhang
Rui Yu
Zhipeng Yao
Wenyuan Zhang
Jun Wang
Liming Zhang
OffRL
55
0
0
05 Jun 2024
Value Improved Actor Critic Algorithms
Yaniv Oren
Moritz A. Zanger
Pascal R. van der Vaart
M. Spaan
Wendelin Bohmer
OffRL
33
0
0
03 Jun 2024
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Linjiajie Fang
Ruoxue Liu
Jing Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
56
3
0
31 May 2024
Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Michal Nauman
M. Ostaszewski
Krzysztof Jankowski
Piotr Miłoś
Marek Cygan
OffRL
49
17
0
25 May 2024
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate
Fan Luo
Zuolin Tu
Zefang Huang
Yang Yu
OffRL
40
0
0
24 May 2024
Model-free reinforcement learning with noisy actions for automated experimental control in optics
Lea Richtmann
Viktoria-S. Schmiesing
Dennis Wilken
Jan Heine
Aaron Tranter
Avishek Anand
Tobias J. Osborne
M. Heurs
33
2
0
24 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
42
2
0
23 May 2024
Near-Field Spot Beamfocusing: A Correlation-Aware Transfer Learning Approach
Mohammad Amir Fallah
M. Monemi
Mehdi Rasti
Matti Latva-aho
21
1
0
21 May 2024
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
42
3
0
20 May 2024
Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses
Thanh Nguyen
Tung M. Luu
Tri Ton
Chang D. Yoo
OffRL
AAML
36
0
0
18 May 2024
NaviSlim: Adaptive Context-Aware Navigation and Sensing via Dynamic Slimmable Networks
Timothy K Johnsen
Marco Levorato
46
1
0
16 May 2024
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning
Changhong Wang
Xudong Yu
Chenjia Bai
Qiaosheng Zhang
Zhen Wang
40
1
0
12 May 2024
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. D'Oro
Evgenii Nikishin
Rameswar Panda
44
1
0
07 May 2024
Genetic Drift Regularization: on preventing Actor Injection from breaking Evolution Strategies
Paul Templier
Emmanuel Rachelson
Antoine Cully
Dennis G. Wilson
29
0
0
07 May 2024
An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks
Zhifa Ke
Zaiwen Wen
Junyu Zhang
37
0
0
07 May 2024
Enhancing Q-Learning with Large Language Model Heuristics
Xiefeng Wu
LRM
32
0
0
06 May 2024
CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics
David Valencia
Henry Williams
Trevor Gee
Bruce A. MacDonald
Minas V. Liarokapis
OffRL
40
2
0
04 May 2024
Learning Optimal Deterministic Policies with Stochastic Policy Gradients
Alessandro Montenegro
Marco Mussi
Alberto Maria Metelli
Matteo Papini
48
2
0
03 May 2024
Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning
Chengqian Gao
William de Vazelhes
Hualin Zhang
Bin Gu
Zhiqiang Xu
54
0
0
02 May 2024
A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
Bram De Cooman
Johan A. K. Suykens
40
0
0
25 Apr 2024
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
Lang Qin
Ziming Wang
Runhao Jiang
Rui Yan
Huajin Tang
40
0
0
24 Apr 2024
Cache-Aware Reinforcement Learning in Large-Scale Recommender Systems
Xiaoshuang Chen
Gengrui Zhang
Yao Wang
Yulin Wu
Shuo Su
Kaiqiao Zhan
Ben Wang
OffRL
22
2
0
23 Apr 2024
Towards Multi-Morphology Controllers with Diversity and Knowledge Distillation
Alican Mertan
Nick Cheney
34
0
0
22 Apr 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
36
6
0
22 Apr 2024
Explicit Lipschitz Value Estimation Enhances Policy Robustness Against Perturbation
Xulin Chen
Ruipeng Liu
Garret E. Katz
46
0
0
22 Apr 2024
Developing An Attention-Based Ensemble Learning Framework for Financial Portfolio Optimisation
Zhenglong Li
Vincent Tam
37
0
0
13 Apr 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
53
6
0
09 Apr 2024
Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning
Bo Lin
Yangzheng Zhong
Weiqing Ren
30
0
0
08 Apr 2024
Model-based Reinforcement Learning for Parameterized Action Spaces
Renhao Zhang
Haotian Fu
Yilin Miao
George Konidaris
31
3
0
03 Apr 2024
Learning to Control Camera Exposure via Reinforcement Learning
Kyunghyun Lee
Ukcheol Shin
Byeong-uk Lee
28
2
0
02 Apr 2024
Entity-Centric Reinforcement Learning for Object Manipulation from Pixels
Dan Haramati
Tal Daniel
Aviv Tamar
LM&Ro
OffRL
OCL
45
11
0
01 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
32
0
0
31 Mar 2024
Closed-form congestion control via deep symbolic regression
Jean Martins
Igor Almeida
Ricardo Souza
Silvia Lins
22
0
0
28 Mar 2024