1802.09477
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
Guozheng Ma
Lu Li
Zilin Wang
Li Shen
Pierre-Luc Bacon
Dacheng Tao
OffRL
12
0
0
20 Jun 2025
Off-Policy Actor-Critic for Adversarial Observation Robustness: Virtual Alternative Training via Symmetric Policy Evaluation
Kosuke Nakanishi
Akihiro Kubo
Yuji Yasui
Shin Ishii
AAML
OffRL
15
0
0
20 Jun 2025
Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation
Chenxu Wang
Yonggang Jin
Cheng Hu
Youpeng Zhao
Zipeng Dai
Jian Zhao
Shiyu Huang
Liuyu Xiang
Junge Zhang
Zhaofeng He
14
0
0
20 Jun 2025
Distribution Parameter Actor-Critic: Shifting the Agent-Environment Boundary for Diverse Action Spaces
Jiamin He
A. Rupam Mahmood
Martha White
10
0
0
19 Jun 2025
Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning
Roger Creus Castanyer
J. Obando-Ceron
Lu Li
Pierre-Luc Bacon
Glen Berseth
Aaron Courville
Pablo Samuel Castro
22
0
0
18 Jun 2025
Overcoming Overfitting in Reinforcement Learning via Gaussian Process Diffusion Policy
Amornyos Horprasert
Esa Apriaskar
Xingyu Liu
Lanlan Su
Lyudmila S. Mihaylova
20
0
0
16 Jun 2025
Flow-Based Policy for Online Reinforcement Learning
Lei Lv
Y. Li
Yu-Juan Luo
F. Sun
Tao Kong
Jiafeng Xu
Xiao Ma
16
0
0
15 Jun 2025
CIRO7.2: A Material Network with Circularity of -7.2 and Reinforcement-Learning-Controlled Robotic Disassembler
Federico Zocco
Monica Malvezzi
10
0
0
13 Jun 2025
Palpation Alters Auditory Pain Expressions with Gender-Specific Variations in Robopatients
Chapa Sirithunge
Yue Xie
Saitarun Nadipineni
Fumiya Iida
Thilina Dulantha Lalitharatne
80
0
0
13 Jun 2025
MOORL: A Framework for Integrating Offline-Online Reinforcement Learning
Gaurav Chaudhary
Wassim Uddin Mondal
Laxmidhar Behera
OffRL
93
0
0
11 Jun 2025
Wasserstein Barycenter Soft Actor-Critic
Zahra Shahrooei
Ali Baheri
OffRL
48
0
0
11 Jun 2025
Intention-Conditioned Flow Occupancy Models
Chongyi Zheng
S. Park
Sergey Levine
Benjamin Eysenbach
AI4TS
OffRL
AI4CE
34
0
0
10 Jun 2025
GPS Spoofing Attacks on AI-based Navigation Systems with Obstacle Avoidance in UAV
Ji Hyuk Jung
Mi Yeon Hong
Ji Won Yoon
AAML
29
0
0
10 Jun 2025
From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium
Xie Yi
Zhanke Zhou
Chentao Cao
Qiyu Niu
Tongliang Liu
Bo Han
23
0
0
09 Jun 2025
A Stable Whitening Optimizer for Efficient Neural Network Training
Kevin Frans
Sergey Levine
Pieter Abbeel
30
0
0
08 Jun 2025
Reliable Critics: Monotonic Improvement and Convergence Guarantees for Reinforcement Learning
Eshwar S. R.
Gugan Thoppe
Aditya Gopalan
Gal Dalal
13
0
0
08 Jun 2025
CARoL: Context-aware Adaptation for Robot Learning
Zechen Hu
Tong Xu
Xuesu Xiao
Xuan Wang
17
0
0
08 Jun 2025
Ensemble Elastic DQN: A novel multi-step ensemble approach to address overestimation in deep value-based reinforcement learning
Adrian Ly
Richard Dazeley
Peter Vamplew
F. Cruz
Sunil Aryal
24
0
0
06 Jun 2025
Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning
Motoki Omura
Kazuki Ota
Takayuki Osa
Yusuke Mukuta
Tatsuya Harada
OffRL
37
0
0
06 Jun 2025
Self-Predictive Dynamics for Generalization of Vision-based Reinforcement Learning
Kyungsoo Kim
Jeongsoo Ha
Yusung Kim
BDL
42
7
0
05 Jun 2025
When Maximum Entropy Misleads Policy Optimization
Ruipeng Zhang
Ya-Chien Chang
Sicun Gao
34
0
0
05 Jun 2025
Composing Agents to Minimize Worst-case Risk
Guruprerana Shabadi
Rajeev Alur
78
0
0
05 Jun 2025
Horizon Reduction Makes RL Scalable
Seohong Park
Kevin Frans
Deepinder Mann
Benjamin Eysenbach
Aviral Kumar
Sergey Levine
OffRL
89
0
0
04 Jun 2025
Enhancing Decision-Making of Large Language Models via Actor-Critic
Heng Dong
Kefei Duan
Chongjie Zhang
LLMAG
22
0
0
04 Jun 2025
Compositional Learning for Modular Multi-Agent Self-Organizing Networks
Qi Liao
Parijat Bhattacharjee
57
0
0
03 Jun 2025
Trajectory First: A Curriculum for Discovering Diverse Policies
Cornelius V. Braun
Sayantan Auddy
Marc Toussaint
50
0
0
02 Jun 2025
A Reinforcement Learning Approach for RIS-aided Fair Communications
Alex Pierron
Michel Barbeau
L. D. Cicco
José Rubio-Hernán
Joaquin Garcia-Alfaro
22
0
0
01 Jun 2025
Optimistic critics can empower small actors
Olya Mastikhina
Dhruv Sreenivas
Pablo Samuel Castro
46
0
0
01 Jun 2025
Comparing Traditional and Reinforcement-Learning Methods for Energy Storage Control
Elinor Ginzburg
Itay Segev
Yoash Levron
Sarah Keren
OffRL
22
0
0
31 May 2025
From Rules to Rewards: Reinforcement Learning for Interest Rate Adjustment in DeFi Lending
Hanxiao Qu
Krzysztof Gogol
Florian Groetschla
Claudio J. Tessone
OffRL
12
0
0
31 May 2025
Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control
Zijie Xu
Tong Bu
Zecheng Hao
Jianhao Ding
Zhaofei Yu
22
0
0
30 May 2025
DATD3: Depthwise Attention Twin Delayed Deep Deterministic Policy Gradient For Model Free Reinforcement Learning Under Output Feedback Control
Wuhao Wang
Zhiyong Chen
OffRL
15
0
0
29 May 2025
Diffusion Guidance Is a Controllable Policy Improvement Operator
Kevin Frans
Seohong Park
Pieter Abbeel
Sergey Levine
OffRL
62
0
0
29 May 2025
Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners
Michal Nauman
Marek Cygan
Carmelo Sferrazza
Aviral Kumar
Pieter Abbeel
OffRL
96
0
0
29 May 2025
Enhanced DACER Algorithm with High Diffusion Efficiency
Yinuo Wang
Mining Tan
Wenjun Zou
Haotian Lin
Xujie Song
...
Guojian Zhan
Tianze Zhu
Shiqi Liu
Jingliang Duan
Shengbo Eben Li
DiffM
70
0
0
29 May 2025
Learning Recommender Mechanisms for Bayesian Stochastic Games
Bengisu Guresti
Chongjie Zhang
Yevgeniy Vorobeychik
OffRL
14
0
0
29 May 2025
FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control
Younggyo Seo
Carmelo Sferrazza
Haoran Geng
Michal Nauman
Zhao-Heng Yin
Pieter Abbeel
OffRL
61
0
0
28 May 2025
Calibrated Value-Aware Model Learning with Probabilistic Environment Models
C. Voelcker
Anastasiia Pedan
Arash Ahmadian
Romina Abachi
Igor Gilitschenski
Amir-massoud Farahmand
50
0
0
28 May 2025
Hierarchical Reinforcement Learning with Uncertainty-Guided Diffusional Subgoals
V. Wang
Tinghuai Wang
Joni Pajarinen
BDL
27
0
0
27 May 2025
Situationally-Aware Dynamics Learning
Alejandro Murillo-Gonzalez
Lantao Liu
121
0
0
26 May 2025
Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network
Bingdong Li
Mei Jiang
Hong Qian
K. Tang
W. Hong
Peng Yang
127
0
0
26 May 2025
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
Leander Diaz-Bone
Marco Bagatella
Jonas Hübotter
Andreas Krause
OffRL
67
0
0
26 May 2025
Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL
Qin-Wen Luo
Ming-Kun Xie
Ye-Wen Wang
Sheng-Jun Huang
OffRL
39
0
0
26 May 2025
Deep Actor-Critics with Tight Risk Certificates
Bahareh Tasdighi
Manuel Haussmann
Yi-Shan Wu
A. Masegosa
M. Kandemir
UQCV
88
0
0
26 May 2025
Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Tao Wang
Ruipeng Zhang
Sicun Gao
OffRL
53
0
0
25 May 2025
Reduce Computational Cost In Deep Reinforcement Learning Via Randomized Policy Learning
Zhuochen Liu
Rahul Jain
Quan Nguyen
40
0
0
25 May 2025
Structured Reinforcement Learning for Combinatorial Decision-Making
Heiko Hoppe
Léo Baty
Louis Bouvier
Axel Parmentier
Maximilian Schiffer
OffRL
109
1
0
25 May 2025
CiRL: Open-Source Environments for Reinforcement Learning in Circular Economy and Net Zero
Federico Zocco
Andrea Corti
Monica Malvezzi
AI4CE
30
0
0
24 May 2025
Maximum Total Correlation Reinforcement Learning
Bang You
Puze Liu
Huaping Liu
Jan Peters
Oleg Arenz
45
0
0
22 May 2025
A Survey of Safe Reinforcement Learning and Constrained MDPs: A Technical Survey on Single-Agent and Multi-Agent Safety
Ankita Kushwaha
Kiran Ravish
Preeti Lamba
Pawan Kumar
19
0
0
22 May 2025