Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 831 papers shown
Title
Policy-Guided Causal State Representation for Offline Reinforcement Learning Recommendation
Siyu Wang
Xiaocong Chen
Lina Yao
CML
OffRL
93
0
0
04 Feb 2025
B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning
Woojun Kim
Katia Sycara
OffRL
94
0
0
30 Jan 2025
FuzzyLight: A Robust Two-Stage Fuzzy Approach for Traffic Signal Control Works in Real Cities
Mingyuan Li
Jiahao Wang
Bo Du
Jun Shen
Qiang Wu
59
1
0
28 Jan 2025
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun Luo
87
1
0
28 Jan 2025
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
Hao Sun
M. Schaar
94
14
0
28 Jan 2025
Low-altitude Friendly-Jamming for Satellite-Maritime Communications via Generative AI-enabled Deep Reinforcement Learning
Jiawei Huang
Aimin Wang
Geng Sun
Jiahui Li
Jiacheng Wang
Dusit Niyato
Victor C. M. Leung
67
0
0
28 Jan 2025
EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning
Bowen Zheng
Ran Cheng
Kay Chen Tan
47
0
0
25 Jan 2025
Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework
Yulong Hu
Tingting Dong
Sen Li
OffRL
OnRL
67
0
0
24 Jan 2025
ABPT: Amended Backpropagation through Time with Partially Differentiable Rewards
Fanxing Li
Fangyu Sun
Tianbao Zhang
Danping Zou
39
0
0
24 Jan 2025
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration
Yirui Zhou
Xiaowei Liu
Xiaofeng Zhang
Yangchun Zhang
39
0
0
22 Jan 2025
Revisiting Ensemble Methods for Stock Trading and Crypto Trading Tasks at ACM ICAIF FinRL Contest 2023-2024
Nikolaus Holzer
Keyi Wang
Kairong Xiao
Xiao-Yang Liu Yanglet
AIFin
35
1
0
18 Jan 2025
Control-ITRA: Controlling the Behavior of a Driving Model
Vasileios Lioutas
Adam Scibior
Matthew Niedoba
Berend Zwartsenberg
Frank Wood
204
0
0
17 Jan 2025
Wasserstein Adaptive Value Estimation for Actor-Critic Reinforcement Learning
Ali Baheri
Zahra Sharooei
Chirayu Salgarkar
260
0
0
17 Jan 2025
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
Abdullah Akgul
Manuel Haußmann
M. Kandemir
OffRL
76
1
0
17 Jan 2025
An Empirical Study of Deep Reinforcement Learning in Continuing Tasks
Yi Wan
D. Korenkevych
Zheqing Zhu
OffRL
CLL
55
0
0
12 Jan 2025
On the role of Artificial Intelligence methods in modern force-controlled manufacturing robotic tasks
Vincenzo Petrone
Enrico Ferrentino
Pasquale Chiacchio
39
0
0
10 Jan 2025
Risk-averse policies for natural gas futures trading using distributional reinforcement learning
Félicien Hêche
Biagio Nigro
Oussama Barakat
Stephan Robert-Nicoud
OffRL
44
0
0
08 Jan 2025
SR-Reward: Taking The Path More Traveled
Seyed Mahdi Basiri Azad
Zahra Padar
Gabriel Kalweit
Joschka Boedecker
OffRL
67
0
0
04 Jan 2025
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feiferi Feng
OffRL
107
1
0
22 Dec 2024
Harvesting energy from turbulent winds with Reinforcement Learning
Lorenzo Basile
Maria Grazia Berni
Antonio Celani
74
0
0
18 Dec 2024
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
89
0
0
16 Dec 2024
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
Shiron Thalagala
Pak Kin Wong
Xiaozheng Wang
Tianang Sun
OffRL
76
0
0
24 Nov 2024
Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation
Huy Le
Miroslav Gabriel
Tai Hoang
Gerhard Neumann
Ngo Anh Vien
116
1
0
22 Nov 2024
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
A. Jain
Harley Wiltzer
Jesse Farebrother
Irina Rish
Glen Berseth
Sanjiban Choudhury
57
1
0
11 Nov 2024
OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control
Rohit Bokade
Xiaoning Jin
OffRL
39
0
0
10 Nov 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
44
0
0
27 Oct 2024
Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations
Bryan Chan
Anson Leung
James Bergstra
OffRL
OnRL
67
0
0
19 Oct 2024
Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
Mitsuhiko Nakamoto
Oier Mees
Aviral Kumar
Sergey Levine
OffRL
79
13
0
17 Oct 2024
Reinforcement Learning with Euclidean Data Augmentation for State-Based Continuous Control
Jinzhu Luo
Dingyang Chen
Qi Zhang
OffRL
26
0
0
16 Oct 2024
The State of Robot Motion Generation
Kostas E. Bekris
Joe H. Doerr
Patrick Meng
Sumanth Tangirala
3DV
41
2
0
16 Oct 2024
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
Jaehyun Park
Yunho Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
39
1
0
15 Oct 2024
Communication-Control Codesign for Large-Scale Wireless Networked Control Systems
Gaoyang Pang
Wanchun Liu
Dusit Niyato
Branka Vucetic
Yonghui Li
AI4CE
26
0
0
15 Oct 2024
TOP-ERL: Transformer-based Off-Policy Episodic Reinforcement Learning
Ge Li
Dong Tian
Hongyi Zhou
Xinkai Jiang
Rudolf Lioutikov
Gerhard Neumann
OffRL
250
3
0
12 Oct 2024
HG2P: Hippocampus-inspired High-reward Graph and Model-Free Q-Gradient Penalty for Path Planning and Motion Control
Haoran Wang
Yaoru Sun
Zeshen Tang
Haibo Shi
Chenyuan Jiao
32
0
0
12 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
46
1
0
11 Oct 2024
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control
Devdhar Patel
H. Siegelmann
OffRL
37
0
0
11 Oct 2024
FRASA: An End-to-End Reinforcement Learning Agent for Fall Recovery and Stand Up of Humanoid Robots
Clément Gaspard
Marc Duclusaud
G. Passault
Mélodie Daniel
Olivier Ly
38
3
0
11 Oct 2024
Neuroplastic Expansion in Deep Reinforcement Learning
Jiashun Liu
J. Obando-Ceron
Rameswar Panda
L. Pan
44
3
0
10 Oct 2024
Zero-Shot Generalization of Vision-Based RL Without Data Augmentation
Sumeet Batra
Gaurav Sukhatme
OffRL
DRL
36
2
0
09 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
43
1
0
07 Oct 2024
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
Zhaolin Gao
Wenhao Zhan
Jonathan D. Chang
Gokul Swamy
Kianté Brantley
Jason D. Lee
Wen Sun
OffRL
78
3
0
06 Oct 2024
Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks
Zeyu Feng
Hao Luan
Kevin Yuchen Ma
Harold Soh
32
2
0
03 Oct 2024
Autonomous Driving at Unsignalized Intersections: A Review of Decision-Making Challenges and Reinforcement Learning-Based Solutions
Mohammad K. Al-Sharman
Luc Edes
Bert Sun
Vishal Jayakumar
Mohamed A. Daoud
Derek Rayside
W. Melek
29
1
0
20 Sep 2024
BetterBodies: Reinforcement Learning guided Diffusion for Antibody Sequence Design
Yannick Vogt
Mehdi Naouar
M. Kalweit
Christoph Cornelius Miething
Justus Duyster
Joschka Boedecker
Gabriel Kalweit
DiffM
42
0
0
09 Sep 2024
Simplex-enabled Safe Continual Learning Machine
H. Cao
Y. Mao
Yihao Cai
L. Sha
Marco Caccamo
44
3
0
05 Sep 2024
Compatible Gradient Approximations for Actor-Critic Algorithms
Baturay Saglam
Dionysis Kalogerias
37
0
0
02 Sep 2024
RAIN: Reinforcement Algorithms for Improving Numerical Weather and Climate Models
Pritthijit Nath
Henry Moss
Emily Shuckburgh
Mark Webb
AI4Cl
AI4CE
54
0
0
28 Aug 2024
Optimizing TD3 for 7-DOF Robotic Arm Grasping: Overcoming Suboptimality with Exploration-Enhanced Contrastive Learning
Wen-Han Hsieh
Jen-Yuan Chang
26
0
0
26 Aug 2024
LSR-IGRU: Stock Trend Prediction Based on Long Short-Term Relationships and Improved GRU
Peng Zhu
Yuante Li
Yifan Hu
Qinyuan Liu
Dawei Cheng
Yuqi Liang
AIFin
AI4TS
46
4
0
26 Aug 2024
SAMBO-RL: Shifts-aware Model-based Offline Reinforcement Learning
Wang Luo
Haoran Li
Zicheng Zhang
Congying Han
Jiayu Lv
Tiande Guo
OffRL
48
1
0
23 Aug 2024
Previous
1
2
3
4
5
...
15
16
17
Next