1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
193
4
0
20 Aug 2024
Training Verifiably Robust Agents Using Set-Based Reinforcement Learning
Manuel Wendl
Lukas Koller
Tobias Ladner
Matthias Althoff
OOD
OffRL
96
0
0
17 Aug 2024
Explaining an Agent's Future Beliefs through Temporally Decomposing Future Reward Estimators
Mark Towers
Yali Du
Christopher T. Freeman
Timothy J. Norman
71
1
0
15 Aug 2024
Experimental evaluation of offline reinforcement learning for HVAC control in buildings
Jun Wang
Linyan Li
Qi Liu
Yu Yang
OffRL
AI4CE
48
1
0
15 Aug 2024
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
Grace Liu
Michael Tang
Benjamin Eysenbach
OffRL
131
2
0
11 Aug 2024
F1tenth Autonomous Racing With Offline Reinforcement Learning Methods
Prajwal Koirala
Cody Fleming
OffRL
84
1
0
08 Aug 2024
Navigating the Human Maze: Real-Time Robot Pathfinding with Generative Imitation Learning
Martin Moder
Stephen Adhisaputra
Josef Pauli
67
0
0
07 Aug 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
126
9
0
06 Aug 2024
Faster Model Predictive Control via Self-Supervised Initialization Learning
Zhaoxin Li
Letian Chen
Rohan R. Paleja
S. Nageshrao
Matthew C. Gombolay
359
2
0
06 Aug 2024
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
Seyeon Kim
Joonhun Lee
Namhoon Cho
Sungjun Han
Seungeon Baek
122
0
0
05 Aug 2024
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning
Shirong Liu
Chenjia Bai
Zixian Guo
Hao Zhang
Gaurav Sharma
Yang Liu
OffRL
105
3
0
04 Aug 2024
RVI-SAC: Average Reward Off-Policy Deep Reinforcement Learning
Yukinari Hisaki
Isao Ono
70
2
0
04 Aug 2024
Coordinating Planning and Tracking in Layered Control Policies via Actor-Critic Learning
Fengjun Yang
Nikolai Matni
OffRL
72
0
0
03 Aug 2024
Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer
Yu Yang
Pan Xu
VLM
OffRL
85
2
0
02 Aug 2024
A Survey on Self-play Methods in Reinforcement Learning
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
SyDa
SSL
OnRL
170
9
0
02 Aug 2024
Discretizing Continuous Action Space with Unimodal Probability Distributions for On-Policy Reinforcement Learning
Yuanyang Zhu
Zhi Wang
Yuanheng Zhu
Chunlin Chen
Dongbin Zhao
130
0
0
01 Aug 2024
On the Perturbed States for Transformed Input-robust Reinforcement Learning
Tung M. Luu
Haeyong Kang
Matthew Groh
Thanh Nguyen
Chang D. Yoo
OOD
AAML
OffRL
64
0
0
31 Jul 2024
Image-Based Deep Reinforcement Learning with Intrinsically Motivated Stimuli: On the Execution of Complex Robotic Tasks
David Valencia
Henry Williams
Yuning Xing
Trevor Gee
Minas V. Liarokapis
Bruce A. MacDonald
69
0
0
31 Jul 2024
How to Choose a Reinforcement-Learning Algorithm
Fabian Bongratz
Vladimir Golkov
Lukas Mautner
Luca Della Libera
Frederik Heetmeyer
Felix Czaja
Julian Rodemann
Daniel Cremers
68
1
0
30 Jul 2024
Language-Conditioned Offline RL for Multi-Robot Navigation
Steven D. Morad
Ajay Shankar
J. Blumenkamp
Amanda Prorok
LM&Ro
OffRL
104
7
0
29 Jul 2024
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
Amy Zhang
OffRL
116
19
0
29 Jul 2024
Anomalous State Sequence Modeling to Enhance Safety in Reinforcement Learning
Leen Kweider
Maissa Abou Kassem
Ubai Sandouk
OffRL
80
0
0
29 Jul 2024
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning
Andrew Patterson
Samuel Neumann
Raksha Kumaraswamy
Martha White
Adam White
68
2
0
26 Jul 2024
Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse
Felix Kleuker
Márton Fejér
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
OffRL
AI4CE
101
2
0
26 Jul 2024
LLM-Empowered State Representation for Reinforcement Learning
Boyuan Wang
Yun Qu
Yuhang Jiang
Jianzhun Shao
Chang-rui Liu
Wenming Yang
Xiangyang Ji
89
14
0
18 Jul 2024
On Causally Disentangled State Representation Learning for Reinforcement Learning based Recommender Systems
Siyu Wang
Xiaocong Chen
Lina Yao
CML
57
0
0
18 Jul 2024
Reconfigurable Intelligent Surface Aided Vehicular Edge Computing: Joint Phase-shift Optimization and Multi-User Power Allocation
Kangwei Qi
Qiong Wu
Pingyi Fan
Nan Cheng
Wen Chen
Khaled B. Letaief
70
5
0
18 Jul 2024
Estimating Reaction Barriers with Deep Reinforcement Learning
Adittya Pal
65
0
0
17 Jul 2024
Ontology-driven Reinforcement Learning for Personalized Student Support
Ryan Hare
Ying Tang
43
1
0
14 Jul 2024
DRPC: Distributed Reinforcement Learning Approach for Scalable Resource Provisioning in Container-based Clusters
Haoyu Bai
Minxian Xu
Kejiang Ye
Rajkumar Buyya
Chengzhong Xu
75
6
0
14 Jul 2024
A Benchmark Environment for Offline Reinforcement Learning in Racing Games
Girolamo Macaluso
Alessandro Sestini
Andrew D. Bagdanov
OffRL
64
1
0
12 Jul 2024
Communication-Aware Reinforcement Learning for Cooperative Adaptive Cruise Control
Sicong Jiang
Seongjin Choi
Lijun Sun
80
1
0
12 Jul 2024
HACMan++: Spatially-Grounded Motion Primitives for Manipulation
Bowen Jiang
Yilin Wu
Wenxuan Zhou
Chris Paxton
David Held
73
2
0
11 Jul 2024
Real-time system optimal traffic routing under uncertainties -- Can physics models boost reinforcement learning?
Zemian Ke
Qiling Zou
Jiachao Liu
Sean Qian
AI4CE
54
8
0
10 Jul 2024
Intercepting Unauthorized Aerial Robots in Controlled Airspace Using Reinforcement Learning
Francisco Giral
Ignacio Gómez
S. L. Clainche
72
0
0
09 Jul 2024
System stabilization with policy optimization on unstable latent manifolds
Steffen W. R. Werner
Benjamin Peherstorfer
67
2
0
08 Jul 2024
Augmented Bayesian Policy Search
Mahdi Kallel
Debabrota Basu
R. Akrour
Carlo D'Eramo
80
3
0
05 Jul 2024
The Impact of Quantization and Pruning on Deep Reinforcement Learning Models
Heng Lu
Mehdi Alemi
Reza Rawassizadeh
100
1
0
05 Jul 2024
Gradient-based Regularization for Action Smoothness in Robotic Control with Reinforcement Learning
I. Lee
Hoang-Giang Cao
Cong-Tinh Dao
Yu-Cheng Chen
I-Chen Wu
54
0
0
05 Jul 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
163
26
0
05 Jul 2024
Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning
Zakariae El Asri
Olivier Sigaud
Nicolas Thome
80
0
0
02 Jul 2024
To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning
Tao Ma
Xuzhi Yang
Zoltan Szabo
OffRL
150
0
0
01 Jul 2024
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators
Ori Linial
Guy Tennenholtz
Uri Shalit
OffRL
78
1
0
30 Jun 2024
Diffusion Models for Offline Multi-agent Reinforcement Learning with Safety Constraints
Jianuo Huang
OffRL
61
0
0
30 Jun 2024
KOROL: Learning Visualizable Object Feature with Koopman Operator Rollout for Manipulation
Hongyi Chen
Abulikemu Abuduweili
Aviral Agrawal
Yunhai Han
Harish Ravichandar
Changliu Liu
Jeffrey Ichnowski
117
6
0
29 Jun 2024
Deep Reinforcement Learning Strategies in Finance: Insights into Asset Holding, Trading Behavior, and Purchase Diversity
Alireza Mohammadshafie
Akram Mirzaeinia
Haseebullah Jumakhan
Amir Mirzaeinia
AIFin
28
1
0
29 Jun 2024
PUZZLES: A Benchmark for Neural Algorithmic Reasoning
Benjamin Estermann
Luca A. Lanzendörfer
Yannick Niedermayr
Roger Wattenhofer
112
6
0
29 Jun 2024
3D Operation of Autonomous Excavator based on Reinforcement Learning through Independent Reward for Individual Joints
Yoonkyu Yoo
Donghwi Jung
Seong-Woo Kim
53
0
0
28 Jun 2024
Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors
Emma Cramer
Bernd Frauenknecht
Ramil Sabirov
Sebastian Trimpe
OffRL
OnRL
117
5
0
28 Jun 2024
Autonomous Control of a Novel Closed Chain Five Bar Active Suspension via Deep Reinforcement Learning
Nishesh Singh
Sidharth Ramesh
Abhishek Shankar
Jyotishka Duttagupta
Leander Stephen D'Souza
Sanjay Singh
32
0
0
27 Jun 2024