Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.15268
Cited By
Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning
28 October 2020
Kenny Young
R. Sutton
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning"
4 / 4 papers shown
Title
Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration
Han-Dong Lim
Donghwan Lee
32
0
0
15 Apr 2025
Transformers in Reinforcement Learning: A Survey
Pranav Agarwal
A. Rahman
P. St-Charles
Simon J. D. Prince
Samira Ebrahimi Kahou
OffRL
37
19
0
12 Jul 2023
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
Theodore H. Moskovitz
Brendan O'Donoghue
Vivek Veeriah
Sebastian Flennerhag
Satinder Singh
Tom Zahavy
50
19
0
02 Feb 2023
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya Zhang
Yixin Chen
Dacheng Tao
OffRL
35
25
0
29 Dec 2022
1