Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning

28 October 2020

Papers citing "Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning"


Title | Authors | Tag | Counts | Date
Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration | Han-Dong Lim, Donghwan Lee | - | 32 0 0 | 15 Apr 2025
Transformers in Reinforcement Learning: A Survey | Pranav Agarwal, A. Rahman, P. St-Charles, Simon J. D. Prince, Samira Ebrahimi Kahou | OffRL | 37 19 0 | 12 Jul 2023
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs | Theodore H. Moskovitz, Brendan O'Donoghue, Vivek Veeriah, Sebastian Flennerhag, Satinder Singh, Tom Zahavy | - | 50 19 0 | 02 Feb 2023
On Transforming Reinforcement Learning by Transformer: The Development Trajectory | Shengchao Hu, Li Shen, Ya Zhang, Yixin Chen, Dacheng Tao | OffRL | 35 25 0 | 29 Dec 2022