Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space

2 June 2023

Papers citing "Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space"

8 / 8 papers shown

Title
Online Episodic Convex Reinforcement Learning B. Moreno Khaled Eldowa Pierre Gaillard Margaux Brégère Nadia Oudjane OffRL 198 0 0 12 May 2025
From Gradient Clipping to Normalization for Heavy Tailed SGD Florian Hübler Ilyas Fatkhullin Niao He 113 10 0 17 Oct 2024
Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods Ric De Santi Manish Prajapat Andreas Krause 97 5 0 13 Jul 2024
MetaCURL: Non-stationary Concave Utility Reinforcement Learning B. Moreno Margaux Brégère Pierre Gaillard Nadia Oudjane OffRL 91 1 0 30 May 2024
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline Wenjia Meng Qian Zheng Long Yang Yilong Yin Gang Pan OffRL 89 0 0 04 May 2024
Taming Nonconvex Stochastic Mirror Descent with General Bregman Divergence Ilyas Fatkhullin Niao He 80 4 0 27 Feb 2024
On the Stochastic (Variance-Reduced) Proximal Gradient Method for Regularized Expected Reward Optimization Ling Liang Haizhao Yang 60 1 0 23 Jan 2024
Inverse Reinforcement Learning with the Average Reward Criterion Feiyang Wu Jingyang Ke Anqi Wu 87 11 0 24 May 2023