Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2007.02151
Cited By
Variational Policy Gradient Method for Reinforcement Learning with General Utilities
4 July 2020
Junyu Zhang
Alec Koppel
Amrit Singh Bedi
Csaba Szepesvári
Mengdi Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Variational Policy Gradient Method for Reinforcement Learning with General Utilities"
6 / 6 papers shown
Title
Online Episodic Convex Reinforcement Learning
B. Moreno
Khaled Eldowa
Pierre Gaillard
Margaux Brégère
Nadia Oudjane
OffRL
63
0
0
12 May 2025
A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
Bram De Cooman
Johan A. K. Suykens
43
0
0
25 Apr 2024
Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators
Yin-Huan Han
Meisam Razaviyayn
Renyuan Xu
32
5
0
15 Mar 2023
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift
Alekh Agarwal
Sham Kakade
Jason D. Lee
G. Mahajan
27
320
0
01 Aug 2019
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
72
1,313
0
30 May 2017
Mean-Variance Optimization in Markov Decision Processes
Shie Mannor
J. Tsitsiklis
51
126
0
29 Apr 2011
1