ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.04527
  4. Cited By
A policy gradient approach for Finite Horizon Constrained Markov Decision Processes

A policy gradient approach for Finite Horizon Constrained Markov Decision Processes

10 October 2022
Soumyajit Guin
S. Bhatnagar
ArXivPDFHTML

Papers citing "A policy gradient approach for Finite Horizon Constrained Markov Decision Processes"

8 / 8 papers shown
Title
Safe Reinforcement Learning using Finite-Horizon Gradient-based
  Estimation
Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation
Juntao Dai
Yaodong Yang
Qian Zheng
Gang Pan
OffRL
89
2
0
15 Dec 2024
Safe Reinforcement Learning for Constrained Markov Decision Processes
  with Stochastic Stopping Time
Safe Reinforcement Learning for Constrained Markov Decision Processes with Stochastic Stopping Time
Abhijit Mazumdar
Rafał Wisniewski
Manuela L. Bujorianu
31
3
0
23 Mar 2024
Optimizing Heat Alert Issuance with Reinforcement Learning
Optimizing Heat Alert Issuance with Reinforcement Learning
Ellen M. Considine
Rachel C. Nethery
G. Wellenius
Francesca Dominici
Mauricio Tec
OffRL
34
0
0
21 Dec 2023
Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy
  Gradient Methods
Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods
Sara Klein
Simon Weissmann
Leif Döring
29
7
0
04 Oct 2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for
  Constrained MDPs
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Dongsheng Ding
Chen-Yu Wei
Kaipeng Zhang
Alejandro Ribeiro
54
20
0
20 Jun 2023
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and
  Global Optimality
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
François Ged
M. H. Veiga
35
0
0
22 Mar 2023
Funnel-based Reward Shaping for Signal Temporal Logic Tasks in
  Reinforcement Learning
Funnel-based Reward Shaping for Signal Temporal Logic Tasks in Reinforcement Learning
Naman Saxena
Sandeep Gorantla
Pushpak Jagtap
42
4
0
30 Nov 2022
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
92
146
0
04 May 2020
1