ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.05850
  4. Cited By
Achieving Zero Constraint Violation for Constrained Reinforcement
  Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm
v1v2 (latest)

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm

12 June 2022
Qinbo Bai
Amrit Singh Bedi
Vaneet Aggarwal
ArXiv (abs)PDFHTML

Papers citing "Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm"

12 / 12 papers shown
Title
Runtime Safety through Adaptive Shielding: From Hidden Parameter Inference to Provable Guarantees
Runtime Safety through Adaptive Shielding: From Hidden Parameter Inference to Provable Guarantees
Minjae Kwon
Tyler Ingebrand
Ufuk Topcu
Lu Feng
15
0
0
20 May 2025
Polynomial-Time Approximability of Constrained Reinforcement Learning
Polynomial-Time Approximability of Constrained Reinforcement Learning
Jeremy McMahan
432
0
0
11 Feb 2025
Last-Iterate Convergence of General Parameterized Policies in
  Constrained MDPs
Last-Iterate Convergence of General Parameterized Policies in Constrained MDPs
Washim Uddin Mondal
Vaneet Aggarwal
76
1
0
21 Aug 2024
Last-Iterate Global Convergence of Policy Gradients for Constrained
  Reinforcement Learning
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning
Alessandro Montenegro
Marco Mussi
Matteo Papini
Alberto Maria Metelli
BDL
70
2
0
15 Jul 2024
Deterministic Policies for Constrained Reinforcement Learning in
  Polynomial-Time
Deterministic Policies for Constrained Reinforcement Learning in Polynomial-Time
Jeremy McMahan
82
2
0
23 May 2024
A safe exploration approach to constrained Markov decision processes
A safe exploration approach to constrained Markov decision processes
Tingting Ni
Maryam Kamgarpour
99
3
0
01 Dec 2023
Improved Sample Complexity Analysis of Natural Policy Gradient Algorithm
  with General Parameterization for Infinite Horizon Discounted Reward Markov
  Decision Processes
Improved Sample Complexity Analysis of Natural Policy Gradient Algorithm with General Parameterization for Infinite Horizon Discounted Reward Markov Decision Processes
Washim Uddin Mondal
Vaneet Aggarwal
68
11
0
18 Oct 2023
Enhancing Infrared Small Target Detection Robustness with Bi-Level
  Adversarial Framework
Enhancing Infrared Small Target Detection Robustness with Bi-Level Adversarial Framework
Zhu Liu
Zihang Chen
Jinyuan Liu
Long Ma
Xin-Yue Fan
Risheng Liu
AAML
149
1
0
03 Sep 2023
Mean-Field Approximation of Cooperative Constrained Multi-Agent
  Reinforcement Learning (CMARL)
Mean-Field Approximation of Cooperative Constrained Multi-Agent Reinforcement Learning (CMARL)
Washim Uddin Mondal
Vaneet Aggarwal
S. Ukkusuri
74
4
0
15 Sep 2022
Convergence and sample complexity of natural policy gradient primal-dual
  methods for constrained MDPs
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
Dongsheng Ding
Kai Zhang
Jiali Duan
Tamer Bacsar
Mihailo R. Jovanović
73
21
0
06 Jun 2022
Achieving Zero Constraint Violation for Constrained Reinforcement
  Learning via Primal-Dual Approach
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Qinbo Bai
Amrit Singh Bedi
Mridul Agarwal
Alec Koppel
Vaneet Aggarwal
187
60
0
13 Sep 2021
Concave Utility Reinforcement Learning with Zero-Constraint Violations
Concave Utility Reinforcement Learning with Zero-Constraint Violations
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
69
13
0
12 Sep 2021
1