Upper Confidence Primal-Dual Reinforcement Learning for CMDP with
Adversarial Loss

Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss

2 March 2020

Papers citing "Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss"

12 / 12 papers shown

Title
Learning Adversarial MDPs with Stochastic Hard Constraints Francesco Emanuele Stradi Matteo Castiglioni A. Marchesi Nicola Gatti 44 5 0 06 Mar 2024
Provably Efficient Exploration in Constrained Reinforcement Learning:Posterior Sampling Is All You Need Danil Provodin Pratik Gajane Mykola Pechenizkiy M. Kaptein 44 0 0 27 Sep 2023
A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP Fan Chen Junyu Zhang Zaiwen Wen OffRL 41 8 0 13 Jul 2022
Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints Liyu Chen R. Jain Haipeng Luo 72 25 0 31 Jan 2022
Concave Utility Reinforcement Learning with Zero-Constraint Violations Mridul Agarwal Qinbo Bai Vaneet Aggarwal 38 12 0 12 Sep 2021
Safe Reinforcement Learning Using Advantage-Based Intervention Nolan Wagener Byron Boots Ching-An Cheng 57 52 0 16 Jun 2021
A Lyapunov-Based Methodology for Constrained Optimization with Bandit Feedback Semih Cayci Yilin Zheng A. Eryilmaz 31 10 0 09 Jun 2021
Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs Tao-Wen Liu Ruida Zhou D. Kalathil P. R. Kumar Chao Tian 47 79 0 04 Jun 2021
A Provably-Efficient Model-Free Algorithm for Constrained Markov Decision Processes Honghao Wei Xin Liu Lei Ying 39 21 0 03 Jun 2021
Provably Efficient Algorithms for Multi-Objective Competitive RL Tiancheng Yu Yi Tian J.N. Zhang S. Sra 37 20 0 05 Feb 2021
Learning in Markov Decision Processes under Constraints Rahul Singh Abhishek Gupta Ness B. Shroff 56 27 0 27 Feb 2020
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes Chen-Yu Wei Mehdi Jafarnia-Jahromi Haipeng Luo Hiteshi Sharma R. Jain 109 104 0 15 Oct 2019