Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs

1 August 2020

Papers citing "Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs"

27 / 27 papers shown

Title
Constrained Online Decision-Making: A Unified Framework Haichen Hu David Simchi-Levi Navid Azizan 34 0 0 11 May 2025
Polynomial-Time Approximability of Constrained Reinforcement Learning Jeremy McMahan 121 0 0 11 Feb 2025
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form Toshinori Kitamura Tadashi Kozuno Wataru Kumagai Kenta Hoshino Y. Hosoe Kazumi Kasaura Masashi Hamaya Paavo Parmas Yutaka Matsuo 72 0 0 29 Aug 2024
Deterministic Policies for Constrained Reinforcement Learning in Polynomial-Time Jeremy McMahan 31 2 0 23 May 2024
Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning Sihan Zeng Thinh T. Doan Justin Romberg 32 0 0 03 May 2024
Structured Reinforcement Learning for Media Streaming at the Wireless Edge Archana Bura Sarat Chandra Bobbili Shreyas Rameshkumar Desik Rengarajan D. Kalathil S. Shakkottai 26 0 0 10 Apr 2024
What Are the Odds? Improving the foundations of Statistical Model Checking Tobias Meggendorfer Maximilian Weininger Patrick Wienhoft 39 4 0 08 Apr 2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees Toshinori Kitamura Tadashi Kozuno Masahiro Kato Yuki Ichihara Soichiro Nishimori Akiyoshi Sannai Sho Sonoda Wataru Kumagai Yutaka Matsuo 42 2 0 31 Jan 2024
Finite-Time Analysis of Three-Timescale Constrained Actor-Critic and Constrained Natural Actor-Critic Algorithms Prashansa Panda Shalabh Bhatnagar 33 0 0 25 Oct 2023
Reinforcement Learning Under Probabilistic Spatio-Temporal Constraints with Time Windows Xiaoshan Lin Abbasali Koochakzadeh Yasin Yazıcıoğlu Derya Aksaray 18 1 0 29 Jul 2023
Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity Runyu Zhang Yang Hu Na Li 38 5 0 20 Jun 2023
ROSARL: Reward-Only Safe Reinforcement Learning Geraud Nangue Tasse Tamlin Love Mark W. Nemecek Steven D. James Benjamin Rosman 21 3 0 31 May 2023
A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard Constraints Ming Shi Yitao Liang Ness B. Shroff 35 8 0 08 Feb 2023
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation K. C. Kalagarla Rahul Jain Pierluigi Nuzzo 26 6 0 27 Jan 2023
Provable Reset-free Reinforcement Learning by No-Regret Reduction Hoai-An Nguyen Ching-An Cheng OffRL 18 2 0 06 Jan 2023
Provable Safe Reinforcement Learning with Binary Feedback Andrew Bennett Dipendra Kumar Misra Nathan Kallus OffRL 30 4 0 26 Oct 2022
Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments Yixuan Wang S. Zhan Ruochen Jiao Zhilu Wang Wanxin Jin Zhuoran Yang Zhaoran Wang Chao Huang Qi Zhu 26 48 0 29 Sep 2022
Ablation Study of How Run Time Assurance Impacts the Training and Performance of Reinforcement Learning Agents Nathaniel P. Hamilton Kyle Dunlap Taylor T. Johnson Kerianne L. Hobbs OffRL 19 8 0 08 Jul 2022
Reinforcement Learning with a Terminator Guy Tennenholtz Nadav Merlis Lior Shani Shie Mannor Uri Shalit Gal Chechik Assaf Hallak Gal Dalal 9 5 0 30 May 2022
Finding Safe Zones of policies Markov Decision Processes Lee Cohen Yishay Mansour Michal Moshkovitz 19 1 0 23 Feb 2022
Reinforcement Learning with Almost Sure Constraints Agustin Castellano Hancheng Min J. Bazerque Enrique Mallada 13 15 0 09 Dec 2021
DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning Archana Bura Aria HasanzadeZonuzy D. Kalathil S. Shakkottai J. Chamberland 22 28 0 01 Dec 2021
RLOps: Development Life-cycle of Reinforcement Learning Aided Open RAN Peizheng Li Jonathan D. Thomas Xiaoyang Wang Ahmed Khalil A. Ahmad ... S. Kapoor Arjun Parekh A. Doufexi Arman Shojaeifard Robert Piechocki AI4TS 14 37 0 12 Nov 2021
Reinforcement Learning for Finite-Horizon Restless Multi-Armed Multi-Action Bandits Guojun Xiong Jian Li Rahul Singh 17 4 0 20 Sep 2021
Learning to Act Safely with Limited Exposure and Almost Sure Certainty Agustin Castellano Hancheng Min J. Bazerque Enrique Mallada 13 4 0 18 May 2021
A Meta Reinforcement Learning-based Approach for Self-Adaptive System Mingyue Zhang Jialong Li Haiyan Zhao Kenji Tei S. Honiden Zhi Jin 17 4 0 11 May 2021
Stochastic Linear Bandits with Protected Subspace Advait Parulekar Soumya Basu Aditya Gopalan Karthikeyan Shanmugam Sanjay Shakkottai 71 2 0 02 Nov 2020