Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2202.00150
Cited By
Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints
31 January 2022
Liyu Chen
R. Jain
Haipeng Luo
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints"
19 / 19 papers shown
Title
Efficient Exploration in Average-Reward Constrained Reinforcement Learning: Achieving Near-Optimal Regret With Posterior Sampling
Danil Provodin
M. Kaptein
Mykola Pechenizkiy
60
0
0
29 May 2024
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Shu-Fan Wang
Guojun Xiong
Jian Li
59
6
0
16 Dec 2023
Provably Efficient Exploration in Constrained Reinforcement Learning:Posterior Sampling Is All You Need
Danil Provodin
Pratik Gajane
Mykola Pechenizkiy
M. Kaptein
39
0
0
27 Sep 2023
Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs
Zihan Zhou
Honghao Wei
Lei Ying
OffRL
43
1
0
27 Sep 2023
Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes
Qinbo Bai
Washim Uddin Mondal
Vaneet Aggarwal
34
9
0
05 Sep 2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Dongsheng Ding
Chen-Yu Wei
Kaipeng Zhang
Alejandro Ribeiro
54
20
0
20 Jun 2023
Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning
Dongsheng Ding
Xiaohan Wei
Zhuoran Yang
Zhaoran Wang
Mihailo R. Jovanović
OffRL
48
11
0
31 May 2023
Online Resource Allocation in Episodic Markov Decision Processes
Duksang Lee
William Overman
Dabeen Lee
44
1
0
18 May 2023
Model-Free Robust Average-Reward Reinforcement Learning
Yue Wang
Alvaro Velasquez
George Atia
Ashley Prater-Bennette
Shaofeng Zou
34
10
0
17 May 2023
Graph Exploration for Effective Multi-agent Q-Learning
Ainur Zhaikhan
Ali H. Sayed
42
1
0
19 Apr 2023
Provably Efficient Model-Free Algorithms for Non-stationary CMDPs
Honghao Wei
A. Ghosh
Ness B. Shroff
Lei Ying
Xingyu Zhou
29
13
0
10 Mar 2023
Online Nonstochastic Control with Adversarial and Static Constraints
Xin Liu
Zixi Yang
Lei Ying
42
5
0
05 Feb 2023
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
Akhil Agnihotri
R. Jain
Haipeng Luo
29
2
0
02 Feb 2023
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
40
6
0
27 Jan 2023
Robust Average-Reward Markov Decision Processes
Yue Wang
Alvaro Velasquez
George Atia
Ashley Prater-Bennette
Shaofeng Zou
41
12
0
02 Jan 2023
An Empirical Evaluation of Posterior Sampling for Constrained Reinforcement Learning
Danil Provodin
Pratik Gajane
Mykola Pechenizkiy
M. Kaptein
46
1
0
08 Sep 2022
Concave Utility Reinforcement Learning with Zero-Constraint Violations
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
38
12
0
12 Sep 2021
Learning in Markov Decision Processes under Constraints
Rahul Singh
Abhishek Gupta
Ness B. Shroff
51
27
0
27 Feb 2020
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
107
100
0
15 Oct 2019
1