Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.16286
Cited By
v1
v2
v3
v4 (latest)
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
29 August 2024
Toshinori Kitamura
Tadashi Kozuno
Wataru Kumagai
Kenta Hoshino
Y. Hosoe
Kazumi Kasaura
Masashi Hamaya
Paavo Parmas
Yutaka Matsuo
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form"
46 / 46 papers shown
Title
Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees
Sourav Ganguly
Arnob Ghosh
Kishan Panaganti
Adam Wierman
20
0
0
25 May 2025
Distributionally Robust Constrained Reinforcement Learning under Strong Duality
Zhengfei Zhang
Kishan Panaganti
Laixi Shi
Yanan Sui
Adam Wierman
Yisong Yue
OOD
72
5
0
22 Jun 2024
Constrained Reinforcement Learning Under Model Mismatch
Zhongchang Sun
Sihong He
Fei Miao
Shaofeng Zou
67
6
0
02 May 2024
Truly No-Regret Learning in Constrained MDPs
Adrian Müller
Pragnya Alatur
Volkan Cevher
Giorgia Ramponi
Niao He
64
10
0
24 Feb 2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
Toshinori Kitamura
Tadashi Kozuno
Masahiro Kato
Yuki Ichihara
Soichiro Nishimori
Akiyoshi Sannai
Sho Sonoda
Wataru Kumagai
Yutaka Matsuo
64
3
0
31 Jan 2024
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Dongsheng Ding
Chen-Yu Wei
Kai Zhang
Alejandro Ribeiro
94
22
0
20 Jun 2023
Solving Stabilize-Avoid Optimal Control via Epigraph Form and Deep Reinforcement Learning
Oswin So
Chuchu Fan
36
24
0
23 May 2023
Towards Minimax Optimality of Model-based Robust Reinforcement Learning
Pierre Clavier
E. L. Pennec
Matthieu Geist
77
14
0
10 Feb 2023
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
Theodore H. Moskovitz
Brendan O'Donoghue
Vivek Veeriah
Sebastian Flennerhag
Satinder Singh
Tom Zahavy
89
21
0
02 Feb 2023
Robust Markov Decision Processes without Model Estimation
Wenhao Yang
Hanfengzhai Wang
Tadashi Kozuno
S. Jordan
Zhihua Zhang
83
4
0
02 Feb 2023
Policy Gradient for Rectangular Robust Markov Decision Processes
Navdeep Kumar
E. Derman
Matthieu Geist
Kfir Y. Levy
Shie Mannor
59
22
0
31 Jan 2023
Policy Gradient in Robust MDPs with Global Convergence Guarantee
Qiuhao Wang
C. Ho
Marek Petrik
87
29
0
20 Dec 2022
First-order Policy Optimization for Robust Markov Decision Process
Yan Li
Guanghui Lan
Tuo Zhao
118
25
0
21 Sep 2022
On the convex formulations of robust Markov decision processes
Julien Grand-Clément
Marek Petrik
74
11
0
21 Sep 2022
Robust Constrained Reinforcement Learning
Yue Wang
Fei Miao
Shaofeng Zou
57
14
0
14 Sep 2022
Efficient Policy Iteration for Robust Markov Decision Processes via Regularization
Navdeep Kumar
Kfir Y. Levy
Kaixin Wang
Shie Mannor
64
19
0
28 May 2022
Policy Gradient Method For Robust Reinforcement Learning
Yue Wang
Shaofeng Zou
113
77
0
15 May 2022
Robust Entropy-regularized Markov Decision Processes
Tien Mai
Patrick Jaillet
25
5
0
31 Dec 2021
Sample Complexity of Robust Reinforcement Learning with a Generative Model
Kishan Panaganti
D. Kalathil
133
77
0
02 Dec 2021
DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning
Archana Bura
Aria HasanzadeZonuzy
D. Kalathil
S. Shakkottai
J. Chamberland
80
29
0
01 Dec 2021
Faster Algorithm and Sharper Analysis for Constrained Markov Decision Process
Tianjiao Li
Ziwei Guan
Shaofeng Zou
Tengyu Xu
Yingbin Liang
Guanghui Lan
61
30
0
20 Oct 2021
A Dual Approach to Constrained Markov Decision Processes with Entropy Regularization
Donghao Ying
Yuhao Ding
Javad Lavaei
40
34
0
17 Oct 2021
Twice regularized MDPs and the equivalence between robustness and regularization
E. Derman
Matthieu Geist
Shie Mannor
92
57
0
12 Oct 2021
Online Robust Reinforcement Learning with Model Uncertainty
Yue Wang
Shaofeng Zou
OOD
OffRL
125
109
0
29 Sep 2021
Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs
Tao-Wen Liu
Ruida Zhou
D. Kalathil
P. R. Kumar
Chao Tian
70
84
0
04 Jun 2021
A Provably-Efficient Model-Free Algorithm for Constrained Markov Decision Processes
Honghao Wei
Xin Liu
Lei Ying
83
23
0
03 Jun 2021
Reward is enough for convex MDPs
Tom Zahavy
Brendan O'Donoghue
Guillaume Desjardins
Satinder Singh
108
75
0
01 Jun 2021
Bilinear Classes: A Structural Framework for Provable Generalization in RL
S. Du
Sham Kakade
Jason D. Lee
Shachar Lovett
G. Mahajan
Wen Sun
Ruosong Wang
OffRL
171
191
0
19 Mar 2021
Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification
D. Mankowitz
D. A. Calian
Rae Jeong
Cosmin Paduraru
N. Heess
Sumanth Dathathri
Martin Riedmiller
Timothy A. Mann
82
14
0
20 Oct 2020
Robust Constrained-MDPs: Soft-Constrained Robust Policy Optimization under Model Uncertainty
R. Russel
M. Benosman
J. Baar
63
22
0
10 Oct 2020
Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs
Aria HasanzadeZonuzy
Archana Bura
D. Kalathil
S. Shakkottai
52
40
0
01 Aug 2020
Partial Policy Iteration for L1-Robust Markov Decision Processes
C. Ho
Marek Petrik
W. Wiesemann
108
54
0
16 Jun 2020
On the Global Convergence Rates of Softmax Policy Gradient Methods
Jincheng Mei
Chenjun Xiao
Csaba Szepesvári
Dale Schuurmans
140
294
0
13 May 2020
Scalable First-Order Methods for Robust MDPs
Julien Grand-Clément
Christian Kroer
56
28
0
11 May 2020
Exploration-Exploitation in Constrained MDPs
Yonathan Efroni
Shie Mannor
Matteo Pirotta
117
181
0
04 Mar 2020
Constrained Upper Confidence Reinforcement Learning
Liyuan Zheng
Lillian J. Ratliff
84
68
0
26 Jan 2020
Reinforcement Learning via Fenchel-Rockafellar Duality
Ofir Nachum
Bo Dai
OffRL
148
122
0
07 Jan 2020
Safe Policies for Reinforcement Learning via Primal-Dual Methods
Santiago Paternain
Miguel Calvo-Fullana
Luiz F. O. Chamon
Alejandro Ribeiro
62
105
0
20 Nov 2019
Constrained Reinforcement Learning Has Zero Duality Gap
Santiago Paternain
Luiz F. O. Chamon
Miguel Calvo-Fullana
Alejandro Ribeiro
59
193
0
29 Oct 2019
Reinforcement Learning with Convex Constraints
Sobhan Miryoosefi
Kianté Brantley
Hal Daumé
Miroslav Dudík
Robert Schapire
51
92
0
21 Jun 2019
Global Optimality Guarantees For Policy Gradient Methods
Jalaj Bhandari
Daniel Russo
82
194
0
05 Jun 2019
Batch Policy Learning under Constraints
Hoang Minh Le
Cameron Voloshin
Yisong Yue
OffRL
60
333
0
20 Mar 2019
Reward Constrained Policy Optimization
Chen Tessler
D. Mankowitz
Shie Mannor
83
541
0
28 May 2018
Robust Nonparametric Regression under Huber's
ε
ε
ε
-contamination Model
S. Du
Yining Wang
Sivaraman Balakrishnan
Pradeep Ravikumar
Aarti Singh
47
12
0
26 May 2018
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
120
1,328
0
30 May 2017
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning
Christoph Dann
Tor Lattimore
Emma Brunskill
76
311
0
22 Mar 2017
1