ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.04870
  4. Cited By
Robust Constrained-MDPs: Soft-Constrained Robust Policy Optimization
  under Model Uncertainty

Robust Constrained-MDPs: Soft-Constrained Robust Policy Optimization under Model Uncertainty

10 October 2020
R. Russel
M. Benosman
J. Baar
ArXivPDFHTML

Papers citing "Robust Constrained-MDPs: Soft-Constrained Robust Policy Optimization under Model Uncertainty"

6 / 6 papers shown
Title
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
Toshinori Kitamura
Tadashi Kozuno
Wataru Kumagai
Kenta Hoshino
Y. Hosoe
Kazumi Kasaura
Masashi Hamaya
Paavo Parmas
Yutaka Matsuo
72
0
0
29 Aug 2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with
  Uniform PAC Guarantees
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
Toshinori Kitamura
Tadashi Kozuno
Masahiro Kato
Yuki Ichihara
Soichiro Nishimori
Akiyoshi Sannai
Sho Sonoda
Wataru Kumagai
Yutaka Matsuo
42
2
0
31 Jan 2024
Robust Average-Reward Markov Decision Processes
Robust Average-Reward Markov Decision Processes
Yue Wang
Alvaro Velasquez
George K. Atia
Ashley Prater-Bennette
Shaofeng Zou
33
11
0
02 Jan 2023
Constrained Update Projection Approach to Safe Policy Optimization
Constrained Update Projection Approach to Safe Policy Optimization
Long Yang
Jiaming Ji
Juntao Dai
Linrui Zhang
Binbin Zhou
Pengfei Li
Yaodong Yang
Gang Pan
38
43
0
15 Sep 2022
Sim2real for Reinforcement Learning Driven Next Generation Networks
Sim2real for Reinforcement Learning Driven Next Generation Networks
Peizheng Li
Jonathan D. Thomas
Xiaoyang Wang
Hakan Erdol
A. Ahmad
...
S. Kapoor
Arjun Parekh
A. Doufexi
Arman Shojaeifard
Robert Piechocki
22
3
0
08 Jun 2022
RLOps: Development Life-cycle of Reinforcement Learning Aided Open RAN
RLOps: Development Life-cycle of Reinforcement Learning Aided Open RAN
Peizheng Li
Jonathan D. Thomas
Xiaoyang Wang
Ahmed Khalil
A. Ahmad
...
S. Kapoor
Arjun Parekh
A. Doufexi
Arman Shojaeifard
Robert Piechocki
AI4TS
14
37
0
12 Nov 2021
1