ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.05869
  4. Cited By
CRPO: A New Approach for Safe Reinforcement Learning with Convergence
  Guarantee
v1v2v3 (latest)

CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee

11 November 2020
Tengyu Xu
Yingbin Liang
Guanghui Lan
ArXiv (abs)PDFHTML

Papers citing "CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee"

50 / 85 papers shown
Title
Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees
Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees
Sourav Ganguly
Arnob Ghosh
Kishan Panaganti
Adam Wierman
35
0
0
25 May 2025
Distributional Soft Actor-Critic with Harmonic Gradient for Safe and Efficient Autonomous Driving in Multi-lane Scenarios
Distributional Soft Actor-Critic with Harmonic Gradient for Safe and Efficient Autonomous Driving in Multi-lane Scenarios
Feihong Zhang
Guojian Zhan
Bin Shuai
Tianyi Zhang
Jingliang Duan
Shengbo Eben Li
78
0
0
18 May 2025
Graph Neural Network Aided Deep Reinforcement Learning for Resource Allocation in Dynamic Terahertz UAV Networks
Graph Neural Network Aided Deep Reinforcement Learning for Resource Allocation in Dynamic Terahertz UAV Networks
Zhifeng Hu
Chong Han
74
0
0
08 May 2025
Multi-Constraint Safe Reinforcement Learning via Closed-form Solution for Log-Sum-Exp Approximation of Control Barrier Functions
Multi-Constraint Safe Reinforcement Learning via Closed-form Solution for Log-Sum-Exp Approximation of Control Barrier Functions
Chenggang Wang
Xinyi Wang
Yutong Dong
Lei Song
Xinping Guan
72
0
0
01 May 2025
Solving Multi-Agent Safe Optimal Control with Distributed Epigraph Form MARL
Solving Multi-Agent Safe Optimal Control with Distributed Epigraph Form MARL
Songyuan Zhang
Oswin So
Mitchell Black
Zachary Serlin
Chuchu Fan
62
0
0
21 Apr 2025
Primal-Dual Sample Complexity Bounds for Constrained Markov Decision Processes with Multiple Constraints
Max Buckley
Konstantinos Papathanasiou
Andreas Spanopoulos
131
0
0
09 Mar 2025
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Shangding Gu
Laixi Shi
Muning Wen
Ming Jin
Eric Mazumdar
Yuejie Chi
Adam Wierman
C. Spanos
OODOffRL
90
2
0
27 Feb 2025
Safety Representations for Safer Policy Learning
Safety Representations for Safer Policy Learning
Kaustubh Mani
Vincent Mai
Charlie Gauthier
Annie Chen
Samer Nashed
Liam Paull
58
0
0
27 Feb 2025
Reward-Safety Balance in Offline Safe RL via Diffusion Regularization
Junyu Guo
Zhi Zheng
Donghao Ying
Ming Jin
Shangding Gu
C. Spanos
Javad Lavaei
OffRL
198
0
0
18 Feb 2025
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
Zijian Guo
Weichao Zhou
Wenchao Li
OffRL
148
2
0
28 Jan 2025
Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics
Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics
S. Hazra
P. Dasgupta
Soumyajit Dey
108
0
0
21 Jan 2025
Adversarial Constrained Policy Optimization: Improving Constrained
  Reinforcement Learning by Adapting Budgets
Adversarial Constrained Policy Optimization: Improving Constrained Reinforcement Learning by Adapting Budgets
Jianmina Ma
Jingtian Ji
Yue Gao
52
0
0
28 Oct 2024
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable
  Near-Optimality under All-task Optimum Comparator
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator
Siyuan Xu
Minghui Zhu
OffRL
72
1
0
13 Oct 2024
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
Yarden As
Bhavya Sukhija
Lenart Treven
Carmelo Sferrazza
Stelian Coros
Andreas Krause
101
2
0
12 Oct 2024
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu
Yuxin Pan
Linjie Xu
Lei Song
Jiang Bian
Pengcheng You
Yize Chen
99
2
0
03 Oct 2024
The Perfect Blend: Redefining RLHF with Mixture of Judges
The Perfect Blend: Redefining RLHF with Mixture of Judges
Tengyu Xu
Eryk Helenowski
Karthik Abinav Sankararaman
Di Jin
Kaiyan Peng
...
Gabriel Cohen
Yuandong Tian
Hao Ma
Sinong Wang
Han Fang
139
14
0
30 Sep 2024
An Offline Adaptation Framework for Constrained Multi-Objective
  Reinforcement Learning
An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
Qian Lin
Zongkai Liu
Danying Mo
Chao Yu
OffRL
143
1
0
16 Sep 2024
Vision-driven UAV River Following: Benchmarking with Safe Reinforcement
  Learning
Vision-driven UAV River Following: Benchmarking with Safe Reinforcement Learning
Zihan Wang
N. Mahmoudian
75
2
0
13 Sep 2024
Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource
  Allocation and Task Offloading in TeraHertz Band Space Networks
Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource Allocation and Task Offloading in TeraHertz Band Space Networks
Zhifeng Hu
Chong Han
Wolfgang H. Gerstacker
I. F. Akyildiz
85
0
0
12 Sep 2024
Last-Iterate Convergence of General Parameterized Policies in
  Constrained MDPs
Last-Iterate Convergence of General Parameterized Policies in Constrained MDPs
Washim Uddin Mondal
Vaneet Aggarwal
76
1
0
21 Aug 2024
Last-Iterate Global Convergence of Policy Gradients for Constrained
  Reinforcement Learning
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning
Alessandro Montenegro
Marco Mussi
Matteo Papini
Alberto Maria Metelli
BDL
70
2
0
15 Jul 2024
Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey
Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey
Milan Ganai
Sicun Gao
Sylvia Herbert
194
8
0
12 Jul 2024
Optimal Transport-Assisted Risk-Sensitive Q-Learning
Optimal Transport-Assisted Risk-Sensitive Q-Learning
Zahra Shahrooei
Ali Baheri
71
3
0
17 Jun 2024
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
Zhehua Zhou
Xuan Xie
Jiayang Song
Zhan Shu
Lei Ma
125
1
0
06 Jun 2024
Enhancing Efficiency of Safe Reinforcement Learning via Sample
  Manipulation
Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
Shangding Gu
Laixi Shi
Yuhao Ding
Alois Knoll
C. Spanos
Adam Wierman
Ming Jin
OffRL
88
2
0
31 May 2024
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
Dohyeong Kim
Taehyun Cho
Seung Han
Hojun Chung
Kyungjae Lee
Songhwai Oh
74
1
0
29 May 2024
A CMDP-within-online framework for Meta-Safe Reinforcement Learning
A CMDP-within-online framework for Meta-Safe Reinforcement Learning
Vanshaj Khattar
Yuhao Ding
Bilgehan Sel
Javad Lavaei
Ming Jin
OffRL
85
13
0
26 May 2024
Safe and Balanced: A Framework for Constrained Multi-Objective
  Reinforcement Learning
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Alois Knoll
Ming Jin
77
3
0
26 May 2024
Federated Reinforcement Learning with Constraint Heterogeneity
Federated Reinforcement Learning with Constraint Heterogeneity
Hao Jin
Liangyu Zhang
Zhihua Zhang
103
0
0
06 May 2024
Constrained Reinforcement Learning Under Model Mismatch
Constrained Reinforcement Learning Under Model Mismatch
Zhongchang Sun
Sihong He
Fei Miao
Shaofeng Zou
78
6
0
02 May 2024
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Ming Jin
Alois Knoll
102
10
0
02 May 2024
Myopically Verifiable Probabilistic Certificates for Safe Control and
  Learning
Myopically Verifiable Probabilistic Certificates for Safe Control and Learning
Zhuoyuan Wang
Haoming Jing
Christian Kurniawan
Albert Chern
Yorie Nakahira
81
1
0
23 Apr 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for
  Mobile Edge Computing, its Applications, and Future Research Trajectories
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
104
9
0
22 Apr 2024
Primal Methods for Variational Inequality Problems with Functional Constraints
Primal Methods for Variational Inequality Problems with Functional Constraints
Liang Zhang
Niao He
Michael Muehlebach
103
3
0
19 Mar 2024
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective
  Reinforcement Learning
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning
Dohyeong Kim
Mineui Hong
Jeongho Park
Songhwai Oh
76
0
0
01 Mar 2024
A Survey of Constraint Formulations in Safe Reinforcement Learning
A Survey of Constraint Formulations in Safe Reinforcement Learning
Akifumi Wachi
Xun Shen
Yanan Sui
88
15
0
03 Feb 2024
Off-Policy Primal-Dual Safe Reinforcement Learning
Off-Policy Primal-Dual Safe Reinforcement Learning
Zifan Wu
Bo Tang
Qian Lin
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
115
4
0
26 Jan 2024
Gradient Shaping for Multi-Constraint Safe Reinforcement Learning
Gradient Shaping for Multi-Constraint Safe Reinforcement Learning
Yi-Fan Yao
Zuxin Liu
Zhepeng Cen
Peide Huang
Tingnan Zhang
Wenhao Yu
Ding Zhao
OffRL
164
6
0
23 Dec 2023
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region
  Conditional Value at Risk
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk
Dohyeong Kim
Songhwai Oh
OffRL
83
19
0
01 Dec 2023
A safe exploration approach to constrained Markov decision processes
A safe exploration approach to constrained Markov decision processes
Tingting Ni
Maryam Kamgarpour
99
3
0
01 Dec 2023
State-Wise Safe Reinforcement Learning With Pixel Observations
State-Wise Safe Reinforcement Learning With Pixel Observations
S. Zhan
Yixuan Wang
Qingyuan Wu
Ruochen Jiao
Chao Huang
Qi Zhu
120
11
0
03 Nov 2023
Reinforcement Learning in a Safety-Embedded MDP with Trajectory
  Optimization
Reinforcement Learning in a Safety-Embedded MDP with Trajectory Optimization
Fan Yang
Wen-Min Zhou
Zuxin Liu
Ding Zhao
David Held
53
1
0
10 Oct 2023
Deep Reinforcement Learning Based Cross-Layer Design in Terahertz Mesh
  Backhaul Networks
Deep Reinforcement Learning Based Cross-Layer Design in Terahertz Mesh Backhaul Networks
Zhifeng Hu
Chong Han
Xudong Wang
57
4
0
08 Oct 2023
Evaluation of Constrained Reinforcement Learning Algorithms for Legged
  Locomotion
Evaluation of Constrained Reinforcement Learning Algorithms for Legged Locomotion
Joonho Lee
Lukas Schroth
Victor Klemm
Marko Bjelonic
Alexander Reske
Marco Hutter
91
17
0
27 Sep 2023
Iterative Reachability Estimation for Safe Reinforcement Learning
Iterative Reachability Estimation for Safe Reinforcement Learning
Milan Ganai
Zheng Gong
Chenning Yu
Sylvia Herbert
Sicun Gao
79
18
0
24 Sep 2023
Price of Safety in Linear Best Arm Identification
Price of Safety in Linear Best Arm Identification
Xuedong Shang
Igor Colin
M. Barlier
Hamza Cherkaoui
LLMSV
68
5
0
15 Sep 2023
Task-Oriented Cross-System Design for Timely and Accurate Modeling in
  the Metaverse
Task-Oriented Cross-System Design for Timely and Accurate Modeling in the Metaverse
Zhen Meng
Kan Chen
Yufeng Diao
Changyang She
G. Zhao
Muhammad Ali Imran
Branka Vucetic
77
13
0
11 Sep 2023
Not Only Rewards But Also Constraints: Applications on Legged Robot
  Locomotion
Not Only Rewards But Also Constraints: Applications on Legged Robot Locomotion
Yunho Kim
H. Oh
J. Lee
Jinhyeok Choi
Gwanghyeon Ji
Moonkyu Jung
D. Youm
Jemin Hwangbo
106
47
0
24 Aug 2023
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Yuan Cheng
J. Yang
Yitao Liang
OOD
78
1
0
10 Aug 2023
Probabilistic Constrained Reinforcement Learning with Formal
  Interpretability
Probabilistic Constrained Reinforcement Learning with Formal Interpretability
Yanran Wang
Qiuchen Qian
David E. Boyle
60
4
0
13 Jul 2023
12
Next