2011.05869
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee
11 November 2020
Tengyu Xu
Yingbin Liang
Guanghui Lan
Papers citing
"CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee"
50 / 85 papers shown
Title
Graph Neural Network Aided Deep Reinforcement Learning for Resource Allocation in Dynamic Terahertz UAV Networks
Zhifeng Hu
Chong Han
46
0
0
08 May 2025
Multi-Constraint Safe Reinforcement Learning via Closed-form Solution for Log-Sum-Exp Approximation of Control Barrier Functions
Chenggang Wang
Xinyi Wang
Yutong Dong
Lei Song
Xinping Guan
31
0
0
01 May 2025
Solving Multi-Agent Safe Optimal Control with Distributed Epigraph Form MARL
Songyuan Zhang
Oswin So
Mitchell Black
Zachary Serlin
Chuchu Fan
31
0
0
21 Apr 2025
Primal-Dual Sample Complexity Bounds for Constrained Markov Decision Processes with Multiple Constraints
Max Buckley
Konstantinos Papathanasiou
Andreas Spanopoulos
50
0
0
09 Mar 2025
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Shangding Gu
Laixi Shi
Muning Wen
Ming Jin
Eric Mazumdar
Yuejie Chi
Adam Wierman
C. Spanos
OOD
OffRL
36
1
0
27 Feb 2025
Safety Representations for Safer Policy Learning
Kaustubh Mani
Vincent Mai
Charlie Gauthier
Annie Chen
Samer Nashed
Liam Paull
40
0
0
27 Feb 2025
Reward-Safety Balance in Offline Safe RL via Diffusion Regularization
Junyu Guo
Zhi Zheng
Donghao Ying
Ming Jin
Shangding Gu
C. Spanos
Javad Lavaei
OffRL
56
0
0
18 Feb 2025
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
Zijian Guo
Weichao Zhou
Wenchao Li
OffRL
97
2
0
28 Jan 2025
Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics
S. Hazra
P. Dasgupta
Soumyajit Dey
36
0
0
21 Jan 2025
Adversarial Constrained Policy Optimization: Improving Constrained Reinforcement Learning by Adapting Budgets
Jianming Ma
Jingtian Ji
Yue Gao
23
0
0
28 Oct 2024
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator
Siyuan Xu
Minghui Zhu
OffRL
30
1
0
13 Oct 2024
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
Yarden As
Bhavya Sukhija
Lenart Treven
Carmelo Sferrazza
Stelian Coros
Andreas Krause
27
1
0
12 Oct 2024
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu
Yuxin Pan
Linjie Xu
Lei Song
Jiang Bian
Pengcheng You
Yize Chen
40
1
0
03 Oct 2024
The Perfect Blend: Redefining RLHF with Mixture of Judges
Tengyu Xu
Eryk Helenowski
Karthik Abinav Sankararaman
Di Jin
Kaiyan Peng
...
Gabriel Cohen
Yuandong Tian
Hao Ma
Sinong Wang
Han Fang
38
9
0
30 Sep 2024
An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
Qian Lin
Zongkai Liu
Danying Mo
Chao Yu
OffRL
28
1
0
16 Sep 2024
Vision-driven UAV River Following: Benchmarking with Safe Reinforcement Learning
Zihan Wang
N. Mahmoudian
23
2
0
13 Sep 2024
Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource Allocation and Task Offloading in TeraHertz Band Space Networks
Zhifeng Hu
Chong Han
Wolfgang H. Gerstacker
I. F. Akyildiz
16
0
0
12 Sep 2024
Last-Iterate Convergence of General Parameterized Policies in Constrained MDPs
Washim Uddin Mondal
Vaneet Aggarwal
41
1
0
21 Aug 2024
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning
Alessandro Montenegro
Marco Mussi
Matteo Papini
Alberto Maria Metelli
BDL
40
1
0
15 Jul 2024
Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey
Milan Ganai
Sicun Gao
Sylvia L. Herbert
40
6
0
12 Jul 2024
Optimal Transport-Assisted Risk-Sensitive Q-Learning
Zahra Shahrooei
Ali Baheri
32
2
0
17 Jun 2024
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
Zhehua Zhou
Xuan Xie
Jiayang Song
Zhan Shu
Lei Ma
47
1
0
06 Jun 2024
Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
Shangding Gu
Laixi Shi
Yuhao Ding
Alois Knoll
C. Spanos
Adam Wierman
Ming Jin
OffRL
35
2
0
31 May 2024
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
Dohyeong Kim
Taehyun Cho
Seung Han
Hojun Chung
Kyungjae Lee
Songhwai Oh
34
1
0
29 May 2024
A CMDP-within-online framework for Meta-Safe Reinforcement Learning
Vanshaj Khattar
Yuhao Ding
Bilgehan Sel
Javad Lavaei
Ming Jin
OffRL
32
12
0
26 May 2024
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Alois Knoll
Ming Jin
42
1
0
26 May 2024
Federated Reinforcement Learning with Constraint Heterogeneity
Hao Jin
Liangyu Zhang
Zhihua Zhang
35
0
0
06 May 2024
Constrained Reinforcement Learning Under Model Mismatch
Zhongchang Sun
Sihong He
Fei Miao
Shaofeng Zou
44
4
0
02 May 2024
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Ming Jin
Alois Knoll
57
9
0
02 May 2024
Myopically Verifiable Probabilistic Certificates for Safe Control and Learning
Zhuoyuan Wang
Haoming Jing
Christian Kurniawan
Albert Chern
Yorie Nakahira
39
1
0
23 Apr 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
29
6
0
22 Apr 2024
Primal Methods for Variational Inequality Problems with Functional Constraints
Liang Zhang
Niao He
Michael Muehlebach
39
2
0
19 Mar 2024
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning
Dohyeong Kim
Mineui Hong
Jeongho Park
Songhwai Oh
27
0
0
01 Mar 2024
A Survey of Constraint Formulations in Safe Reinforcement Learning
Akifumi Wachi
Xun Shen
Yanan Sui
33
10
0
03 Feb 2024
Off-Policy Primal-Dual Safe Reinforcement Learning
Zifan Wu
Bo Tang
Qian Lin
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
24
3
0
26 Jan 2024
Gradient Shaping for Multi-Constraint Safe Reinforcement Learning
Yi-Fan Yao
Zuxin Liu
Zhepeng Cen
Peide Huang
Tingnan Zhang
Wenhao Yu
Ding Zhao
OffRL
71
6
0
23 Dec 2023
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk
Dohyeong Kim
Songhwai Oh
OffRL
14
19
0
01 Dec 2023
A safe exploration approach to constrained Markov decision processes
Tingting Ni
Maryam Kamgarpour
30
3
0
01 Dec 2023
State-Wise Safe Reinforcement Learning With Pixel Observations
S. Zhan
Yixuan Wang
Qingyuan Wu
Ruochen Jiao
Chao Huang
Qi Zhu
43
10
0
03 Nov 2023
Reinforcement Learning in a Safety-Embedded MDP with Trajectory Optimization
Fan Yang
Wen-Min Zhou
Zuxin Liu
Ding Zhao
David Held
17
1
0
10 Oct 2023
Deep Reinforcement Learning Based Cross-Layer Design in Terahertz Mesh Backhaul Networks
Zhifeng Hu
Chong Han
Xudong Wang
19
4
0
08 Oct 2023
Evaluation of Constrained Reinforcement Learning Algorithms for Legged Locomotion
Joonho Lee
Lukas Schroth
Victor Klemm
Marko Bjelonic
Alexander Reske
Marco Hutter
25
14
0
27 Sep 2023
Iterative Reachability Estimation for Safe Reinforcement Learning
Milan Ganai
Zheng Gong
Chenning Yu
Sylvia L. Herbert
Sicun Gao
30
17
0
24 Sep 2023
Price of Safety in Linear Best Arm Identification
Xuedong Shang
Igor Colin
M. Barlier
Hamza Cherkaoui
LLMSV
19
3
0
15 Sep 2023
Task-Oriented Cross-System Design for Timely and Accurate Modeling in the Metaverse
Zhen Meng
Kan Chen
Yufeng Diao
Changyang She
G. Zhao
Muhammad Ali Imran
B. Vucetic
31
12
0
11 Sep 2023
Not Only Rewards But Also Constraints: Applications on Legged Robot Locomotion
Yunho Kim
H. Oh
J. Lee
Jinhyeok Choi
Gwanghyeon Ji
Moonkyu Jung
D. Youm
Jemin Hwangbo
19
42
0
24 Aug 2023
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Yuan Cheng
J. Yang
Yitao Liang
OOD
36
1
0
10 Aug 2023
Probabilistic Constrained Reinforcement Learning with Formal Interpretability
Yanran Wang
Qiuchen Qian
David E. Boyle
16
4
0
13 Jul 2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Dongsheng Ding
Chen-Yu Wei
Kaipeng Zhang
Alejandro Ribeiro
40
19
0
20 Jun 2023
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Qian Lin
Bo Tang
Zifan Wu
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
34
11
0
01 Jun 2023