1805.07708
A Lyapunov-based Approach to Safe Reinforcement Learning
20 May 2018
Yinlam Chow
Ofir Nachum
Edgar A. Duénez-Guzmán
Mohammad Ghavamzadeh
Papers citing
"A Lyapunov-based Approach to Safe Reinforcement Learning"
50 / 307 papers shown
LPPG-RL: Lexicographically Projected Policy Gradient Reinforcement Learning with Subproblem Exploration
Applied Soft Computing (ASC), 2017
Ruiyu Qiu
Rui Wang
Guanghui Yang
Xiang Li
Zhijiang Shao
182
0
0
11 Nov 2025
Provably Efficient Sample Complexity for Robust CMDP
Sourav Ganguly
Arnob Ghosh
165
0
0
10 Nov 2025
Actor-Free Continuous Control via Structurally Maximizable Q-Functions
Yigit Korkmaz
Urvi Bhuwania
Ayush Jain
Erdem Bıyık
OffRL
153
0
0
21 Oct 2025
Towards a Practical Understanding of Lagrangian Methods in Safe Reinforcement Learning
Lindsay Spoor
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
233
0
0
20 Oct 2025
Provably Optimal Reinforcement Learning under Safety Filtering
Donggeon David Oh
D. Nguyen
Haimin Hu
J. F. Fisac
OffRL
203
0
0
20 Oct 2025
Stress-Aware Learning under KL Drift via Trust-Decayed Mirror Descent
Gabriel Nixon Raj
139
0
0
17 Oct 2025
ProSh: Probabilistic Shielding for Model-free Reinforcement Learning
Edwin Hamel-De le Court
Gaspard Ohlmann
Francesco Belardinelli
172
0
0
17 Oct 2025
Multi-Objective min-max Online Convex Optimization
Rahul Vaze
Sumiran Mishra
197
0
0
15 Oct 2025
A Multi-Component Reward Function with Policy Gradient for Automated Feature Selection with Dynamic Regularization and Bias Mitigation
Sudip Khadka
L.S. Paudel
148
0
0
09 Oct 2025
A Fast Initialization Method for Neural Network Controllers: A Case Study of Image-based Visual Servoing Control for the multicopter Interception
Chenxu Ke
Congling Tian
Kaichen Xu
Ye Li
Lingcong Bao
106
0
0
23 Sep 2025
SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer
Yarden As
Chengrui Qu
Benjamin Unger
Dongho Kang
Max van der Hart
Laixi Shi
Stelian Coros
Adam Wierman
Andreas Krause
OffRL
426
2
0
23 Sep 2025
Off Policy Lyapunov Stability in Reinforcement Learning
Sarvan Gill
Daniela Constantinescu
170
1
0
11 Sep 2025
Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Shaocong Ma
Ziyi Chen
Yi Zhou
Heng Huang
OffRL
316
7
0
24 Aug 2025
Reinforcement Learning-based Control via Y-wise Affine Neural Networks (YANNs)
Austin Braniff
Yuhe Tian
116
0
0
22 Aug 2025
A Dynamical Systems Framework for Reinforcement Learning Safety and Robustness Verification
Ahmed Nasir
Abdelhafid Zenati
119
0
0
21 Aug 2025
Action-Constrained Imitation Learning
Chia-Han Yeh
Tse-Sheng Nan
Risto Vuorio
Wei-Ting Hung
Hung-Yen Wu
Shao-Hua Sun
Ping-Chun Hsieh
207
3
0
20 Aug 2025
Tail-Risk-Safe Monte Carlo Tree Search under PAC-Level Guarantees
Zuyuan Zhang
A. Ghosh
Tian-Shing Lan
239
3
0
07 Aug 2025
Proactive Constrained Policy Optimization with Preemptive Penalty
Ning Yang
Pengyu Wang
Guoqing Liu
Haifeng Zhang
Pin Lyu
Jun Wang
300
0
0
03 Aug 2025
Hierarchical Multi-Agent Reinforcement Learning with Control Barrier Functions for Safety-Critical Autonomous Systems
Hijaz Ahmad
Ehsan Sabouni
Alexander Wasilkoff
Param Budhraja
Zijian Guo
Songyuan Zhang
Chuchu Fan
Christos G. Cassandras
Wenchao Li
360
3
0
20 Jul 2025
Accelerated Learning with Linear Temporal Logic using Differentiable Simulation
Alper Kamil Bozkurt
Calin Belta
Ming C. Lin
AI4CE
346
1
0
01 Jun 2025
Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees
Sourav Ganguly
Arnob Ghosh
Kishan Panaganti
Adam Wierman
239
3
0
25 May 2025
A Survey of Safe Reinforcement Learning and Constrained MDPs: A Technical Survey on Single-Agent and Multi-Agent Safety
Ankita Kushwaha
Kiran Ravish
Preeti Lamba
Pawan Kumar
234
9
0
22 May 2025
A universal policy wrapper with guarantees
Anton Bolychev
Georgiy Malaniya
Grigory Yaremenko
Anastasia Krasnaya
Pavel Osinenko
OffRL
273
0
0
18 May 2025
Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
Kehan Long
Jorge Cortés
Nikolay Atanasov
561
3
0
16 May 2025
On the Connection Between Diffusion Models and Molecular Dynamics
Liam Harcombe
Timothy T. Duignan
DiffM
396
1
0
04 Apr 2025
FADConv: A Frequency-Aware Dynamic Convolution for Farmland Non-agriculturalization Identification and Segmentation
Tan Shu
Li Shen
349
22
0
04 Apr 2025
Learning Geometrically-Informed Lyapunov Functions with Deep Diffeomorphic RBF Networks
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Samuel Tesfazgi
Leonhard Sprandl
Sandra Hirche
AAML
268
0
0
03 Apr 2025
Neural Lyapunov Function Approximation with Self-Supervised Reinforcement Learning
IEEE International Conference on Robotics and Automation (ICRA), 2025
Luc McCutcheon
Bahman Gharesifard
Saber Fallah
299
1
0
19 Mar 2025
Large-Scale Auto-bidding with Nash Equilibrium Constraints
Zhiyu Mou
Miao Xu
Rongquan Bai
Zhuoran Yang
Chuan Yu
Jian Xu
Bo Zheng
359
0
0
13 Mar 2025
Safe Explicable Policy Search
Akkamahadevi Hanni
Jonathan Montaño
Yu Zhang
408
0
0
10 Mar 2025
Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies
Zheli Xiong
336
1
0
23 Feb 2025
Polynomial-Time Approximability of Constrained Reinforcement Learning
Jeremy McMahan
925
1
0
11 Feb 2025
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Zishun Yu
Tengyu Xu
Di Jin
Karthik Abinav Sankararaman
Yun He
...
Eryk Helenowski
Chen Zhu
Sinong Wang
Hao Ma
Han Fang
LRM
650
27
0
29 Jan 2025
Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves
AAAI Conference on Artificial Intelligence (AAAI), 2024
Martin Kurečka
Václav Nevyhoštěný
Petr Novotný
Vít Unčovský
389
1
0
18 Dec 2024
Neural Control and Certificate Repair via Runtime Monitoring
AAAI Conference on Artificial Intelligence (AAAI), 2024
Emily Yu
Đorđe Žikelić
T. Henzinger
AAML
261
5
0
17 Dec 2024
Towards Fast Safe Online Reinforcement Learning via Policy Finetuning
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRL
OnRL
489
1
0
05 Dec 2024
Robustness and Generalization in Quantum Reinforcement Learning via Lipschitz Regularization
Nico Meyer
Julian Berberich
Christopher Mutschler
Daniel D. Scherer
288
5
0
28 Oct 2024
Adversarial Constrained Policy Optimization: Improving Constrained Reinforcement Learning by Adapting Budgets
Jianmina Ma
Jingtian Ji
Yue Gao
214
0
0
28 Oct 2024
Knowledge Transfer from Simple to Complex: A Safe and Efficient Reinforcement Learning Framework for Autonomous Driving Decision-Making
Advanced Engineering Informatics (AEI), 2024
Rongliang Zhou
Jiakun Huang
Mingjun Li
Hepeng Li
Haotian Cao
Xiaolin Song
428
11
0
18 Oct 2024
Probabilistic Satisfaction of Temporal Logic Constraints in Reinforcement Learning via Adaptive Policy-Switching
Xiaoshan Lin
Sadık Bera Yüksel
Yasin Yazıcıoğlu
Derya Aksaray
336
2
0
10 Oct 2024
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Claire Chen
Shuze Liu
Shangtong Zhang
OffRL
917
1
0
08 Oct 2024
Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning
Conference on Robot Learning (CoRL), 2024
Jonas Günster
Puze Liu
Jan Peters
Davide Tateo
OffRL
347
3
0
18 Sep 2024
Critic as Lyapunov function (CALF): a model-free, stability-ensuring agent
IEEE Conference on Decision and Control (CDC), 2024
Pavel Osinenko
Grigory Yaremenko
Roman Zashchitin
Anton Bolychev
Sinan Ibrahim
D. Dobriborsci
322
8
0
15 Sep 2024
Revisiting Safe Exploration in Safe Reinforcement learning
David Eckel
Baohe Zhang
Joschka Bödecker
311
1
0
02 Sep 2024
Bridging the gap between Learning-to-plan, Motion Primitives and Safe Reinforcement Learning
Conference on Robot Learning (CoRL), 2024
Piotr Kicki
Davide Tateo
Puze Liu
Jonas Guenster
Jan Peters
Krzysztof Walas
256
6
0
26 Aug 2024
Meta SAC-Lag: Towards Deployable Safe Reinforcement Learning via MetaGradient-based Hyperparameter Tuning
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
Homayoun Honari
Amir M. Soufi Enayati
Mehran Ghafarian Tamizi
Homayoun Najjaran
257
3
0
15 Aug 2024
q-exponential family for policy optimization
International Conference on Learning Representations (ICLR), 2024
Lingwei Zhu
Haseeb Shah
Zheng Chen
Yukie Nagai
Martha White
OffRL
561
3
0
14 Aug 2024
Exterior Penalty Policy Optimization with Penalty Metric Network under Constraints
Shiqing Gao
Jiaxin Ding
Luoyi Fu
Xinbing Wang
Cheng Zhou
220
4
0
22 Jul 2024
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning
Alessandro Montenegro
Marco Mussi
Matteo Papini
Alberto Maria Metelli
BDL
220
3
0
15 Jul 2024
Safe and Reliable Training of Learning-Based Aerospace Controllers
Udayan Mandal
Guy Amir
Haoze Wu
Ieva Daukantas
Fletcher Lee Newell
...
Kerianne Hobbs
Milan Ganai
Tobey Shim
Guy Katz
Clark Barrett
285
9
0
09 Jul 2024