ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.07708
  4. Cited By
A Lyapunov-based Approach to Safe Reinforcement Learning

A Lyapunov-based Approach to Safe Reinforcement Learning

20 May 2018
Yinlam Chow
Ofir Nachum
Edgar A. Duénez-Guzmán
Mohammad Ghavamzadeh
ArXiv (abs)PDFHTML

Papers citing "A Lyapunov-based Approach to Safe Reinforcement Learning"

50 / 307 papers shown
LPPG-RL: Lexicographically Projected Policy Gradient Reinforcement Learning with Subproblem Exploration
LPPG-RL: Lexicographically Projected Policy Gradient Reinforcement Learning with Subproblem ExplorationApplied Soft Computing (ASC), 2017
Ruiyu Qiu
Rui Wang
Guanghui Yang
Xiang Li
Zhijiang Shao
182
0
0
11 Nov 2025
Provably Efficient Sample Complexity for Robust CMDP
Provably Efficient Sample Complexity for Robust CMDP
Sourav Ganguly
Arnob Ghosh
165
0
0
10 Nov 2025
Actor-Free Continuous Control via Structurally Maximizable Q-Functions
Actor-Free Continuous Control via Structurally Maximizable Q-Functions
Yigit Korkmaz
Urvi Bhuwania
Ayush Jain
Erdem Bıyık
OffRL
153
0
0
21 Oct 2025
Towards a Practical Understanding of Lagrangian Methods in Safe Reinforcement Learning
Towards a Practical Understanding of Lagrangian Methods in Safe Reinforcement Learning
Lindsay Spoor
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
233
0
0
20 Oct 2025
Provably Optimal Reinforcement Learning under Safety Filtering
Provably Optimal Reinforcement Learning under Safety Filtering
Donggeon David Oh
D. Nguyen
Haimin Hu
J. F. Fisac
OffRL
203
0
0
20 Oct 2025
Stress-Aware Learning under KL Drift via Trust-Decayed Mirror Descent
Stress-Aware Learning under KL Drift via Trust-Decayed Mirror Descent
Gabriel Nixon Raj
139
0
0
17 Oct 2025
ProSh: Probabilistic Shielding for Model-free Reinforcement Learning
ProSh: Probabilistic Shielding for Model-free Reinforcement Learning
Edwin Hamel-De le Court
Gaspard Ohlmann
Francesco Belardinelli
172
0
0
17 Oct 2025
Multi-Objective $\textit{min-max}$ Online Convex Optimization
Multi-Objective min-max\textit{min-max}min-max Online Convex Optimization
Rahul Vaze
Sumiran Mishra
197
0
0
15 Oct 2025
A Multi-Component Reward Function with Policy Gradient for Automated Feature Selection with Dynamic Regularization and Bias Mitigation
A Multi-Component Reward Function with Policy Gradient for Automated Feature Selection with Dynamic Regularization and Bias Mitigation
Sudip Khadka
L.S. Paudel
148
0
0
09 Oct 2025
A Fast Initialization Method for Neural Network Controllers: A Case Study of Image-based Visual Servoing Control for the multicopter Interception
A Fast Initialization Method for Neural Network Controllers: A Case Study of Image-based Visual Servoing Control for the multicopter Interception
Chenxu Ke
Congling Tian
Kaichen Xu
Ye Li
Lingcong Bao
106
0
0
23 Sep 2025
SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer
SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer
Yarden As
Chengrui Qu
Benjamin Unger
Dongho Kang
Max van der Hart
Laixi Shi
Stelian Coros
Adam Wierman
Andreas Krause
OffRL
426
2
0
23 Sep 2025
Off Policy Lyapunov Stability in Reinforcement Learning
Off Policy Lyapunov Stability in Reinforcement Learning
Sarvan Gill
Daniela Constantinescu
170
1
0
11 Sep 2025
Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Shaocong Ma
Ziyi Chen
Yi Zhou
Heng Huang
OffRL
316
7
0
24 Aug 2025
Reinforcement Learning-based Control via Y-wise Affine Neural Networks (YANNs)
Reinforcement Learning-based Control via Y-wise Affine Neural Networks (YANNs)
Austin Braniff
Yuhe Tian
116
0
0
22 Aug 2025
A Dynamical Systems Framework for Reinforcement Learning Safety and Robustness Verification
A Dynamical Systems Framework for Reinforcement Learning Safety and Robustness Verification
Ahmed Nasir
Abdelhafid Zenati
119
0
0
21 Aug 2025
Action-Constrained Imitation Learning
Action-Constrained Imitation Learning
Chia-Han Yeh
Tse-Sheng Nan
Risto Vuorio
Wei-Ting Hung
Hung-Yen Wu
Shao-Hua Sun
Ping-Chun Hsieh
207
3
0
20 Aug 2025
Tail-Risk-Safe Monte Carlo Tree Search under PAC-Level Guarantees
Tail-Risk-Safe Monte Carlo Tree Search under PAC-Level Guarantees
Zuyuan Zhang
A. Ghosh
Tian-Shing Lan
239
3
0
07 Aug 2025
Proactive Constrained Policy Optimization with Preemptive Penalty
Proactive Constrained Policy Optimization with Preemptive Penalty
Ning Yang
Pengyu Wang
Guoqing Liu
Haifeng Zhang
Pin Lyu
Jun Wang
300
0
0
03 Aug 2025
Hierarchical Multi-Agent Reinforcement Learning with Control Barrier Functions for Safety-Critical Autonomous Systems
Hierarchical Multi-Agent Reinforcement Learning with Control Barrier Functions for Safety-Critical Autonomous Systems
Hijaz Ahmad
Ehsan Sabouni
Alexander Wasilkoff
Param Budhraja
Zijian Guo
Songyuan Zhang
Chuchu Fan
Christos G. Cassandras
Wenchao Li
360
3
0
20 Jul 2025
Accelerated Learning with Linear Temporal Logic using Differentiable Simulation
Accelerated Learning with Linear Temporal Logic using Differentiable Simulation
Alper Kamil Bozkurt
Calin Belta
Ming C. Lin
AI4CE
346
1
0
01 Jun 2025
Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees
Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees
Sourav Ganguly
Arnob Ghosh
Kishan Panaganti
Adam Wierman
239
3
0
25 May 2025
A Survey of Safe Reinforcement Learning and Constrained MDPs: A Technical Survey on Single-Agent and Multi-Agent Safety
A Survey of Safe Reinforcement Learning and Constrained MDPs: A Technical Survey on Single-Agent and Multi-Agent Safety
Ankita Kushwaha
Kiran Ravish
Preeti Lamba
Pawan Kumar
234
9
0
22 May 2025
A universal policy wrapper with guarantees
A universal policy wrapper with guarantees
Anton Bolychev
Georgiy Malaniya
Grigory Yaremenko
Anastasia Krasnaya
Pavel Osinenko
OffRL
273
0
0
18 May 2025
Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
Kehan Long
Jorge Cortés
Nikolay Atanasov
561
3
0
16 May 2025
On the Connection Between Diffusion Models and Molecular Dynamics
On the Connection Between Diffusion Models and Molecular Dynamics
Liam Harcombe
Timothy T. Duignan
DiffM
396
1
0
04 Apr 2025
FADConv: A Frequency-Aware Dynamic Convolution for Farmland Non-agriculturalization Identification and Segmentation
FADConv: A Frequency-Aware Dynamic Convolution for Farmland Non-agriculturalization Identification and Segmentation
Tan Shu
Li Shen
349
22
0
04 Apr 2025
Learning Geometrically-Informed Lyapunov Functions with Deep Diffeomorphic RBF Networks
Learning Geometrically-Informed Lyapunov Functions with Deep Diffeomorphic RBF NetworksInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Samuel Tesfazgi
Leonhard Sprandl
Sandra Hirche
AAML
268
0
0
03 Apr 2025
Neural Lyapunov Function Approximation with Self-Supervised Reinforcement Learning
Neural Lyapunov Function Approximation with Self-Supervised Reinforcement LearningIEEE International Conference on Robotics and Automation (ICRA), 2025
Luc McCutcheon
Bahman Gharesifard
Saber Fallah
299
1
0
19 Mar 2025
Large-Scale Auto-bidding with Nash Equilibrium Constraints
Large-Scale Auto-bidding with Nash Equilibrium Constraints
Zhiyu Mou
Miao Xu
Rongquan Bai
Zhuoran Yang
Chuan Yu
Jian Xu
Bo Zheng
359
0
0
13 Mar 2025
Safe Explicable Policy Search
Safe Explicable Policy Search
Akkamahadevi Hanni
Jonathan Montaño
Yu Zhang
408
0
0
10 Mar 2025
Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies
Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies
Zheli Xiong
336
1
0
23 Feb 2025
Polynomial-Time Approximability of Constrained Reinforcement Learning
Polynomial-Time Approximability of Constrained Reinforcement Learning
Jeremy McMahan
925
1
0
11 Feb 2025
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Zishun Yu
Tengyu Xu
Di Jin
Karthik Abinav Sankararaman
Yun He
...
Eryk Helenowski
Chen Zhu
Sinong Wang
Hao Ma
Han Fang
LRM
650
27
0
29 Jan 2025
Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto
  Curves
Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto CurvesAAAI Conference on Artificial Intelligence (AAAI), 2024
Martin Kurečka
Václav Nevyhoštěný
Petr Novotný
Vít Unčovský
389
1
0
18 Dec 2024
Neural Control and Certificate Repair via Runtime Monitoring
Neural Control and Certificate Repair via Runtime MonitoringAAAI Conference on Artificial Intelligence (AAAI), 2024
Emily Yu
Đorđe Žikelić
T. Henzinger
AAML
261
5
0
17 Dec 2024
Towards Fast Safe Online Reinforcement Learning via Policy Finetuning
Towards Fast Safe Online Reinforcement Learning via Policy Finetuning
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRLOnRL
489
1
0
05 Dec 2024
Robustness and Generalization in Quantum Reinforcement Learning via
  Lipschitz Regularization
Robustness and Generalization in Quantum Reinforcement Learning via Lipschitz Regularization
Nico Meyer
Julian Berberich
Christopher Mutschler
Daniel D. Scherer
288
5
0
28 Oct 2024
Adversarial Constrained Policy Optimization: Improving Constrained
  Reinforcement Learning by Adapting Budgets
Adversarial Constrained Policy Optimization: Improving Constrained Reinforcement Learning by Adapting Budgets
Jianmina Ma
Jingtian Ji
Yue Gao
214
0
0
28 Oct 2024
Knowledge Transfer from Simple to Complex: A Safe and Efficient
  Reinforcement Learning Framework for Autonomous Driving Decision-Making
Knowledge Transfer from Simple to Complex: A Safe and Efficient Reinforcement Learning Framework for Autonomous Driving Decision-MakingAdvanced Engineering Informatics (AEI), 2024
Rongliang Zhou
Jiakun Huang
Mingjun Li
Hepeng Li
Haotian Cao
Xiaolin Song
428
11
0
18 Oct 2024
Probabilistic Satisfaction of Temporal Logic Constraints in
  Reinforcement Learning via Adaptive Policy-Switching
Probabilistic Satisfaction of Temporal Logic Constraints in Reinforcement Learning via Adaptive Policy-Switching
Xiaoshan Lin
Sadık Bera Yüksel
Yasin Yazıcıoğlu
Derya Aksaray
336
2
0
10 Oct 2024
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Claire Chen
Shuze Liu
Shangtong Zhang
OffRL
917
1
0
08 Oct 2024
Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning
Handling Long-Term Safety and Uncertainty in Safe Reinforcement LearningConference on Robot Learning (CoRL), 2024
Jonas Günster
Puze Liu
Jan Peters
Davide Tateo
OffRL
347
3
0
18 Sep 2024
Critic as Lyapunov function (CALF): a model-free, stability-ensuring
  agent
Critic as Lyapunov function (CALF): a model-free, stability-ensuring agentIEEE Conference on Decision and Control (CDC), 2024
Pavel Osinenko
Grigory Yaremenko
Roman Zashchitin
Anton Bolychev
Sinan Ibrahim
D. Dobriborsci
322
8
0
15 Sep 2024
Revisiting Safe Exploration in Safe Reinforcement learning
Revisiting Safe Exploration in Safe Reinforcement learning
David Eckel
Baohe Zhang
Joschka Bödecker
311
1
0
02 Sep 2024
Bridging the gap between Learning-to-plan, Motion Primitives and Safe
  Reinforcement Learning
Bridging the gap between Learning-to-plan, Motion Primitives and Safe Reinforcement LearningConference on Robot Learning (CoRL), 2024
Piotr Kicki
Davide Tateo
Puze Liu
Jonas Guenster
Jan Peters
Krzysztof Walas
256
6
0
26 Aug 2024
Meta SAC-Lag: Towards Deployable Safe Reinforcement Learning via
  MetaGradient-based Hyperparameter Tuning
Meta SAC-Lag: Towards Deployable Safe Reinforcement Learning via MetaGradient-based Hyperparameter TuningIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Homayoun Honari
Amir M. Soufi Enayati
Mehran Ghafarian Tamizi
Homayoun Najjaran
257
3
0
15 Aug 2024
q-exponential family for policy optimization
q-exponential family for policy optimizationInternational Conference on Learning Representations (ICLR), 2024
Lingwei Zhu
Haseeb Shah
Zheng Chen
Yukie Nagai
Martha White
OffRL
561
3
0
14 Aug 2024
Exterior Penalty Policy Optimization with Penalty Metric Network under
  Constraints
Exterior Penalty Policy Optimization with Penalty Metric Network under Constraints
Shiqing Gao
Jiaxin Ding
Luoyi Fu
Xinbing Wang
Cheng Zhou
220
4
0
22 Jul 2024
Last-Iterate Global Convergence of Policy Gradients for Constrained
  Reinforcement Learning
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning
Alessandro Montenegro
Marco Mussi
Matteo Papini
Alberto Maria Metelli
BDL
220
3
0
15 Jul 2024
Safe and Reliable Training of Learning-Based Aerospace Controllers
Safe and Reliable Training of Learning-Based Aerospace Controllers
Udayan Mandal
Guy Amir
Haoze Wu
Ieva Daukantas
Fletcher Lee Newell
...
Kerianne Hobbs
Milan Ganai
Tobey Shim
Guy Katz
Clark Barrett
285
9
0
09 Jul 2024
1234567
Next
Page 1 of 7