A Lyapunov-based Approach to Safe Reinforcement Learning

20 May 2018

Yinlam Chow

Ofir Nachum

Edgar A. Duénez-Guzmán

Mohammad Ghavamzadeh

ArXiv (abs)PDF HTML

Papers citing "A Lyapunov-based Approach to Safe Reinforcement Learning"

50 / 307 papers shown

LPPG-RL: Lexicographically Projected Policy Gradient Reinforcement Learning with Subproblem ExplorationApplied Soft Computing (ASC), 2017

182

11 Nov 2025

Provably Efficient Sample Complexity for Robust CMDP

Sourav Ganguly

Arnob Ghosh

165

10 Nov 2025

Actor-Free Continuous Control via Structurally Maximizable Q-Functions

153

21 Oct 2025

Towards a Practical Understanding of Lagrangian Methods in Safe Reinforcement Learning

233

20 Oct 2025

Provably Optimal Reinforcement Learning under Safety Filtering

203

20 Oct 2025

Stress-Aware Learning under KL Drift via Trust-Decayed Mirror Descent

Gabriel Nixon Raj

139

17 Oct 2025

ProSh: Probabilistic Shielding for Model-free Reinforcement Learning

Edwin Hamel-De le Court

Gaspard Ohlmann

Francesco Belardinelli

172

17 Oct 2025

$Multi-Objective $\textit{min-max}$ Online Convex Optimization$

Multi-Objective

\textit{min-max}

Online Convex Optimization

Rahul Vaze

Sumiran Mishra

197

15 Oct 2025

A Multi-Component Reward Function with Policy Gradient for Automated Feature Selection with Dynamic Regularization and Bias Mitigation

Sudip Khadka

L.S. Paudel

148

09 Oct 2025

A Fast Initialization Method for Neural Network Controllers: A Case Study of Image-based Visual Servoing Control for the multicopter Interception

106

23 Sep 2025

SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer

426

23 Sep 2025

Off Policy Lyapunov Stability in Reinforcement Learning

Sarvan Gill

Daniela Constantinescu

170

11 Sep 2025

Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality

316

24 Aug 2025

Reinforcement Learning-based Control via Y-wise Affine Neural Networks (YANNs)

Austin Braniff

Yuhe Tian

116

22 Aug 2025

A Dynamical Systems Framework for Reinforcement Learning Safety and Robustness Verification

Ahmed Nasir

Abdelhafid Zenati

119

21 Aug 2025

Action-Constrained Imitation Learning

207

20 Aug 2025

Tail-Risk-Safe Monte Carlo Tree Search under PAC-Level Guarantees

Zuyuan Zhang

A. Ghosh

Tian-Shing Lan

239

07 Aug 2025

Proactive Constrained Policy Optimization with Preemptive Penalty

300

03 Aug 2025

Hierarchical Multi-Agent Reinforcement Learning with Control Barrier Functions for Safety-Critical Autonomous Systems

Christos G. Cassandras

Wenchao Li

360

20 Jul 2025

Accelerated Learning with Linear Temporal Logic using Differentiable Simulation

346

01 Jun 2025

Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees

239

25 May 2025

A Survey of Safe Reinforcement Learning and Constrained MDPs: A Technical Survey on Single-Agent and Multi-Agent Safety

234

22 May 2025

A universal policy wrapper with guarantees

273

18 May 2025

Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions

Kehan Long

Jorge Cortés

Nikolay Atanasov

561

16 May 2025

On the Connection Between Diffusion Models and Molecular Dynamics

Liam Harcombe

Timothy T. Duignan

DiffM

396

04 Apr 2025

FADConv: A Frequency-Aware Dynamic Convolution for Farmland Non-agriculturalization Identification and Segmentation

Tan Shu

Li Shen

349

04 Apr 2025

Learning Geometrically-Informed Lyapunov Functions with Deep Diffeomorphic RBF NetworksInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025

268

03 Apr 2025

Neural Lyapunov Function Approximation with Self-Supervised Reinforcement LearningIEEE International Conference on Robotics and Automation (ICRA), 2025

Luc McCutcheon

Bahman Gharesifard

Saber Fallah

299

19 Mar 2025

Large-Scale Auto-bidding with Nash Equilibrium Constraints

359

13 Mar 2025

Safe Explicable Policy Search

Akkamahadevi Hanni

Jonathan Montaño

Yu Zhang

408

10 Mar 2025

Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies

Zheli Xiong

336

23 Feb 2025

Polynomial-Time Approximability of Constrained Reinforcement Learning

Jeremy McMahan

925

11 Feb 2025

Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization

Zishun Yu

Tengyu Xu

Di Jin

Karthik Abinav Sankararaman

...

650

29 Jan 2025

Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto CurvesAAAI Conference on Artificial Intelligence (AAAI), 2024

389

18 Dec 2024

Neural Control and Certificate Repair via Runtime MonitoringAAAI Conference on Artificial Intelligence (AAAI), 2024

261

17 Dec 2024

Towards Fast Safe Online Reinforcement Learning via Policy Finetuning

489

05 Dec 2024

Robustness and Generalization in Quantum Reinforcement Learning via Lipschitz Regularization

Nico Meyer

Julian Berberich

Christopher Mutschler

Daniel D. Scherer

288

28 Oct 2024

Adversarial Constrained Policy Optimization: Improving Constrained Reinforcement Learning by Adapting Budgets

Jianmina Ma

Jingtian Ji

Yue Gao

214

28 Oct 2024

Knowledge Transfer from Simple to Complex: A Safe and Efficient Reinforcement Learning Framework for Autonomous Driving Decision-MakingAdvanced Engineering Informatics (AEI), 2024

428

18 Oct 2024

Probabilistic Satisfaction of Temporal Logic Constraints in Reinforcement Learning via Adaptive Policy-Switching

336

10 Oct 2024

Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning

917

08 Oct 2024

Handling Long-Term Safety and Uncertainty in Safe Reinforcement LearningConference on Robot Learning (CoRL), 2024

Jonas Günster

Puze Liu

Jan Peters

Davide Tateo

OffRL

347

18 Sep 2024

Critic as Lyapunov function (CALF): a model-free, stability-ensuring agentIEEE Conference on Decision and Control (CDC), 2024

322

15 Sep 2024

Revisiting Safe Exploration in Safe Reinforcement learning

David Eckel

Baohe Zhang

Joschka Bödecker

311

02 Sep 2024

Bridging the gap between Learning-to-plan, Motion Primitives and Safe Reinforcement LearningConference on Robot Learning (CoRL), 2024

Puze Liu

Jan Peters

256

26 Aug 2024

Meta SAC-Lag: Towards Deployable Safe Reinforcement Learning via MetaGradient-based Hyperparameter TuningIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024

Homayoun Honari

Amir M. Soufi Enayati

Mehran Ghafarian Tamizi

Homayoun Najjaran

257

15 Aug 2024

q-exponential family for policy optimizationInternational Conference on Learning Representations (ICLR), 2024

Lingwei Zhu

Haseeb Shah

Zheng Chen

Yukie Nagai

Martha White

OffRL

561

14 Aug 2024

Exterior Penalty Policy Optimization with Penalty Metric Network under Constraints

Shiqing Gao

Jiaxin Ding

Luoyi Fu

Xinbing Wang

Cheng Zhou

220

22 Jul 2024

Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning

Alessandro Montenegro

Marco Mussi

Matteo Papini

Alberto Maria Metelli

BDL

220

15 Jul 2024

Safe and Reliable Training of Learning-Based Aerospace Controllers

...

Clark Barrett

285

09 Jul 2024