Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.10330
Cited By
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
20 May 2022
Shangding Gu
Longyu Yang
Yali Du
Guang Chen
Florian Walter
Jun Wang
Alois C. Knoll
OffRL
AI4TS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Review of Safe Reinforcement Learning: Methods, Theory and Applications"
50 / 150 papers shown
Title
Robust Constrained-MDPs: Soft-Constrained Robust Policy Optimization under Model Uncertainty
R. Russel
M. Benosman
J. Baar
44
20
0
10 Oct 2020
Projection-Based Constrained Policy Optimization
Tsung-Yen Yang
Justinian P. Rosca
Karthik Narasimhan
Peter J. Ramadge
13
238
0
07 Oct 2020
Nearly Minimax Optimal Reinforcement Learning for Discounted MDPs
Jiafan He
Dongruo Zhou
Quanquan Gu
45
37
0
01 Oct 2020
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
Yuke Zhu
J. Wong
Ajay Mandlekar
Roberto Martín-Martín
Abhishek Joshi
Soroush Nasiriany
Yifeng Zhu
Soroush Nasiriany
Yifeng Zhu
144
438
0
25 Sep 2020
A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
38
52
0
23 Sep 2020
Constrained Markov Decision Processes via Backward Value Functions
Harsh Satija
Philip Amortila
Joelle Pineau
60
52
0
26 Aug 2020
Safe Reinforcement Learning in Constrained Markov Decision Processes
Akifumi Wachi
Yanan Sui
21
145
0
15 Aug 2020
Multi-Principal Assistance Games
Arnaud Fickinger
Simon Zhuang
Dylan Hadfield-Menell
Stuart J. Russell
8
13
0
19 Jul 2020
Verifiably Safe Exploration for End-to-End Reinforcement Learning
Nathan Hunt
Nathan Fulton
Sara Magliacane
Nghia Hoang
Subhro Das
Armando Solar-Lezama
OffRL
35
50
0
02 Jul 2020
ShieldNN: A Provably Safe NN Filter for Unsafe NN Controllers
James Ferlez
Mahmoud M. Elnaggar
Yasser Shoukry
C. Fleming
AAML
69
33
0
16 Jun 2020
SAMBA: Safe Model-Based & Active Reinforcement Learning
Alexander I. Cowen-Rivers
Daniel Palenicek
Vincent Moens
Mohammed Abdullah
Aivar Sootla
Jun Wang
Haitham Bou-Ammar
45
44
0
12 Jun 2020
Constrained episodic reinforcement learning in concave-convex and knapsack settings
Kianté Brantley
Miroslav Dudík
Thodoris Lykouris
Sobhan Miryoosefi
Max Simchowitz
Aleksandrs Slivkins
Wen Sun
OffRL
38
51
0
09 Jun 2020
Reinforcement Learning for Safety-Critical Control under Model Uncertainty, using Control Lyapunov Functions and Control Barrier Functions
Jason J. Choi
F. Castañeda
Claire Tomlin
Koushil Sreenath
31
181
0
16 Apr 2020
FACMAC: Factored Multi-Agent Centralised Policy Gradients
Bei Peng
Tabish Rashid
Christian Schroeder de Witt
Pierre-Alexandre Kamienny
Philip Torr
Wendelin Bohmer
Shimon Whiteson
34
255
0
14 Mar 2020
Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization
Lu Wen
Jingliang Duan
Shengbo Eben Li
Shaobing Xu
H. Peng
35
68
0
03 Mar 2020
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
Dongsheng Ding
Xiaohan Wei
Zhuoran Yang
Zhaoran Wang
M. Jovanović
59
162
0
01 Mar 2020
Cautious Reinforcement Learning with Logical Constraints
Mohammadhosein Hasanbeig
Alessandro Abate
Daniel Kroening
26
75
0
26 Feb 2020
Safe reinforcement learning for probabilistic reachability and safety specifications: A Lyapunov-based approach
Subin Huh
Insoon Yang
24
20
0
24 Feb 2020
Reinforcement Learning via Fenchel-Rockafellar Duality
Ofir Nachum
Bo Dai
OffRL
75
119
0
07 Jan 2020
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
92
1,811
0
13 Dec 2019
Risk-Averse Trust Region Optimization for Reward-Volatility Reduction
Qianggang Ding
Sifan Wu
Hao Sun
Jiadong Guo
Jian Guo
20
127
0
06 Dec 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
119
1,197
0
24 Nov 2019
Safe Policies for Reinforcement Learning via Primal-Dual Methods
Santiago Paternain
Miguel Calvo-Fullana
Luiz F. O. Chamon
Alejandro Ribeiro
32
102
0
20 Nov 2019
Constrained Reinforcement Learning Has Zero Duality Gap
Santiago Paternain
Luiz F. O. Chamon
Miguel Calvo-Fullana
Alejandro Ribeiro
26
191
0
29 Oct 2019
Convergent Policy Optimization for Safe Reinforcement Learning
Ming Yu
Zhuoran Yang
Mladen Kolar
Zhaoran Wang
47
95
0
26 Oct 2019
IPO: Interior-point Policy Optimization under Constraints
Yongshuai Liu
J. Ding
Xin Liu
36
178
0
21 Oct 2019
Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation
Ziyang Tang
Yihao Feng
Lihong Li
Dengyong Zhou
Qiang Liu
OffRL
88
68
0
16 Oct 2019
Safe Reinforcement Learning on Autonomous Vehicles
David Isele
A. Nakhaei
K. Fujimura
56
77
0
27 Sep 2019
DEAR: Deep Reinforcement Learning for Online Advertising Impression in Recommender Systems
Xiangyu Zhao
Changsheng Gu
Haoshenglun Zhang
Xiwang Yang
Xiaobing Liu
Jiliang Tang
Hui Liu
OffRL
31
100
0
09 Sep 2019
An Inductive Synthesis Framework for Verifiable Reinforcement Learning
He Zhu
Zikang Xiong
Stephen Magill
Suresh Jagannathan
47
95
0
16 Jul 2019
Policy Optimization with Stochastic Mirror Descent
Long Yang
Yu Zhang
Gang Zheng
Qian Zheng
Pengfei Li
Jianhang Huang
Jun Wen
Gang Pan
65
34
0
25 Jun 2019
Reinforcement Learning with Convex Constraints
Sobhan Miryoosefi
Kianté Brantley
Hal Daumé
Miroslav Dudík
Robert Schapire
34
91
0
21 Jun 2019
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Ofir Nachum
Yinlam Chow
Bo Dai
Lihong Li
OffRL
85
332
0
10 Jun 2019
Challenges of Real-World Reinforcement Learning
Gabriel Dulac-Arnold
D. Mankowitz
Todd Hester
OffRL
69
545
0
29 Apr 2019
Batch Policy Learning under Constraints
Hoang Minh Le
Cameron Voloshin
Yisong Yue
OffRL
45
328
0
20 Mar 2019
A Review of Reinforcement Learning for Autonomous Building Energy Management
Karl Mason
S. Grijalva
AI4CE
36
221
0
12 Mar 2019
Multi-Agent Deep Reinforcement Learning for Large-scale Traffic Signal Control
Tianshu Chu
Jie Wang
Lara Codecà
Zhaojian Li
32
662
0
11 Mar 2019
Safety-Guided Deep Reinforcement Learning via Online Gaussian Process Estimation
Jiameng Fan
Wenchao Li
OffRL
OnRL
GP
36
18
0
06 Mar 2019
Open-Sourced Reinforcement Learning Environments for Surgical Robotics
Florian Richter
Ryan K. Orosco
Michael C. Yip
OffRL
46
79
0
05 Mar 2019
Parenting: Safe Reinforcement Learning from Human Input
Christopher Frye
Ilya Feige
56
7
0
18 Feb 2019
Verifiably Safe Off-Model Reinforcement Learning
Nathan Fulton
André Platzer
OffRL
26
67
0
14 Feb 2019
Lyapunov-based Safe Policy Optimization for Continuous Control
Yinlam Chow
Ofir Nachum
Aleksandra Faust
Edgar A. Duénez-Guzmán
Mohammad Ghavamzadeh
51
245
0
28 Jan 2019
Macro action selection with deep reinforcement learning in StarCraft
Sijia Xu
Hongyu Kuang
Zhi Zhuang
Renjie Hu
Yang Liu
Huyang Sun
42
27
0
02 Dec 2018
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
70
402
0
19 Nov 2018
Modular Architecture for StarCraft II with Deep Reinforcement Learning
Dennis Lee
Mizanur Rahman
Jeffrey O. Zhang
Huazhe Xu
Jerome McClendon
Pieter Abbeel
62
56
0
08 Nov 2018
Safe Reinforcement Learning with Model Uncertainty Estimates
Björn Lütjens
Michael Everett
Jonathan P. How
55
166
0
19 Oct 2018
The highD Dataset: A Drone Dataset of Naturalistic Vehicle Trajectories on German Highways for Validation of Highly Automated Driving Systems
R. Krajewski
Julian Bock
Laurent Kloeker
L. Eckstein
70
982
0
11 Oct 2018
TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game
Peng Sun
Xinghai Sun
Lei Han
Jiechao Xiong
Qing Wang
...
Yang Zheng
Ji Liu
Yongsheng Liu
Han Liu
Tong Zhang
47
75
0
19 Sep 2018
Learning Dexterous In-Hand Manipulation
OpenAI OpenAI
Marcin Andrychowicz
Bowen Baker
Maciek Chociej
Rafal Jozefowicz
...
Szymon Sidor
Joshua Tobin
Peter Welinder
Lilian Weng
Wojciech Zaremba
73
1,865
0
01 Aug 2018
Learning to Drive in a Day
Alex Kendall
Jeffrey Hawke
David Janz
Przemyslaw Mazur
Daniele Reda
John M. Allen
Vinh-Dieu Lam
Alex Bewley
Amar Shah
82
652
0
01 Jul 2018
Previous
1
2
3
Next