Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.18539
Cited By
Safe and Robust Reinforcement Learning: Principles and Practice
27 March 2024
Taku Yamagata
Raúl Santos-Rodríguez
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Safe and Robust Reinforcement Learning: Principles and Practice"
50 / 52 papers shown
Title
Reinforcement learning
Florentin Wörgötter
75
2,554
0
16 May 2024
Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms
Akifumi Wachi
Wataru Hashimoto
Xun Shen
Kazumune Hashimoto
53
10
0
05 Oct 2023
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Stephen Casper
Xander Davies
Claudia Shi
T. Gilbert
Jérémy Scheurer
...
Erdem Biyik
Anca Dragan
David M. Krueger
Dorsa Sadigh
Dylan Hadfield-Menell
ALM
OffRL
98
497
0
27 Jul 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
300
3,712
0
29 May 2023
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL
Taku Yamagata
Ahmed Khalil
Raúl Santos-Rodríguez
OffRL
177
75
0
08 Sep 2022
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Shangding Gu
Longyu Yang
Yali Du
Guang Chen
Florian Walter
Jun Wang
Alois C. Knoll
OffRL
AI4TS
149
250
0
20 May 2022
Explainability in reinforcement learning: perspective and position
Agneza Krajna
Mario Brčič
T. Lipić
Juraj Dončević
73
27
0
22 Mar 2022
Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization
Quanyi Li
Zhenghao Peng
Bolei Zhou
114
35
0
17 Feb 2022
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
Denis Yarats
David Brandfonbrener
Hao Liu
Michael Laskin
Pieter Abbeel
A. Lazaric
Lerrel Pinto
OffRL
OnRL
58
86
0
31 Jan 2022
The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models
Alexander Pan
Kush S. Bhatia
Jacob Steinhardt
73
174
0
10 Jan 2022
Reinforcement Learning with Feedback from Multiple Humans with Diverse Skills
Taku Yamagata
Ryan McConville
Raúl Santos-Rodríguez
23
7
0
16 Nov 2021
Multi-Agent Constrained Policy Optimisation
Shangding Gu
J. Kuba
Munning Wen
Ruiqing Chen
Ziyan Wang
Zheng Tian
Jun Wang
Alois Knoll
Yaodong Yang
113
49
0
06 Oct 2021
Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning
Lukas Brunke
Melissa Greeff
Adam W. Hall
Zhaocong Yuan
Siqi Zhou
Jacopo Panerati
Angela P. Schoellig
OffRL
50
610
0
13 Aug 2021
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training
Kimin Lee
Laura M. Smith
Pieter Abbeel
OffRL
56
282
0
09 Jun 2021
Ensemble Quantile Networks: Uncertainty-Aware Reinforcement Learning with Applications in Autonomous Driving
C. Hoel
Krister Wolff
L. Laine
UQCV
EDL
44
40
0
21 May 2021
Hierarchical Program-Triggered Reinforcement Learning Agents For Automated Driving
Briti Gangopadhyay
Harshit Soora
P. Dasgupta
28
35
0
25 Mar 2021
Safe Multi-Agent Reinforcement Learning via Shielding
Ingy Elsayed-Aly
Suda Bharadwaj
Chris Amato
Rüdiger Ehlers
Ufuk Topcu
Lu Feng
35
94
0
27 Jan 2021
Model-Based Reinforcement Learning for Type 1Diabetes Blood Glucose Control
Taku Yamagata
A. O'Kane
Amid Ayobi
Dmitri S. Katz
Katarzyna Stawarz
P. Marshall
Peter A. Flach
Raúl Santos-Rodríguez University of Bristol
OOD
BDL
OffRL
13
10
0
13 Oct 2020
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey
Wenshuai Zhao
Jorge Peña Queralta
Tomi Westerlund
OffRL
159
724
0
24 Sep 2020
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems
Vinicius G. Goecks
91
11
0
30 Aug 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
190
1,338
0
15 Apr 2020
Meta-Learning in Neural Networks: A Survey
Timothy M. Hospedales
Antreas Antoniou
P. Micaelli
Amos Storkey
OOD
318
1,950
0
11 Apr 2020
A Comprehensive Survey on Transfer Learning
Fuzhen Zhuang
Zhiyuan Qi
Keyu Duan
Dongbo Xi
Yongchun Zhu
Hengshu Zhu
Hui Xiong
Qing He
173
4,395
0
07 Nov 2019
IPO: Interior-point Policy Optimization under Constraints
Yongshuai Liu
J. Ding
Xin Liu
44
178
0
21 Oct 2019
Deep Drone Racing: From Simulation to Reality with Domain Randomization
Antonio Loquercio
Elia Kaufmann
René Ranftl
Alexey Dosovitskiy
V. Koltun
Davide Scaramuzza
69
210
0
21 May 2019
Risk Averse Robust Adversarial Reinforcement Learning
Xinlei Pan
Daniel Seita
Yang Gao
John F. Canny
AAML
47
96
0
31 Mar 2019
The Ethics of AI Ethics -- An Evaluation of Guidelines
Thilo Hagendorff
AI4TS
58
1,174
0
28 Feb 2019
Trust Region-Guided Proximal Policy Optimization
Yuhui Wang
Hao He
Xiaoyang Tan
Yaozhong Gan
OffRL
45
56
0
29 Jan 2019
Intervention Aided Reinforcement Learning for Safe and Practical Policy Optimization in Navigation
Fan Wang
Bo Zhou
Ke Chen
Tingxiang Fan
Xi Zhang
Jiangyong Li
Hao Tian
Jia Pan
29
26
0
15 Nov 2018
Meta-Learning: A Survey
Joaquin Vanschoren
FedML
OOD
58
756
0
08 Oct 2018
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation
J. Matas
Stephen James
Andrew J. Davison
AI4CE
51
359
0
20 Jun 2018
Implicit Quantile Networks for Distributional Reinforcement Learning
Will Dabney
Georg Ostrovski
David Silver
Rémi Munos
OffRL
85
529
0
14 Jun 2018
Reward Constrained Policy Optimization
Chen Tessler
D. Mankowitz
Shie Mannor
66
540
0
28 May 2018
A Lyapunov-based Approach to Safe Reinforcement Learning
Yinlam Chow
Ofir Nachum
Edgar A. Duénez-Guzmán
Mohammad Ghavamzadeh
152
504
0
20 May 2018
Recasting Gradient-Based Meta-Learning as Hierarchical Bayes
Erin Grant
Chelsea Finn
Sergey Levine
Trevor Darrell
Thomas Griffiths
BDL
74
505
0
26 Jan 2018
Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning
Benjamin Eysenbach
S. Gu
Julian Ibarz
Sergey Levine
CLL
66
139
0
18 Nov 2017
Distributional Reinforcement Learning with Quantile Regression
Will Dabney
Mark Rowland
Marc G. Bellemare
Rémi Munos
87
756
0
27 Oct 2017
A Distributional Perspective on Reinforcement Learning
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
82
1,497
0
21 Jul 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
285
18,685
0
20 Jul 2017
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
OffRL
59
231
0
17 Jul 2017
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
105
3,243
0
12 Jun 2017
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
101
1,313
0
30 May 2017
Safe Model-based Reinforcement Learning with Stability Guarantees
Felix Berkenkamp
M. Turchetta
Angela P. Schoellig
Andreas Krause
141
845
0
23 May 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
781
11,793
0
09 Mar 2017
Robust Adversarial Reinforcement Learning
Lerrel Pinto
James Davidson
Rahul Sukthankar
Abhinav Gupta
OOD
83
848
0
08 Mar 2017
CAD2RL: Real Single-Image Flight without a Single Real Image
Fereshteh Sadeghi
Sergey Levine
SSL
295
814
0
13 Nov 2016
Successor Features for Transfer in Reinforcement Learning
André Barreto
Will Dabney
Rémi Munos
Jonathan J. Hunt
Tom Schaul
H. V. Hasselt
David Silver
40
566
0
16 Jun 2016
Cooperative Inverse Reinforcement Learning
Dylan Hadfield-Menell
Anca Dragan
Pieter Abbeel
Stuart J. Russell
65
643
0
09 Jun 2016
Deep Successor Reinforcement Learning
Tejas D. Kulkarni
A. Saeedi
Simanta Gautam
S. Gershman
57
208
0
08 Jun 2016
Risk-Constrained Reinforcement Learning with Percentile Risk Criteria
Yinlam Chow
Mohammad Ghavamzadeh
Lucas Janson
Marco Pavone
66
510
0
05 Dec 2015
1
2
Next