1707.05173
Cited By
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
17 July 2017
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
OffRL
Papers citing
"Trial without Error: Towards Safe Reinforcement Learning via Human Intervention"
50 / 87 papers shown
Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving
Li Zeqiao
Wang Yijing
Wang Haoyu
Li Zheng
Li Peng
Zuo Zhiqiang
Hu Chuan
118
0
0
04 Jun 2025
High-Precision Transformer-Based Visual Servoing for Humanoid Robots in Aligning Tiny Objects
Jialong Xue
Wei Gao
Yu Wang
Chao Ji
Dongdong Zhao
Shi Yan
Shiwu Zhang
91
1
0
06 Mar 2025
MILE: Model-based Intervention Learning
Yigit Korkmaz
Erdem Bıyık
150
2
0
21 Feb 2025
Learning from Active Human Involvement through Proxy Value Propagation
Zhenghao Peng
Wenjie Mo
Chenda Duan
Quanyi Li
Bolei Zhou
185
16
0
05 Feb 2025
Reinforcement Learning From Imperfect Corrective Actions And Proxy Rewards
Zhaohui Jiang
Xuening Feng
Paul Weng
Yifei Zhu
Yan Song
Tianze Zhou
Yujing Hu
Tangjie Lv
Changjie Fan
135
1
0
08 Oct 2024
Safe Navigation for Robotic Digestive Endoscopy via Human Intervention-based Reinforcement Learning
Min Tan
Yushun Tao
Boyun Zheng
GaoSheng Xie
Lijuan Feng
Zeyang Xia
Jing Xiong
103
0
0
24 Sep 2024
Safety through feedback in Constrained RL
Shashank Reddy Chirra
Pradeep Varakantham
P. Paruchuri
OffRL
113
1
0
28 Jun 2024
Safe and Robust Reinforcement Learning: Principles and Practice
Taku Yamagata
Raúl Santos-Rodríguez
OffRL
101
2
0
27 Mar 2024
Learning Flight Control Systems from Human Demonstrations and Real-Time Uncertainty-Informed Interventions
Prashant Ganesh
J. H. Ramos
Vinicius G. Goecks
Jared Paquet
Matthew Longmire
Nicholas R. Waytowich
Kevin Brink
20
0
0
01 May 2023
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic Environments
Hongyi Chen
Changliu Liu
OffRL
53
14
0
24 Mar 2023
Constrained Decision Transformer for Offline Safe Reinforcement Learning
Zuxin Liu
Zijian Guo
Yi-Fan Yao
Zhepeng Cen
Wenhao Yu
Tingnan Zhang
Ding Zhao
OffRL
89
52
0
14 Feb 2023
Imitating careful experts to avoid catastrophic events
J.R.P. Hanslope
Laurence Aitchison
OffRL
67
0
0
02 Feb 2023
A Mapping of Assurance Techniques for Learning Enabled Autonomous Systems to the Systems Engineering Lifecycle
Christian Ellis
Maggie B. Wigness
L. Fiondella
67
1
0
30 Dec 2022
Don't do it: Safer Reinforcement Learning With Rule-based Guidance
Ekaterina Nikonova
Cheng Xue
Jochen Renz
119
0
0
28 Dec 2022
Progress and summary of reinforcement learning on energy management of MPS-EV
Jincheng Hu
Yang Lin
Liang Chu
Zhuoran Hou
Jihan Li
Jingjing Jiang
Yuanjian Zhang
130
13
0
08 Nov 2022
Safe Policy Improvement in Constrained Markov Decision Processes
Luigi Berducci
Radu Grosu
OffRL
106
2
0
20 Oct 2022
Provably Safe Reinforcement Learning via Action Projection using Reachability Analysis and Polynomial Zonotopes
Niklas Kochdumper
Hanna Krasowski
Xiao Wang
Stanley Bak
Matthias Althoff
88
30
0
19 Oct 2022
Neurosymbolic Motion and Task Planning for Linear Temporal Logic Tasks
Xiaowu Sun
Yasser Shoukry
89
11
0
11 Oct 2022
Bypassing the Simulation-to-reality Gap: Online Reinforcement Learning using a Supervisor
B. D. Evans
Johannes Betz
Hongrui Zheng
H. Engelbrecht
Rahul Mangharam
H. W. Jordaan
OffRL
62
7
0
22 Sep 2022
Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans
John J. Nay
ELM
AILaw
190
29
0
14 Sep 2022
On the Robustness of Safe Reinforcement Learning under Observational Perturbations
Zuxin Liu
Zijian Guo
Zhepeng Cen
Huan Zhang
Jie Tan
Yue Liu
Ding Zhao
OOD
OffRL
100
37
0
29 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
99
365
0
02 May 2022
Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization
Quanyi Li
Zhenghao Peng
Bolei Zhou
154
59
0
17 Feb 2022
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
84
4
0
20 Jan 2022
Direct Behavior Specification via Constrained Reinforcement Learning
Julien Roy
Roger Girgis
Joshua Romoff
Pierre-Luc Bacon
C. Pal
114
36
0
22 Dec 2021
Cooperation for Scalable Supervision of Autonomy in Mixed Traffic
Cameron Hickert
Sirui Li
Cathy Wu
59
6
0
14 Dec 2021
Combining Learning from Human Feedback and Knowledge Engineering to Solve Hierarchical Tasks in Minecraft
Vinicius G. Goecks
Nicholas R. Waytowich
David Watkins
Bharat Prakash
41
7
0
07 Dec 2021
A note on stabilizing reinforcement learning
Pavel Osinenko
Grigory Yaremenko
Ilya Osokin
37
2
0
24 Nov 2021
Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention
Yunkun Xu
Zhen-yu Liu
Guifang Duan
Jiangcheng Zhu
X. Bai
Jianrong Tan
81
9
0
10 Nov 2021
OnSlicing: Online End-to-End Network Slicing with Reinforcement Learning
Qiang Liu
Nakjung Choi
Tao Han
OffRL
61
31
0
02 Nov 2021
Play to Grade: Testing Coding Games as Classifying Markov Decision Process
Allen Nie
Emma Brunskill
Chris Piech
73
11
0
27 Oct 2021
Unsolved Problems in ML Safety
Dan Hendrycks
Nicholas Carlini
John Schulman
Jacob Steinhardt
291
294
0
28 Sep 2021
Prioritized Experience-based Reinforcement Learning with Human Guidance for Autonomous Driving
Jingda Wu
Zhiyu Huang
Wenhui Huang
Chen Lv
105
77
0
26 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
86
103
0
14 Sep 2021
Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks
Ruohan Zhang
F. Torabi
Garrett A. Warnell
Peter Stone
158
29
0
13 Jul 2021
Safe Exploration by Solving Early Terminated MDP
Hao Sun
Ziping Xu
Meng Fang
Zhenghao Peng
Jiadong Guo
Bo Dai
Bolei Zhou
47
17
0
09 Jul 2021
Software Engineering for AI-Based Systems: A Survey
Silverio Martínez-Fernández
Justus Bogner
Xavier Franch
Marc Oriol
Julien Siebert
Adam Trendowicz
Anna Maria Vollmer
Stefan Wagner
118
232
0
05 May 2021
Preference learning along multiple criteria: A game-theoretic perspective
Kush S. Bhatia
A. Pananjady
Peter L. Bartlett
Anca Dragan
Martin J. Wainwright
123
13
0
05 May 2021
Human-in-the-Loop Deep Reinforcement Learning with Application to Autonomous Driving
Jingda Wu
Zhiyu Huang
Chao Huang
Zhongxu Hu
Peng Hang
Yang Xing
Chen Lv
95
42
0
15 Apr 2021
Potential Impacts of Smart Homes on Human Behavior: A Reinforcement Learning Approach
Shashi Suman
Ali Etemad
F. Rivest
68
15
0
26 Feb 2021
Provably Correct Training of Neural Network Controllers Using Reachability Analysis
Xiaowu Sun
Yasser Shoukry
93
7
0
22 Feb 2021
Training a Resilient Q-Network against Observational Interference
Chao-Han Huck Yang
I-Te Danny Hung
Ouyang Yi
Pin-Yu Chen
OOD
61
15
0
18 Feb 2021
How RL Agents Behave When Their Actions Are Modified
Eric D. Langlois
Tom Everitt
66
13
0
15 Feb 2021
Shielding Atari Games with Bounded Prescience
Mirco Giacobbe
Mohammadhosein Hasanbeig
Daniel Kroening
H. Wijk
70
23
0
20 Jan 2021
SAFARI: Safe and Active Robot Imitation Learning with Imagination
Norman Di Palo
Edward Johns
74
8
0
18 Nov 2020
Avoiding Tampering Incentives in Deep RL via Decoupled Approval
J. Uesato
Ramana Kumar
Victoria Krakovna
Tom Everitt
Richard Ngo
Shane Legg
69
16
0
17 Nov 2020
Reinforcement Learning Control of Constrained Dynamic Systems with Uniformly Ultimate Boundedness Stability Guarantee
Minghao Han
Yuan Tian
Lixian Zhang
Jun Wang
Wei Pan
68
49
0
13 Nov 2020
Sample-efficient Reinforcement Learning in Robotic Table Tennis
Jonas Tebbe
Lukas Krauch
Yapeng Gao
A. Zell
76
34
0
06 Nov 2020
APPLI: Adaptive Planner Parameter Learning From Interventions
Zizhao Wang
Xuesu Xiao
Bo Liu
Garrett A. Warnell
Peter Stone
62
51
0
01 Nov 2020
Avoiding Side Effects By Considering Future Tasks
Victoria Krakovna
Laurent Orseau
Richard Ngo
Miljan Martic
Shane Legg
78
38
0
15 Oct 2020