Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.05173
Cited By
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
17 July 2017
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Trial without Error: Towards Safe Reinforcement Learning via Human Intervention"
45 / 45 papers shown
Title
MILE: Model-based Intervention Learning
Yigit Korkmaz
Erdem Bıyık
88
2
0
21 Feb 2025
Learning from Active Human Involvement through Proxy Value Propagation
Zhenghao Peng
Wenjie Mo
Chenda Duan
Quanyi Li
Bolei Zhou
107
14
0
05 Feb 2025
Safe Navigation for Robotic Digestive Endoscopy via Human Intervention-based Reinforcement Learning
Min Tan
Yushun Tao
Boyun Zheng
GaoSheng Xie
Lijuan Feng
Zeyang Xia
Jing Xiong
27
0
0
24 Sep 2024
Safety through feedback in Constrained RL
Shashank Reddy Chirra
Pradeep Varakantham
P. Paruchuri
OffRL
51
1
0
28 Jun 2024
Learning to Recover for Safe Reinforcement Learning
Haoyu Wang
Xin Yuan
Qinqing Ren
34
0
0
21 Sep 2023
Model-Assisted Probabilistic Safe Adaptive Control With Meta-Bayesian Learning
Shengbo Wang
Ke Li
Yin Yang
Yuting Cao
Tingwen Huang
S. Wen
25
4
0
03 Jul 2023
An Emergency Disposal Decision-making Method with Human--Machine Collaboration
Yibo Guo
Jingyi Xue
Yingkang Zhang
Mingliang Xu
33
0
0
29 May 2023
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic Environments
Hongyi Chen
Changliu Liu
OffRL
19
14
0
24 Mar 2023
A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors
Shangding Gu
Alap Kshirsagar
Yali Du
Guang Chen
Jan Peters
Alois C. Knoll
34
14
0
25 Feb 2023
Imitating careful experts to avoid catastrophic events
J.R.P. Hanslope
Laurence Aitchison
OffRL
27
0
0
02 Feb 2023
Don't do it: Safer Reinforcement Learning With Rule-based Guidance
Ekaterina Nikonova
Cheng Xue
Jochen Renz
32
0
0
28 Dec 2022
Progress and summary of reinforcement learning on energy management of MPS-EV
Jincheng Hu
Yang Lin
Liang Chu
Zhuoran Hou
Jihan Li
Jingjing Jiang
Yuanjian Zhang
23
12
0
08 Nov 2022
Safe Policy Improvement in Constrained Markov Decision Processes
Luigi Berducci
Radu Grosu
OffRL
36
2
0
20 Oct 2022
Neurosymbolic Motion and Task Planning for Linear Temporal Logic Tasks
Xiaowu Sun
Yasser Shoukry
48
11
0
11 Oct 2022
Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans
John J. Nay
ELM
AILaw
88
27
0
14 Sep 2022
On the Robustness of Safe Reinforcement Learning under Observational Perturbations
Zuxin Liu
Zijian Guo
Zhepeng Cen
Huan Zhang
Jie Tan
Bo-wen Li
Ding Zhao
OOD
OffRL
45
35
0
29 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
26
324
0
02 May 2022
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
32
4
0
20 Jan 2022
Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles
Xinglong Zhang
Yaoqian Peng
Biao Luo
Wei Pan
Xin Xu
Haibin Xie
27
11
0
18 Dec 2021
Combining Learning from Human Feedback and Knowledge Engineering to Solve Hierarchical Tasks in Minecraft
Vinicius G. Goecks
Nicholas R. Waytowich
David Watkins
Bharat Prakash
13
7
0
07 Dec 2021
A note on stabilizing reinforcement learning
Pavel Osinenko
Grigory Yaremenko
Ilya Osokin
14
2
0
24 Nov 2021
Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention
Yunkun Xu
Zhen-yu Liu
Guifang Duan
Jiangcheng Zhu
X. Bai
Jianrong Tan
18
9
0
10 Nov 2021
OnSlicing: Online End-to-End Network Slicing with Reinforcement Learning
Qiang Liu
Nakjung Choi
Tao Han
OffRL
29
29
0
02 Nov 2021
Play to Grade: Testing Coding Games as Classifying Markov Decision Process
Allen Nie
Emma Brunskill
Chris Piech
29
11
0
27 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
93
0
14 Sep 2021
Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks
Ruohan Zhang
F. Torabi
Garrett A. Warnell
Peter Stone
81
28
0
13 Jul 2021
Human-in-the-Loop Deep Reinforcement Learning with Application to Autonomous Driving
Jingda Wu
Zhiyu Huang
Chao Huang
Zhongxu Hu
Peng Hang
Yang Xing
Chen Lv
36
40
0
15 Apr 2021
Potential Impacts of Smart Homes on Human Behavior: A Reinforcement Learning Approach
Shashi Suman
Ali Etemad
F. Rivest
27
15
0
26 Feb 2021
Provably Correct Training of Neural Network Controllers Using Reachability Analysis
Xiaowu Sun
Yasser Shoukry
20
7
0
22 Feb 2021
Training a Resilient Q-Network against Observational Interference
Chao-Han Huck Yang
I-Te Danny Hung
Ouyang Yi
Pin-Yu Chen
OOD
26
14
0
18 Feb 2021
Avoiding Tampering Incentives in Deep RL via Decoupled Approval
J. Uesato
Ramana Kumar
Victoria Krakovna
Tom Everitt
Richard Ngo
Shane Legg
26
14
0
17 Nov 2020
ShieldNN: A Provably Safe NN Filter for Unsafe NN Controllers
James Ferlez
Mahmoud M. Elnaggar
Yasser Shoukry
C. Fleming
AAML
62
33
0
16 Jun 2020
Reinforcement Learning Under Moral Uncertainty
Adrien Ecoffet
Joel Lehman
17
32
0
08 Jun 2020
AI Research Considerations for Human Existential Safety (ARCHES)
Andrew Critch
David M. Krueger
30
50
0
30 May 2020
Firearm Detection and Segmentation Using an Ensemble of Semantic Neural Networks
Alexander Egiazarov
Vasileios Mavroeidis
Fabio Massimo Zennaro
Kamer Vishi
15
18
0
11 Feb 2020
Automated Lane Change Strategy using Proximal Policy Optimization-based Deep Reinforcement Learning
Fei Ye
Xuxin Cheng
Pin Wang
Ching-yao Chan
Jiucai Zhang
21
98
0
07 Feb 2020
Faster and Safer Training by Embedding High-Level Knowledge into Deep Reinforcement Learning
Haodi Zhang
Zihang Gao
Yi Zhou
Haotong Zhang
Kaishun Wu
Fangzhen Lin
AI4CE
27
17
0
22 Oct 2019
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments
Vinicius G. Goecks
Gregory M. Gremillion
Vernon J. Lawhern
J. Valasek
Nicholas R. Waytowich
OffRL
19
31
0
09 Oct 2019
Leveraging Human Guidance for Deep Reinforcement Learning Tasks
Ruohan Zhang
F. Torabi
L. Guan
D. Ballard
Peter Stone
19
87
0
21 Sep 2019
Conservative Agency via Attainable Utility Preservation
Alexander Matt Turner
Dylan Hadfield-Menell
Prasad Tadepalli
14
49
0
26 Feb 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
24
361
0
30 Jan 2019
Residual Reinforcement Learning for Robot Control
T. Johannink
Shikhar Bahl
Ashvin Nair
Jianlan Luo
Avinash Kumar
M. Loskyll
J. A. Ojea
Eugen Solowjow
Sergey Levine
OffRL
30
409
0
07 Dec 2018
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
34
396
0
19 Nov 2018
HG-DAgger: Interactive Imitation Learning with Human Experts
Michael Kelly
Chelsea Sidrane
Katherine Driggs-Campbell
Mykel J. Kochenderfer
OffRL
11
218
0
05 Oct 2018
AGI Safety Literature Review
Tom Everitt
G. Lea
Marcus Hutter
AI4CE
36
115
0
03 May 2018
1