ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.05173
  4. Cited By
Trial without Error: Towards Safe Reinforcement Learning via Human
  Intervention

Trial without Error: Towards Safe Reinforcement Learning via Human Intervention

17 July 2017
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Trial without Error: Towards Safe Reinforcement Learning via Human Intervention"

50 / 87 papers shown
Title
Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving
Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving
Li Zeqiao
Wang Yijing
Wang Haoyu
Li Zheng
Li Peng
Zuo zhiqiang
Hu Chuan
118
0
0
04 Jun 2025
High-Precision Transformer-Based Visual Servoing for Humanoid Robots in Aligning Tiny Objects
High-Precision Transformer-Based Visual Servoing for Humanoid Robots in Aligning Tiny Objects
Jialong Xue
Wei Gao
Yu Wang
Chao Ji
Dongdong Zhao
Shi Yan
Shiwu Zhang
91
1
0
06 Mar 2025
MILE: Model-based Intervention Learning
MILE: Model-based Intervention Learning
Yigit Korkmaz
Erdem Bıyık
150
2
0
21 Feb 2025
Learning from Active Human Involvement through Proxy Value Propagation
Learning from Active Human Involvement through Proxy Value Propagation
Zhenghao Peng
Wenjie Mo
Chenda Duan
Quanyi Li
Bolei Zhou
185
16
0
05 Feb 2025
Reinforcement Learning From Imperfect Corrective Actions And Proxy
  Rewards
Reinforcement Learning From Imperfect Corrective Actions And Proxy Rewards
Zhaohui Jiang
Xuening Feng
Paul Weng
Yifei Zhu
Yan Song
Tianze Zhou
Yujing Hu
Tangjie Lv
Changjie Fan
135
1
0
08 Oct 2024
Safe Navigation for Robotic Digestive Endoscopy via Human Intervention-based Reinforcement Learning
Safe Navigation for Robotic Digestive Endoscopy via Human Intervention-based Reinforcement Learning
Min Tan
Yushun Tao
Boyun Zheng
GaoSheng Xie
Lijuan Feng
Zeyang Xia
Jing Xiong
103
0
0
24 Sep 2024
Safety through feedback in Constrained RL
Safety through feedback in Constrained RL
Shashank Reddy Chirra
Pradeep Varakantham
P. Paruchuri
OffRL
113
1
0
28 Jun 2024
Safe and Robust Reinforcement Learning: Principles and Practice
Safe and Robust Reinforcement Learning: Principles and Practice
Taku Yamagata
Raúl Santos-Rodríguez
OffRL
101
2
0
27 Mar 2024
Learning Flight Control Systems from Human Demonstrations and Real-Time
  Uncertainty-Informed Interventions
Learning Flight Control Systems from Human Demonstrations and Real-Time Uncertainty-Informed Interventions
Prashant Ganesh
J. H. Ramos
Vinicius G. Goecks
Jared Paquet
Matthew Longmire
Nicholas R. Waytowich
Kevin Brink
20
0
0
01 May 2023
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic
  Environments
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic Environments
Hongyi Chen
Changliu Liu
OffRL
53
14
0
24 Mar 2023
Constrained Decision Transformer for Offline Safe Reinforcement Learning
Constrained Decision Transformer for Offline Safe Reinforcement Learning
Zuxin Liu
Zijian Guo
Yi-Fan Yao
Zhepeng Cen
Wenhao Yu
Tingnan Zhang
Ding Zhao
OffRL
89
52
0
14 Feb 2023
Imitating careful experts to avoid catastrophic events
Imitating careful experts to avoid catastrophic events
J.R.P. Hanslope
Laurence Aitchison
OffRL
67
0
0
02 Feb 2023
A Mapping of Assurance Techniques for Learning Enabled Autonomous
  Systems to the Systems Engineering Lifecycle
A Mapping of Assurance Techniques for Learning Enabled Autonomous Systems to the Systems Engineering Lifecycle
Christian Ellis
Maggie B. Wigness
L. Fiondella
67
1
0
30 Dec 2022
Don't do it: Safer Reinforcement Learning With Rule-based Guidance
Don't do it: Safer Reinforcement Learning With Rule-based Guidance
Ekaterina Nikonova
Cheng Xue
Jochen Renz
119
0
0
28 Dec 2022
Progress and summary of reinforcement learning on energy management of
  MPS-EV
Progress and summary of reinforcement learning on energy management of MPS-EV
Jincheng Hu
Yang Lin
Liang Chu
Zhuoran Hou
Jihan Li
Jingjing Jiang
Yuanjian Zhang
130
13
0
08 Nov 2022
Safe Policy Improvement in Constrained Markov Decision Processes
Safe Policy Improvement in Constrained Markov Decision Processes
Luigi Berducci
Radu Grosu
OffRL
106
2
0
20 Oct 2022
Provably Safe Reinforcement Learning via Action Projection using
  Reachability Analysis and Polynomial Zonotopes
Provably Safe Reinforcement Learning via Action Projection using Reachability Analysis and Polynomial Zonotopes
Niklas Kochdumper
Hanna Krasowski
Xiao Wang
Stanley Bak
Matthias Althoff
88
30
0
19 Oct 2022
Neurosymbolic Motion and Task Planning for Linear Temporal Logic Tasks
Neurosymbolic Motion and Task Planning for Linear Temporal Logic Tasks
Xiaowu Sun
Yasser Shoukry
89
11
0
11 Oct 2022
Bypassing the Simulation-to-reality Gap: Online Reinforcement Learning
  using a Supervisor
Bypassing the Simulation-to-reality Gap: Online Reinforcement Learning using a Supervisor
B. D. Evans
Johannes Betz
Hongrui Zheng
H. Engelbrecht
Rahul Mangharam
H. W. Jordaan
OffRL
62
7
0
22 Sep 2022
Law Informs Code: A Legal Informatics Approach to Aligning Artificial
  Intelligence with Humans
Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans
John J. Nay
ELMAILaw
190
29
0
14 Sep 2022
On the Robustness of Safe Reinforcement Learning under Observational
  Perturbations
On the Robustness of Safe Reinforcement Learning under Observational Perturbations
Zuxin Liu
Zijian Guo
Zhepeng Cen
Huan Zhang
Jie Tan
Yue Liu
Ding Zhao
OODOffRL
100
37
0
29 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
99
365
0
02 May 2022
Efficient Learning of Safe Driving Policy via Human-AI Copilot
  Optimization
Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization
Quanyi Li
Zhenghao Peng
Bolei Zhou
154
59
0
17 Feb 2022
Safe Deep RL in 3D Environments using Human Feedback
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
84
4
0
20 Jan 2022
Direct Behavior Specification via Constrained Reinforcement Learning
Direct Behavior Specification via Constrained Reinforcement Learning
Julien Roy
Roger Girgis
Joshua Romoff
Pierre-Luc Bacon
C. Pal
114
36
0
22 Dec 2021
Cooperation for Scalable Supervision of Autonomy in Mixed Traffic
Cooperation for Scalable Supervision of Autonomy in Mixed Traffic
Cameron Hickert
Sirui Li
Cathy Wu
59
6
0
14 Dec 2021
Combining Learning from Human Feedback and Knowledge Engineering to
  Solve Hierarchical Tasks in Minecraft
Combining Learning from Human Feedback and Knowledge Engineering to Solve Hierarchical Tasks in Minecraft
Vinicius G. Goecks
Nicholas R. Waytowich
David Watkins
Bharat Prakash
41
7
0
07 Dec 2021
A note on stabilizing reinforcement learning
A note on stabilizing reinforcement learning
Pavel Osinenko
Grigory Yaremenko
Ilya Osokin
37
2
0
24 Nov 2021
Look Before You Leap: Safe Model-Based Reinforcement Learning with Human
  Intervention
Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention
Yunkun Xu
Zhen-yu Liu
Guifang Duan
Jiangcheng Zhu
X. Bai
Jianrong Tan
81
9
0
10 Nov 2021
OnSlicing: Online End-to-End Network Slicing with Reinforcement Learning
OnSlicing: Online End-to-End Network Slicing with Reinforcement Learning
Qiang Liu
Nakjung Choi
Tao Han
OffRL
61
31
0
02 Nov 2021
Play to Grade: Testing Coding Games as Classifying Markov Decision
  Process
Play to Grade: Testing Coding Games as Classifying Markov Decision Process
Allen Nie
Emma Brunskill
Chris Piech
73
11
0
27 Oct 2021
Unsolved Problems in ML Safety
Unsolved Problems in ML Safety
Dan Hendrycks
Nicholas Carlini
John Schulman
Jacob Steinhardt
291
294
0
28 Sep 2021
Prioritized Experience-based Reinforcement Learning with Human Guidance
  for Autonomous Driving
Prioritized Experience-based Reinforcement Learning with Human Guidance for Autonomous Driving
Jingda Wu
Zhiyu Huang
Wenhui Huang
Chen Lv
105
77
0
26 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
86
103
0
14 Sep 2021
Recent Advances in Leveraging Human Guidance for Sequential
  Decision-Making Tasks
Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks
Ruohan Zhang
F. Torabi
Garrett A. Warnell
Peter Stone
158
29
0
13 Jul 2021
Safe Exploration by Solving Early Terminated MDP
Safe Exploration by Solving Early Terminated MDP
Hao Sun
Ziping Xu
Meng Fang
Zhenghao Peng
Jiadong Guo
Bo Dai
Bolei Zhou
47
17
0
09 Jul 2021
Software Engineering for AI-Based Systems: A Survey
Software Engineering for AI-Based Systems: A Survey
Silverio Martínez-Fernández
Justus Bogner
Xavier Franch
Marc Oriol
Julien Siebert
Adam Trendowicz
Anna Maria Vollmer
Stefan Wagner
118
232
0
05 May 2021
Preference learning along multiple criteria: A game-theoretic
  perspective
Preference learning along multiple criteria: A game-theoretic perspective
Kush S. Bhatia
A. Pananjady
Peter L. Bartlett
Anca Dragan
Martin J. Wainwright
123
13
0
05 May 2021
Human-in-the-Loop Deep Reinforcement Learning with Application to
  Autonomous Driving
Human-in-the-Loop Deep Reinforcement Learning with Application to Autonomous Driving
Jingda Wu
Zhiyu Huang
Chao Huang
Zhongxu Hu
Peng Hang
Yang Xing
Chen Lv
95
42
0
15 Apr 2021
Potential Impacts of Smart Homes on Human Behavior: A Reinforcement
  Learning Approach
Potential Impacts of Smart Homes on Human Behavior: A Reinforcement Learning Approach
Shashi Suman
Ali Etemad
F. Rivest
68
15
0
26 Feb 2021
Provably Correct Training of Neural Network Controllers Using
  Reachability Analysis
Provably Correct Training of Neural Network Controllers Using Reachability Analysis
Xiaowu Sun
Yasser Shoukry
93
7
0
22 Feb 2021
Training a Resilient Q-Network against Observational Interference
Training a Resilient Q-Network against Observational Interference
Chao-Han Huck Yang
I-Te Danny Hung
Ouyang Yi
Pin-Yu Chen
OOD
61
15
0
18 Feb 2021
How RL Agents Behave When Their Actions Are Modified
How RL Agents Behave When Their Actions Are Modified
Eric D. Langlois
Tom Everitt
66
13
0
15 Feb 2021
Shielding Atari Games with Bounded Prescience
Shielding Atari Games with Bounded Prescience
Mirco Giacobbe
Mohammadhosein Hasanbeig
Daniel Kroening
H. Wijk
70
23
0
20 Jan 2021
SAFARI: Safe and Active Robot Imitation Learning with Imagination
SAFARI: Safe and Active Robot Imitation Learning with Imagination
Norman Di Palo
Edward Johns
74
8
0
18 Nov 2020
Avoiding Tampering Incentives in Deep RL via Decoupled Approval
Avoiding Tampering Incentives in Deep RL via Decoupled Approval
J. Uesato
Ramana Kumar
Victoria Krakovna
Tom Everitt
Richard Ngo
Shane Legg
69
16
0
17 Nov 2020
Reinforcement Learning Control of Constrained Dynamic Systems with
  Uniformly Ultimate Boundedness Stability Guarantee
Reinforcement Learning Control of Constrained Dynamic Systems with Uniformly Ultimate Boundedness Stability Guarantee
Minghao Han
Yuan Tian
Lixian Zhang
Jun Wang
Wei Pan
68
49
0
13 Nov 2020
Sample-efficient Reinforcement Learning in Robotic Table Tennis
Sample-efficient Reinforcement Learning in Robotic Table Tennis
Jonas Tebbe
Lukas Krauch
Yapeng Gao
A. Zell
76
34
0
06 Nov 2020
APPLI: Adaptive Planner Parameter Learning From Interventions
APPLI: Adaptive Planner Parameter Learning From Interventions
Zizhao Wang
Xuesu Xiao
Bo Liu
Garrett A. Warnell
Peter Stone
62
51
0
01 Nov 2020
Avoiding Side Effects By Considering Future Tasks
Avoiding Side Effects By Considering Future Tasks
Victoria Krakovna
Laurent Orseau
Richard Ngo
Miljan Martic
Shane Legg
78
38
0
15 Oct 2020
12
Next