Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.05173
Cited By
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
17 July 2017
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Trial without Error: Towards Safe Reinforcement Learning via Human Intervention"
37 / 87 papers shown
Title
LaND: Learning to Navigate from Disengagements
G. Kahn
Pieter Abbeel
Sergey Levine
82
52
0
09 Oct 2020
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems
Vinicius G. Goecks
115
11
0
30 Aug 2020
Meta Reinforcement Learning-Based Lane Change Strategy for Autonomous Vehicles
Fei Ye
Pin Wang
Ching-yao Chan
Jiucai Zhang
47
20
0
28 Aug 2020
Battlesnake Challenge: A Multi-agent Reinforcement Learning Playground with Human-in-the-loop
Jonathan Chung
Anna Luo
Xavier Raffin
Scott Perry
OffRL
50
3
0
20 Jul 2020
ShieldNN: A Provably Safe NN Filter for Unsafe NN Controllers
James Ferlez
Mahmoud M. Elnaggar
Yasser Shoukry
C. Fleming
AAML
99
33
0
16 Jun 2020
Pessimism About Unknown Unknowns Inspires Conservatism
Michael K. Cohen
Marcus Hutter
64
13
0
15 Jun 2020
Open Questions in Creating Safe Open-ended AI: Tensions Between Control and Creativity
Adrien Ecoffet
Jeff Clune
Joel Lehman
88
16
0
12 Jun 2020
Reinforcement Learning Under Moral Uncertainty
Adrien Ecoffet
Joel Lehman
125
32
0
08 Jun 2020
AI Research Considerations for Human Existential Safety (ARCHES)
Andrew Critch
David M. Krueger
112
53
0
30 May 2020
Firearm Detection and Segmentation Using an Ensemble of Semantic Neural Networks
Alexander Egiazarov
Vasileios Mavroeidis
Fabio Massimo Zennaro
Kamer Vishi
22
22
0
11 Feb 2020
Automated Lane Change Strategy using Proximal Policy Optimization-based Deep Reinforcement Learning
Fei Ye
Xuxin Cheng
Pin Wang
Ching-yao Chan
Jiucai Zhang
42
100
0
07 Feb 2020
Learning Human Objectives by Evaluating Hypothetical Behavior
S. Reddy
Anca Dragan
Sergey Levine
Shane Legg
Jan Leike
87
77
0
05 Dec 2019
Faster and Safer Training by Embedding High-Level Knowledge into Deep Reinforcement Learning
Haodi Zhang
Zihang Gao
Yi Zhou
Haotong Zhang
Kaishun Wu
Fangzhen Lin
AI4CE
59
17
0
22 Oct 2019
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments
Vinicius G. Goecks
Gregory M. Gremillion
Vernon J. Lawhern
J. Valasek
Nicholas R. Waytowich
OffRL
110
31
0
09 Oct 2019
Leveraging Human Guidance for Deep Reinforcement Learning Tasks
Ruohan Zhang
F. Torabi
L. Guan
D. Ballard
Peter Stone
65
87
0
21 Sep 2019
Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective
Tom Everitt
Marcus Hutter
Ramana Kumar
Victoria Krakovna
105
97
0
13 Aug 2019
Generalizing from a few environments in safety-critical reinforcement learning
Zachary Kenton
Angelos Filos
Owain Evans
Y. Gal
87
16
0
02 Jul 2019
Towards Empathic Deep Q-Learning
Bart Bussmann
Jacqueline Heinerman
Joel Lehman
AI4CE
72
11
0
26 Jun 2019
Evolutionary Computation and AI Safety: Research Problems Impeding Routine and Safe Real-world Application of Evolution
Joel Lehman
73
7
0
24 Jun 2019
Improving Safety in Reinforcement Learning Using Model-Based Architectures and Human Intervention
Bharat Prakash
Mohit Khatwani
Nicholas R. Waytowich
T. Mohsenin
OffRL
58
19
0
22 Mar 2019
Conservative Agency via Attainable Utility Preservation
Alexander Matt Turner
Dylan Hadfield-Menell
Prasad Tadepalli
120
49
0
26 Feb 2019
Parenting: Safe Reinforcement Learning from Human Input
Christopher Frye
Ilya Feige
76
7
0
18 Feb 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
130
370
0
30 Jan 2019
Impossibility and Uncertainty Theorems in AI Value Alignment (or why your AGI should not have a utility function)
P. Eckersley
119
46
0
31 Dec 2018
Residual Reinforcement Learning for Robot Control
T. Johannink
Shikhar Bahl
Ashvin Nair
Jianlan Luo
Avinash Kumar
M. Loskyll
J. A. Ojea
Eugen Solowjow
Sergey Levine
OffRL
90
420
0
07 Dec 2018
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
124
421
0
19 Nov 2018
Reward learning from human preferences and demonstrations in Atari
Borja Ibarz
Jan Leike
Tobias Pohlen
G. Irving
Shane Legg
Dario Amodei
131
398
0
15 Nov 2018
Intervention Aided Reinforcement Learning for Safe and Practical Policy Optimization in Navigation
Fan Wang
Bo Zhou
Ke Chen
Tingxiang Fan
Xi Zhang
Jiangyong Li
Hao Tian
Jia Pan
68
26
0
15 Nov 2018
Deep Reinforcement Learning
Yuxi Li
VLM
OffRL
194
144
0
15 Oct 2018
HG-DAgger: Interactive Imitation Learning with Human Experts
Michael Kelly
Chelsea Sidrane
Katherine Driggs-Campbell
Mykel J. Kochenderfer
OffRL
255
232
0
05 Oct 2018
Verification for Machine Learning, Autonomy, and Neural Networks Survey
Weiming Xiang
Patrick Musau
A. Wild
Diego Manzanas Lopez
Nathaniel P. Hamilton
Xiaodong Yang
Joel A. Rosenfeld
Taylor T. Johnson
95
102
0
03 Oct 2018
Adding Neural Network Controllers to Behavior Trees without Destroying Performance Guarantees
Christopher Iliffe Sprague
Petter Ögren
67
25
0
26 Sep 2018
Cycle-of-Learning for Autonomous Systems from Human Interaction
Nicholas R. Waytowich
Vinicius G. Goecks
Vernon J. Lawhern
55
20
0
28 Aug 2018
Penalizing side effects using stepwise relative reachability
Victoria Krakovna
Laurent Orseau
Ramana Kumar
Miljan Martic
Shane Legg
98
55
0
04 Jun 2018
AGI Safety Literature Review
Tom Everitt
G. Lea
Marcus Hutter
AI4CE
86
116
0
03 May 2018
Active Reinforcement Learning with Monte-Carlo Tree Search
Sebastian Schulze
Owain Evans
60
14
0
13 Mar 2018
AI Safety Gridworlds
Jan Leike
Miljan Martic
Victoria Krakovna
Pedro A. Ortega
Tom Everitt
Andrew Lefrancq
Laurent Orseau
Shane Legg
158
255
0
27 Nov 2017
Previous
1
2