Trial without Error: Towards Safe Reinforcement Learning via Human Intervention

17 July 2017
William Saunders
Girish Sastry
Andreas Stuhlmüller
Owain Evans
    OffRL
arXiv: 1707.05173

Papers citing "Trial without Error: Towards Safe Reinforcement Learning via Human Intervention"

37 / 87 papers shown
LaND: Learning to Navigate from Disengagements
G. Kahn
Pieter Abbeel
Sergey Levine
82
52
0
09 Oct 2020
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems
Vinicius G. Goecks
115
11
0
30 Aug 2020
Meta Reinforcement Learning-Based Lane Change Strategy for Autonomous Vehicles
Fei Ye
Pin Wang
Ching-yao Chan
Jiucai Zhang
47
20
0
28 Aug 2020
Battlesnake Challenge: A Multi-agent Reinforcement Learning Playground with Human-in-the-loop
Jonathan Chung
Anna Luo
Xavier Raffin
Scott Perry
OffRL
50
3
0
20 Jul 2020
ShieldNN: A Provably Safe NN Filter for Unsafe NN Controllers
James Ferlez
Mahmoud M. Elnaggar
Yasser Shoukry
C. Fleming
AAML
99
33
0
16 Jun 2020
Pessimism About Unknown Unknowns Inspires Conservatism
Michael K. Cohen
Marcus Hutter
64
13
0
15 Jun 2020
Open Questions in Creating Safe Open-ended AI: Tensions Between Control and Creativity
Adrien Ecoffet
Jeff Clune
Joel Lehman
88
16
0
12 Jun 2020
Reinforcement Learning Under Moral Uncertainty
Adrien Ecoffet
Joel Lehman
125
32
0
08 Jun 2020
AI Research Considerations for Human Existential Safety (ARCHES)
Andrew Critch
David M. Krueger
112
53
0
30 May 2020
Firearm Detection and Segmentation Using an Ensemble of Semantic Neural Networks
Alexander Egiazarov
Vasileios Mavroeidis
Fabio Massimo Zennaro
Kamer Vishi
22
22
0
11 Feb 2020
Automated Lane Change Strategy using Proximal Policy Optimization-based Deep Reinforcement Learning
Fei Ye
Xuxin Cheng
Pin Wang
Ching-yao Chan
Jiucai Zhang
42
100
0
07 Feb 2020
Learning Human Objectives by Evaluating Hypothetical Behavior
S. Reddy
Anca Dragan
Sergey Levine
Shane Legg
Jan Leike
87
77
0
05 Dec 2019
Faster and Safer Training by Embedding High-Level Knowledge into Deep Reinforcement Learning
Haodi Zhang
Zihang Gao
Yi Zhou
Haotong Zhang
Kaishun Wu
Fangzhen Lin
AI4CE
59
17
0
22 Oct 2019
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments
Vinicius G. Goecks
Gregory M. Gremillion
Vernon J. Lawhern
J. Valasek
Nicholas R. Waytowich
OffRL
110
31
0
09 Oct 2019
Leveraging Human Guidance for Deep Reinforcement Learning Tasks
Ruohan Zhang
F. Torabi
L. Guan
D. Ballard
Peter Stone
65
87
0
21 Sep 2019
Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective
Tom Everitt
Marcus Hutter
Ramana Kumar
Victoria Krakovna
105
97
0
13 Aug 2019
Generalizing from a few environments in safety-critical reinforcement learning
Zachary Kenton
Angelos Filos
Owain Evans
Y. Gal
87
16
0
02 Jul 2019
Towards Empathic Deep Q-Learning
Bart Bussmann
Jacqueline Heinerman
Joel Lehman
AI4CE
72
11
0
26 Jun 2019
Evolutionary Computation and AI Safety: Research Problems Impeding Routine and Safe Real-world Application of Evolution
Joel Lehman
73
7
0
24 Jun 2019
Improving Safety in Reinforcement Learning Using Model-Based Architectures and Human Intervention
Bharat Prakash
Mohit Khatwani
Nicholas R. Waytowich
T. Mohsenin
OffRL
58
19
0
22 Mar 2019
Conservative Agency via Attainable Utility Preservation
Alexander Matt Turner
Dylan Hadfield-Menell
Prasad Tadepalli
120
49
0
26 Feb 2019
Parenting: Safe Reinforcement Learning from Human Input
Christopher Frye
Ilya Feige
76
7
0
18 Feb 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
130
370
0
30 Jan 2019
Impossibility and Uncertainty Theorems in AI Value Alignment (or why your AGI should not have a utility function)
P. Eckersley
119
46
0
31 Dec 2018
Residual Reinforcement Learning for Robot Control
T. Johannink
Shikhar Bahl
Ashvin Nair
Jianlan Luo
Avinash Kumar
M. Loskyll
J. A. Ojea
Eugen Solowjow
Sergey Levine
OffRL
90
420
0
07 Dec 2018
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
124
421
0
19 Nov 2018
Reward learning from human preferences and demonstrations in Atari
Borja Ibarz
Jan Leike
Tobias Pohlen
G. Irving
Shane Legg
Dario Amodei
131
398
0
15 Nov 2018
Intervention Aided Reinforcement Learning for Safe and Practical Policy Optimization in Navigation
Fan Wang
Bo Zhou
Ke Chen
Tingxiang Fan
Xi Zhang
Jiangyong Li
Hao Tian
Jia Pan
68
26
0
15 Nov 2018
Deep Reinforcement Learning
Yuxi Li
VLM
OffRL
194
144
0
15 Oct 2018
HG-DAgger: Interactive Imitation Learning with Human Experts
Michael Kelly
Chelsea Sidrane
Katherine Driggs-Campbell
Mykel J. Kochenderfer
OffRL
255
232
0
05 Oct 2018
Verification for Machine Learning, Autonomy, and Neural Networks Survey
Weiming Xiang
Patrick Musau
A. Wild
Diego Manzanas Lopez
Nathaniel P. Hamilton
Xiaodong Yang
Joel A. Rosenfeld
Taylor T. Johnson
95
102
0
03 Oct 2018
Adding Neural Network Controllers to Behavior Trees without Destroying Performance Guarantees
Christopher Iliffe Sprague
Petter Ögren
67
25
0
26 Sep 2018
Cycle-of-Learning for Autonomous Systems from Human Interaction
Nicholas R. Waytowich
Vinicius G. Goecks
Vernon J. Lawhern
55
20
0
28 Aug 2018
Penalizing side effects using stepwise relative reachability
Victoria Krakovna
Laurent Orseau
Ramana Kumar
Miljan Martic
Shane Legg
98
55
0
04 Jun 2018
AGI Safety Literature Review
Tom Everitt
G. Lea
Marcus Hutter
AI4CE
86
116
0
03 May 2018
Active Reinforcement Learning with Monte-Carlo Tree Search
Sebastian Schulze
Owain Evans
60
14
0
13 Mar 2018
AI Safety Gridworlds
Jan Leike
Miljan Martic
Victoria Krakovna
Pedro A. Ortega
Tom Everitt
Andrew Lefrancq
Laurent Orseau
Shane Legg
158
255
0
27 Nov 2017