Trial without Error: Towards Safe Reinforcement Learning via Human Intervention

17 July 2017

Papers citing "Trial without Error: Towards Safe Reinforcement Learning via Human Intervention"

37 / 87 papers shown

Title
LaND: Learning to Navigate from Disengagements G. Kahn Pieter Abbeel Sergey Levine 82 52 0 09 Oct 2020
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems Vinicius G. Goecks 115 11 0 30 Aug 2020
Meta Reinforcement Learning-Based Lane Change Strategy for Autonomous Vehicles Fei Ye Pin Wang Ching-yao Chan Jiucai Zhang 47 20 0 28 Aug 2020
Battlesnake Challenge: A Multi-agent Reinforcement Learning Playground with Human-in-the-loop Jonathan Chung Anna Luo Xavier Raffin Scott Perry OffRL 50 3 0 20 Jul 2020
ShieldNN: A Provably Safe NN Filter for Unsafe NN Controllers James Ferlez Mahmoud M. Elnaggar Yasser Shoukry C. Fleming AAML 99 33 0 16 Jun 2020
Pessimism About Unknown Unknowns Inspires Conservatism Michael K. Cohen Marcus Hutter 64 13 0 15 Jun 2020
Open Questions in Creating Safe Open-ended AI: Tensions Between Control and Creativity Adrien Ecoffet Jeff Clune Joel Lehman 88 16 0 12 Jun 2020
Reinforcement Learning Under Moral Uncertainty Adrien Ecoffet Joel Lehman 125 32 0 08 Jun 2020
AI Research Considerations for Human Existential Safety (ARCHES) Andrew Critch David M. Krueger 112 53 0 30 May 2020
Firearm Detection and Segmentation Using an Ensemble of Semantic Neural Networks Alexander Egiazarov Vasileios Mavroeidis Fabio Massimo Zennaro Kamer Vishi 22 22 0 11 Feb 2020
Automated Lane Change Strategy using Proximal Policy Optimization-based Deep Reinforcement Learning Fei Ye Xuxin Cheng Pin Wang Ching-yao Chan Jiucai Zhang 42 100 0 07 Feb 2020
Learning Human Objectives by Evaluating Hypothetical Behavior S. Reddy Anca Dragan Sergey Levine Shane Legg Jan Leike 87 77 0 05 Dec 2019
Faster and Safer Training by Embedding High-Level Knowledge into Deep Reinforcement Learning Haodi Zhang Zihang Gao Yi Zhou Haotong Zhang Kaishun Wu Fangzhen Lin AI4CE 59 17 0 22 Oct 2019
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments Vinicius G. Goecks Gregory M. Gremillion Vernon J. Lawhern J. Valasek Nicholas R. Waytowich OffRL 110 31 0 09 Oct 2019
Leveraging Human Guidance for Deep Reinforcement Learning Tasks Ruohan Zhang F. Torabi L. Guan D. Ballard Peter Stone 65 87 0 21 Sep 2019
Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective Tom Everitt Marcus Hutter Ramana Kumar Victoria Krakovna 105 97 0 13 Aug 2019
Generalizing from a few environments in safety-critical reinforcement learning Zachary Kenton Angelos Filos Owain Evans Y. Gal 87 16 0 02 Jul 2019
Towards Empathic Deep Q-Learning Bart Bussmann Jacqueline Heinerman Joel Lehman AI4CE 72 11 0 26 Jun 2019
Evolutionary Computation and AI Safety: Research Problems Impeding Routine and Safe Real-world Application of Evolution Joel Lehman 73 7 0 24 Jun 2019
Improving Safety in Reinforcement Learning Using Model-Based Architectures and Human Intervention Bharat Prakash Mohit Khatwani Nicholas R. Waytowich T. Mohsenin OffRL 58 19 0 22 Mar 2019
Conservative Agency via Attainable Utility Preservation Alexander Matt Turner Dylan Hadfield-Menell Prasad Tadepalli 120 49 0 26 Feb 2019
Parenting: Safe Reinforcement Learning from Human Input Christopher Frye Ilya Feige 76 7 0 18 Feb 2019
Go-Explore: a New Approach for Hard-Exploration Problems Adrien Ecoffet Joost Huizinga Joel Lehman Kenneth O. Stanley Jeff Clune AI4TS 130 370 0 30 Jan 2019
Impossibility and Uncertainty Theorems in AI Value Alignment (or why your AGI should not have a utility function) P. Eckersley 119 46 0 31 Dec 2018
Residual Reinforcement Learning for Robot Control T. Johannink Shikhar Bahl Ashvin Nair Jianlan Luo Avinash Kumar M. Loskyll J. A. Ojea Eugen Solowjow Sergey Levine OffRL 90 420 0 07 Dec 2018
Scalable agent alignment via reward modeling: a research direction Jan Leike David M. Krueger Tom Everitt Miljan Martic Vishal Maini Shane Legg 124 421 0 19 Nov 2018
Reward learning from human preferences and demonstrations in Atari Borja Ibarz Jan Leike Tobias Pohlen G. Irving Shane Legg Dario Amodei 131 398 0 15 Nov 2018
Intervention Aided Reinforcement Learning for Safe and Practical Policy Optimization in Navigation Fan Wang Bo Zhou Ke Chen Tingxiang Fan Xi Zhang Jiangyong Li Hao Tian Jia Pan 68 26 0 15 Nov 2018
Deep Reinforcement Learning Yuxi Li VLM OffRL 194 144 0 15 Oct 2018
HG-DAgger: Interactive Imitation Learning with Human Experts Michael Kelly Chelsea Sidrane Katherine Driggs-Campbell Mykel J. Kochenderfer OffRL 255 232 0 05 Oct 2018
Verification for Machine Learning, Autonomy, and Neural Networks Survey Weiming Xiang Patrick Musau A. Wild Diego Manzanas Lopez Nathaniel P. Hamilton Xiaodong Yang Joel A. Rosenfeld Taylor T. Johnson 95 102 0 03 Oct 2018
Adding Neural Network Controllers to Behavior Trees without Destroying Performance Guarantees Christopher Iliffe Sprague Petter Ögren 67 25 0 26 Sep 2018
Cycle-of-Learning for Autonomous Systems from Human Interaction Nicholas R. Waytowich Vinicius G. Goecks Vernon J. Lawhern 55 20 0 28 Aug 2018
Penalizing side effects using stepwise relative reachability Victoria Krakovna Laurent Orseau Ramana Kumar Miljan Martic Shane Legg 98 55 0 04 Jun 2018
AGI Safety Literature Review Tom Everitt G. Lea Marcus Hutter AI4CE 86 116 0 03 May 2018
Active Reinforcement Learning with Monte-Carlo Tree Search Sebastian Schulze Owain Evans 60 14 0 13 Mar 2018
AI Safety Gridworlds Jan Leike Miljan Martic Victoria Krakovna Pedro A. Ortega Tom Everitt Andrew Lefrancq Laurent Orseau Shane Legg 158 255 0 27 Nov 2017