Penalizing side effects using stepwise relative reachability

4 June 2018

Papers citing "Penalizing side effects using stepwise relative reachability"

21 / 21 papers shown

Title
TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback Siow Meng Low Akshat Kumar 60 0 0 17 Apr 2025
Conservative Agency via Attainable Utility Preservation Alexander Matt Turner Dylan Hadfield-Menell Prasad Tadepalli 44 49 0 26 Feb 2019
Preferences Implicit in the State of the World Rohin Shah Dmitrii Krasheninnikov Jordan Alexander Pieter Abbeel Anca Dragan 29 55 0 12 Feb 2019
Safe Exploration in Continuous Action Spaces Gal Dalal Krishnamurthy Dvijotham Matej Vecerík Todd Hester Cosmin Paduraru Yuval Tassa 26 435 0 26 Jan 2018
AI Safety Gridworlds Jan Leike Miljan Martic Victoria Krakovna Pedro A. Ortega Tom Everitt Andrew Lefrancq Laurent Orseau Shane Legg 84 250 0 27 Nov 2017
Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning Benjamin Eysenbach S. Gu Julian Ibarz Sergey Levine CLL 43 139 0 18 Nov 2017
Inverse Reward Design Dylan Hadfield-Menell S. Milli Pieter Abbeel Stuart J. Russell Anca Dragan 53 393 0 08 Nov 2017
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces Garrett A. Warnell Nicholas R. Waytowich Vernon J. Lawhern Peter Stone 25 267 0 28 Sep 2017
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention William Saunders Girish Sastry Andreas Stuhlmuller Owain Evans OffRL 42 230 0 17 Jul 2017
Deep reinforcement learning from human preferences Paul Christiano Jan Leike Tom B. Brown Miljan Martic Shane Legg Dario Amodei 75 3,197 0 12 Jun 2017
Low Impact Artificial Intelligences Stuart Armstrong B. Levinstein 26 33 0 30 May 2017
A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems J. F. Fisac Anayo K. Akametalu Melanie Zeilinger Shahab Kaynama J. Gillula Claire Tomlin 33 494 0 03 May 2017
Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear Zachary Chase Lipton Kamyar Azizzadenesheli Jianfeng Gao Lihong Li Jianshu Chen Li Deng 43 34 0 03 Nov 2016
Concrete Problems in AI Safety Dario Amodei C. Olah Jacob Steinhardt Paul Christiano John Schulman Dandelion Mané 130 2,349 0 21 Jun 2016
Safe Exploration in Finite Markov Decision Processes with Gaussian Processes M. Turchetta Felix Berkenkamp Andreas Krause 43 186 0 15 Jun 2016
Cooperative Inverse Reinforcement Learning Dylan Hadfield-Menell Anca Dragan Pieter Abbeel Stuart J. Russell 37 643 0 09 Jun 2016
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning S. Mohamed Danilo Jimenez Rezende DRL SSL 41 400 0 29 Sep 2015
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach Yinlam Chow Aviv Tamar Shie Mannor Marco Pavone 91 317 0 06 Jun 2015
Risk-sensitive Reinforcement Learning Yun Shen Michael J. Tobia T. Sommer Klaus Obermayer 36 318 0 08 Nov 2013
Empowerment -- an Introduction Christoph Salge C. Glackin Daniel Polani 41 180 0 07 Oct 2013
Safe Exploration in Markov Decision Processes T. Moldovan Pieter Abbeel 89 308 0 22 May 2012