v1v2 (latest)

Penalizing side effects using stepwise relative reachability

4 June 2018

Papers citing "Penalizing side effects using stepwise relative reachability"

21 / 21 papers shown

Title
TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback Siow Meng Low Akshat Kumar 101 0 0 17 Apr 2025
Conservative Agency via Attainable Utility Preservation Alexander Matt Turner Dylan Hadfield-Menell Prasad Tadepalli 82 49 0 26 Feb 2019
Preferences Implicit in the State of the World Rohin Shah Dmitrii Krasheninnikov Jordan Alexander Pieter Abbeel Anca Dragan 70 55 0 12 Feb 2019
Safe Exploration in Continuous Action Spaces Gal Dalal Krishnamurthy Dvijotham Matej Vecerík Todd Hester Cosmin Paduraru Yuval Tassa 53 443 0 26 Jan 2018
AI Safety Gridworlds Jan Leike Miljan Martic Victoria Krakovna Pedro A. Ortega Tom Everitt Andrew Lefrancq Laurent Orseau Shane Legg 118 255 0 27 Nov 2017
Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning Benjamin Eysenbach S. Gu Julian Ibarz Sergey Levine CLL 66 139 0 18 Nov 2017
Inverse Reward Design Dylan Hadfield-Menell S. Milli Pieter Abbeel Stuart J. Russell Anca Dragan 86 399 0 08 Nov 2017
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces Garrett A. Warnell Nicholas R. Waytowich Vernon J. Lawhern Peter Stone 72 272 0 28 Sep 2017
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention William Saunders Girish Sastry Andreas Stuhlmuller Owain Evans OffRL 70 231 0 17 Jul 2017
Deep reinforcement learning from human preferences Paul Christiano Jan Leike Tom B. Brown Miljan Martic Shane Legg Dario Amodei 218 3,377 0 12 Jun 2017
Low Impact Artificial Intelligences Stuart Armstrong B. Levinstein 48 33 0 30 May 2017
A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems J. F. Fisac Anayo K. Akametalu Melanie Zeilinger Shahab Kaynama J. Gillula Claire Tomlin 63 498 0 03 May 2017
Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear Zachary Chase Lipton Kamyar Azizzadenesheli Jianfeng Gao Lihong Li Jianshu Chen Li Deng 95 34 0 03 Nov 2016
Concrete Problems in AI Safety Dario Amodei C. Olah Jacob Steinhardt Paul Christiano John Schulman Dandelion Mané 253 2,405 0 21 Jun 2016
Safe Exploration in Finite Markov Decision Processes with Gaussian Processes M. Turchetta Felix Berkenkamp Andreas Krause 89 189 0 15 Jun 2016
Cooperative Inverse Reinforcement Learning Dylan Hadfield-Menell Anca Dragan Pieter Abbeel Stuart J. Russell 99 644 0 09 Jun 2016
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning S. Mohamed Danilo Jimenez Rezende DRL SSL 99 402 0 29 Sep 2015
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach Yinlam Chow Aviv Tamar Shie Mannor Marco Pavone 130 323 0 06 Jun 2015
Risk-sensitive Reinforcement Learning Yun Shen Michael J. Tobia T. Sommer Klaus Obermayer 98 320 0 08 Nov 2013
Empowerment -- an Introduction Christoph Salge C. Glackin Daniel Polani 102 182 0 07 Oct 2013
Safe Exploration in Markov Decision Processes T. Moldovan Pieter Abbeel 155 311 0 22 May 2012