Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1806.01186
Cited By
Penalizing side effects using stepwise relative reachability
4 June 2018
Victoria Krakovna
Laurent Orseau
Ramana Kumar
Miljan Martic
Shane Legg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Penalizing side effects using stepwise relative reachability"
21 / 21 papers shown
Title
TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback
Siow Meng Low
Akshat Kumar
60
0
0
17 Apr 2025
Conservative Agency via Attainable Utility Preservation
Alexander Matt Turner
Dylan Hadfield-Menell
Prasad Tadepalli
44
49
0
26 Feb 2019
Preferences Implicit in the State of the World
Rohin Shah
Dmitrii Krasheninnikov
Jordan Alexander
Pieter Abbeel
Anca Dragan
29
55
0
12 Feb 2019
Safe Exploration in Continuous Action Spaces
Gal Dalal
Krishnamurthy Dvijotham
Matej Vecerík
Todd Hester
Cosmin Paduraru
Yuval Tassa
26
435
0
26 Jan 2018
AI Safety Gridworlds
Jan Leike
Miljan Martic
Victoria Krakovna
Pedro A. Ortega
Tom Everitt
Andrew Lefrancq
Laurent Orseau
Shane Legg
84
250
0
27 Nov 2017
Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning
Benjamin Eysenbach
S. Gu
Julian Ibarz
Sergey Levine
CLL
43
139
0
18 Nov 2017
Inverse Reward Design
Dylan Hadfield-Menell
S. Milli
Pieter Abbeel
Stuart J. Russell
Anca Dragan
53
393
0
08 Nov 2017
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Garrett A. Warnell
Nicholas R. Waytowich
Vernon J. Lawhern
Peter Stone
25
267
0
28 Sep 2017
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
OffRL
42
230
0
17 Jul 2017
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
75
3,197
0
12 Jun 2017
Low Impact Artificial Intelligences
Stuart Armstrong
B. Levinstein
26
33
0
30 May 2017
A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems
J. F. Fisac
Anayo K. Akametalu
Melanie Zeilinger
Shahab Kaynama
J. Gillula
Claire Tomlin
33
494
0
03 May 2017
Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear
Zachary Chase Lipton
Kamyar Azizzadenesheli
Jianfeng Gao
Lihong Li
Jianshu Chen
Li Deng
43
34
0
03 Nov 2016
Concrete Problems in AI Safety
Dario Amodei
C. Olah
Jacob Steinhardt
Paul Christiano
John Schulman
Dandelion Mané
130
2,349
0
21 Jun 2016
Safe Exploration in Finite Markov Decision Processes with Gaussian Processes
M. Turchetta
Felix Berkenkamp
Andreas Krause
43
186
0
15 Jun 2016
Cooperative Inverse Reinforcement Learning
Dylan Hadfield-Menell
Anca Dragan
Pieter Abbeel
Stuart J. Russell
37
643
0
09 Jun 2016
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning
S. Mohamed
Danilo Jimenez Rezende
DRL
SSL
41
400
0
29 Sep 2015
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Yinlam Chow
Aviv Tamar
Shie Mannor
Marco Pavone
91
317
0
06 Jun 2015
Risk-sensitive Reinforcement Learning
Yun Shen
Michael J. Tobia
T. Sommer
Klaus Obermayer
36
318
0
08 Nov 2013
Empowerment -- an Introduction
Christoph Salge
C. Glackin
Daniel Polani
41
180
0
07 Oct 2013
Safe Exploration in Markov Decision Processes
T. Moldovan
Pieter Abbeel
89
308
0
22 May 2012
1