1806.01186
Cited By
v1
v2 (latest)
Penalizing side effects using stepwise relative reachability
4 June 2018
Victoria Krakovna
Laurent Orseau
Ramana Kumar
Miljan Martic
Shane Legg
Papers citing
"Penalizing side effects using stepwise relative reachability"
21 / 21 papers shown
Title
TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback
Siow Meng Low
Akshat Kumar
101
0
0
17 Apr 2025
Conservative Agency via Attainable Utility Preservation
Alexander Matt Turner
Dylan Hadfield-Menell
Prasad Tadepalli
82
49
0
26 Feb 2019
Preferences Implicit in the State of the World
Rohin Shah
Dmitrii Krasheninnikov
Jordan Alexander
Pieter Abbeel
Anca Dragan
70
55
0
12 Feb 2019
Safe Exploration in Continuous Action Spaces
Gal Dalal
Krishnamurthy Dvijotham
Matej Vecerík
Todd Hester
Cosmin Paduraru
Yuval Tassa
53
443
0
26 Jan 2018
AI Safety Gridworlds
Jan Leike
Miljan Martic
Victoria Krakovna
Pedro A. Ortega
Tom Everitt
Andrew Lefrancq
Laurent Orseau
Shane Legg
118
255
0
27 Nov 2017
Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning
Benjamin Eysenbach
S. Gu
Julian Ibarz
Sergey Levine
CLL
66
139
0
18 Nov 2017
Inverse Reward Design
Dylan Hadfield-Menell
S. Milli
Pieter Abbeel
Stuart J. Russell
Anca Dragan
86
399
0
08 Nov 2017
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Garrett A. Warnell
Nicholas R. Waytowich
Vernon J. Lawhern
Peter Stone
72
272
0
28 Sep 2017
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
OffRL
70
231
0
17 Jul 2017
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
218
3,377
0
12 Jun 2017
Low Impact Artificial Intelligences
Stuart Armstrong
B. Levinstein
48
33
0
30 May 2017
A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems
J. F. Fisac
Anayo K. Akametalu
Melanie Zeilinger
Shahab Kaynama
J. Gillula
Claire Tomlin
63
498
0
03 May 2017
Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear
Zachary Chase Lipton
Kamyar Azizzadenesheli
Jianfeng Gao
Lihong Li
Jianshu Chen
Li Deng
95
34
0
03 Nov 2016
Concrete Problems in AI Safety
Dario Amodei
C. Olah
Jacob Steinhardt
Paul Christiano
John Schulman
Dandelion Mané
253
2,405
0
21 Jun 2016
Safe Exploration in Finite Markov Decision Processes with Gaussian Processes
M. Turchetta
Felix Berkenkamp
Andreas Krause
89
189
0
15 Jun 2016
Cooperative Inverse Reinforcement Learning
Dylan Hadfield-Menell
Anca Dragan
Pieter Abbeel
Stuart J. Russell
99
644
0
09 Jun 2016
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning
S. Mohamed
Danilo Jimenez Rezende
DRL
SSL
99
402
0
29 Sep 2015
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Yinlam Chow
Aviv Tamar
Shie Mannor
Marco Pavone
130
323
0
06 Jun 2015
Risk-sensitive Reinforcement Learning
Yun Shen
Michael J. Tobia
T. Sommer
Klaus Obermayer
98
320
0
08 Nov 2013
Empowerment -- an Introduction
Christoph Salge
C. Glackin
Daniel Polani
102
182
0
07 Oct 2013
Safe Exploration in Markov Decision Processes
T. Moldovan
Pieter Abbeel
155
311
0
22 May 2012