ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1806.01186
  4. Cited By
Penalizing side effects using stepwise relative reachability

Penalizing side effects using stepwise relative reachability

4 June 2018
Victoria Krakovna
Laurent Orseau
Ramana Kumar
Miljan Martic
Shane Legg
ArXivPDFHTML

Papers citing "Penalizing side effects using stepwise relative reachability"

21 / 21 papers shown
Title
TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback
TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback
Siow Meng Low
Akshat Kumar
60
0
0
17 Apr 2025
Conservative Agency via Attainable Utility Preservation
Conservative Agency via Attainable Utility Preservation
Alexander Matt Turner
Dylan Hadfield-Menell
Prasad Tadepalli
44
49
0
26 Feb 2019
Preferences Implicit in the State of the World
Preferences Implicit in the State of the World
Rohin Shah
Dmitrii Krasheninnikov
Jordan Alexander
Pieter Abbeel
Anca Dragan
29
55
0
12 Feb 2019
Safe Exploration in Continuous Action Spaces
Safe Exploration in Continuous Action Spaces
Gal Dalal
Krishnamurthy Dvijotham
Matej Vecerík
Todd Hester
Cosmin Paduraru
Yuval Tassa
26
435
0
26 Jan 2018
AI Safety Gridworlds
AI Safety Gridworlds
Jan Leike
Miljan Martic
Victoria Krakovna
Pedro A. Ortega
Tom Everitt
Andrew Lefrancq
Laurent Orseau
Shane Legg
84
250
0
27 Nov 2017
Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement
  Learning
Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning
Benjamin Eysenbach
S. Gu
Julian Ibarz
Sergey Levine
CLL
43
139
0
18 Nov 2017
Inverse Reward Design
Inverse Reward Design
Dylan Hadfield-Menell
S. Milli
Pieter Abbeel
Stuart J. Russell
Anca Dragan
53
393
0
08 Nov 2017
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Garrett A. Warnell
Nicholas R. Waytowich
Vernon J. Lawhern
Peter Stone
25
267
0
28 Sep 2017
Trial without Error: Towards Safe Reinforcement Learning via Human
  Intervention
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
OffRL
42
230
0
17 Jul 2017
Deep reinforcement learning from human preferences
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
75
3,197
0
12 Jun 2017
Low Impact Artificial Intelligences
Low Impact Artificial Intelligences
Stuart Armstrong
B. Levinstein
26
33
0
30 May 2017
A General Safety Framework for Learning-Based Control in Uncertain
  Robotic Systems
A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems
J. F. Fisac
Anayo K. Akametalu
Melanie Zeilinger
Shahab Kaynama
J. Gillula
Claire Tomlin
33
494
0
03 May 2017
Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear
Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear
Zachary Chase Lipton
Kamyar Azizzadenesheli
Jianfeng Gao
Lihong Li
Jianshu Chen
Li Deng
43
34
0
03 Nov 2016
Concrete Problems in AI Safety
Concrete Problems in AI Safety
Dario Amodei
C. Olah
Jacob Steinhardt
Paul Christiano
John Schulman
Dandelion Mané
130
2,349
0
21 Jun 2016
Safe Exploration in Finite Markov Decision Processes with Gaussian
  Processes
Safe Exploration in Finite Markov Decision Processes with Gaussian Processes
M. Turchetta
Felix Berkenkamp
Andreas Krause
43
186
0
15 Jun 2016
Cooperative Inverse Reinforcement Learning
Cooperative Inverse Reinforcement Learning
Dylan Hadfield-Menell
Anca Dragan
Pieter Abbeel
Stuart J. Russell
37
643
0
09 Jun 2016
Variational Information Maximisation for Intrinsically Motivated
  Reinforcement Learning
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning
S. Mohamed
Danilo Jimenez Rezende
DRL
SSL
41
400
0
29 Sep 2015
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Yinlam Chow
Aviv Tamar
Shie Mannor
Marco Pavone
91
317
0
06 Jun 2015
Risk-sensitive Reinforcement Learning
Risk-sensitive Reinforcement Learning
Yun Shen
Michael J. Tobia
T. Sommer
Klaus Obermayer
36
318
0
08 Nov 2013
Empowerment -- an Introduction
Empowerment -- an Introduction
Christoph Salge
C. Glackin
Daniel Polani
41
180
0
07 Oct 2013
Safe Exploration in Markov Decision Processes
Safe Exploration in Markov Decision Processes
T. Moldovan
Pieter Abbeel
89
308
0
22 May 2012
1