1705.08417
Reinforcement Learning with a Corrupted Reward Channel
23 May 2017
Tom Everitt
Victoria Krakovna
Laurent Orseau
Marcus Hutter
Shane Legg
Papers citing "Reinforcement Learning with a Corrupted Reward Channel"
14 / 64 papers shown
Deceptive Reinforcement Learning Under Adversarial Manipulations on Cost Signals
Yunhan Huang
Quanyan Zhu
OffRL
AAML
49
84
0
24 Jun 2019
Advantage Amplification in Slowly Evolving Latent-State Environments
Martin Mladenov
Ofer Meshi
Jayden Ooi
Dale Schuurmans
Craig Boutilier
OffRL
26
9
0
29 May 2019
Conservative Agency via Attainable Utility Preservation
Alexander Matt Turner
Dylan Hadfield-Menell
Prasad Tadepalli
30
49
0
26 Feb 2019
Embedded Agency
A. Demski
Scott Garrabrant
AIFin
35
34
0
25 Feb 2019
Human-Centered Artificial Intelligence and Machine Learning
Mark O. Riedl
SyDa
35
261
0
31 Jan 2019
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
34
397
0
19 Nov 2018
The Faults in Our Pi Stars: Security Issues and Open Challenges in Deep Reinforcement Learning
Vahid Behzadan
Arslan Munir
27
27
0
23 Oct 2018
Reinforcement Learning with Perturbed Rewards
Jingkang Wang
Yang Liu
Bo Li
NoLa
30
127
0
02 Oct 2018
Reinforcement Learning for Autonomous Defence in Software-Defined Networking
Yi Han
Benjamin I. P. Rubinstein
Tamas Abraham
T. Alpcan
O. Vel
S. Erfani
David Hubczenko
C. Leckie
Paul Montague
AAML
22
68
0
17 Aug 2018
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
Arushi Jain
Khimya Khetarpal
Doina Precup
21
26
0
21 Jul 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Joshua Romoff
Peter Henderson
Alexandre Piché
Vincent François-Lavet
Joelle Pineau
16
42
0
09 May 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
31
875
0
03 Mar 2018
The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation
Miles Brundage
S. Avin
Jack Clark
H. Toner
P. Eckersley
...
Owain Evans
Michael Page
Joanna J. Bryson
Roman V. Yampolskiy
Dario Amodei
42
694
0
20 Feb 2018
Occam's razor is insufficient to infer the preferences of irrational agents
Stuart Armstrong
Sören Mindermann
27
92
0
15 Dec 2017