Reinforcement Learning with a Corrupted Reward Channel

23 May 2017

Victoria Krakovna

Papers citing "Reinforcement Learning with a Corrupted Reward Channel"

14 / 64 papers shown

Title | Authors | Tags | Counts | Date
Deceptive Reinforcement Learning Under Adversarial Manipulations on Cost Signals | Yunhan Huang, Quanyan Zhu | OffRL, AAML | 49 84 0 | 24 Jun 2019
Advantage Amplification in Slowly Evolving Latent-State Environments | Martin Mladenov, Ofer Meshi, Jayden Ooi, Dale Schuurmans, Craig Boutilier | OffRL | 26 9 0 | 29 May 2019
Conservative Agency via Attainable Utility Preservation | Alexander Matt Turner, Dylan Hadfield-Menell, Prasad Tadepalli | - | 30 49 0 | 26 Feb 2019
Embedded Agency | A. Demski, Scott Garrabrant | AIFin | 35 34 0 | 25 Feb 2019
Human-Centered Artificial Intelligence and Machine Learning | Mark O. Riedl | SyDa | 35 261 0 | 31 Jan 2019
Scalable agent alignment via reward modeling: a research direction | Jan Leike, David M. Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg | - | 34 397 0 | 19 Nov 2018
The Faults in Our Pi Stars: Security Issues and Open Challenges in Deep Reinforcement Learning | Vahid Behzadan, Arslan Munir | - | 27 27 0 | 23 Oct 2018
Reinforcement Learning with Perturbed Rewards | Jingkang Wang, Yang Liu, Bo Li | NoLa | 30 127 0 | 02 Oct 2018
Reinforcement Learning for Autonomous Defence in Software-Defined Networking | Yi Han, Benjamin I. P. Rubinstein, Tamas Abraham, T. Alpcan, O. Vel, S. Erfani, David Hubczenko, C. Leckie, Paul Montague | AAML | 22 68 0 | 17 Aug 2018
Safe Option-Critic: Learning Safety in the Option-Critic Architecture | Arushi Jain, Khimya Khetarpal, Doina Precup | - | 21 26 0 | 21 Jul 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning | Joshua Romoff, Peter Henderson, Alexandre Piché, Vincent François-Lavet, Joelle Pineau | - | 16 42 0 | 09 May 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches | Md. Zahangir Alom, T. Taha, C. Yakopcic, Stefan Westberg, P. Sidike, Mst Shamima Nasrin, B. Van Essen, A. Awwal, V. Asari | VLM | 31 875 0 | 03 Mar 2018
The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation | Miles Brundage, S. Avin, Jack Clark, H. Toner, P. Eckersley, ..., Owain Evans, Michael Page, Joanna J. Bryson, Roman V. Yampolskiy, Dario Amodei | - | 42 694 0 | 20 Feb 2018
Occam's razor is insufficient to infer the preferences of irrational agents | Stuart Armstrong, Sören Mindermann | - | 27 92 0 | 15 Dec 2017