AGI Safety Literature Review

3 May 2018

Marcus Hutter

Papers citing "AGI Safety Literature Review"

15 / 15 papers shown

Title | Authors | Topic | Counts | Date
The Elephant in the Room -- Why AI Safety Demands Diverse Teams | David Rostcheck, Lara Scheibling | | 28 0 0 | 07 May 2024
The Promise and Peril of Artificial Intelligence -- Violet Teaming Offers a Balanced Path Forward | A. Titus, Adam Russell | | 28 1 0 | 28 Aug 2023
Exploring the Constraints on Artificial General Intelligence: A Game-Theoretic No-Go Theorem | Mehmet S. Ismail | | 9 0 0 | 25 Sep 2022
The Alignment Problem from a Deep Learning Perspective | Richard Ngo, Lawrence Chan, Sören Mindermann | | 54 181 0 | 30 Aug 2022
Technology and Consciousness | John Rushby, Daniel Sánchez | | 27 4 0 | 17 Jul 2022
A Survey on AI Assurance | Feras A. Batarseh, Laura J. Freeman | | 27 65 0 | 15 Nov 2021
Impossibility Results in AI: A Survey | Mario Brčič, Roman V. Yampolskiy | | 8 25 0 | 01 Sep 2021
Reinforcement Learning Under Moral Uncertainty | Adrien Ecoffet, Joel Lehman | | 17 32 0 | 08 Jun 2020
AI safety: state of the field through quantitative lens | Mislav Juric, A. Sandic, Mario Brčič | | 18 24 0 | 12 Feb 2020
Unsupervised and Generic Short-Term Anticipation of Human Body Motions | Kristina Enes, Hassan Errami, Moritz Wolter, Tim Krake, B. Eberhardt, A. Weber, Jorg Zimmermann | OOD | 19 1 0 | 13 Dec 2019
Augmented Utilitarianism for AGI Safety | Nadisha-Marie Aliman, L. Kester | | 13 8 0 | 02 Apr 2019
Scalable agent alignment via reward modeling: a research direction | Jan Leike, David M. Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg | | 28 392 0 | 19 Nov 2018
AI safety via debate | G. Irving, Paul Christiano, Dario Amodei | | 201 199 0 | 02 May 2018
A causal framework for explaining the predictions of black-box sequence-to-sequence models | David Alvarez-Melis, Tommi Jaakkola | CML | 227 201 0 | 06 Jul 2017
Reluplex: An Efficient SMT Solver for Verifying Deep Neural Networks | Guy Katz, Clark W. Barrett, D. Dill, Kyle D. Julian, Mykel Kochenderfer | AAML | 226 1,835 0 | 03 Feb 2017