ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.01109
  4. Cited By
AGI Safety Literature Review

AGI Safety Literature Review

3 May 2018
Tom Everitt
G. Lea
Marcus Hutter
    AI4CE
ArXivPDFHTML

Papers citing "AGI Safety Literature Review"

15 / 15 papers shown
Title
The Elephant in the Room -- Why AI Safety Demands Diverse Teams
The Elephant in the Room -- Why AI Safety Demands Diverse Teams
David Rostcheck
Lara Scheibling
28
0
0
07 May 2024
The Promise and Peril of Artificial Intelligence -- Violet Teaming
  Offers a Balanced Path Forward
The Promise and Peril of Artificial Intelligence -- Violet Teaming Offers a Balanced Path Forward
A. Titus
Adam Russell
28
1
0
28 Aug 2023
Exploring the Constraints on Artificial General Intelligence: A
  Game-Theoretic No-Go Theorem
Exploring the Constraints on Artificial General Intelligence: A Game-Theoretic No-Go Theorem
Mehmet S. Ismail
9
0
0
25 Sep 2022
The Alignment Problem from a Deep Learning Perspective
The Alignment Problem from a Deep Learning Perspective
Richard Ngo
Lawrence Chan
Sören Mindermann
54
181
0
30 Aug 2022
Technology and Consciousness
Technology and Consciousness
John Rushby
Daniel Sánchez
27
4
0
17 Jul 2022
A Survey on AI Assurance
A Survey on AI Assurance
Feras A. Batarseh
Laura J. Freeman
27
65
0
15 Nov 2021
Impossibility Results in AI: A Survey
Impossibility Results in AI: A Survey
Mario Brčič
Roman V. Yampolskiy
8
25
0
01 Sep 2021
Reinforcement Learning Under Moral Uncertainty
Reinforcement Learning Under Moral Uncertainty
Adrien Ecoffet
Joel Lehman
17
32
0
08 Jun 2020
AI safety: state of the field through quantitative lens
AI safety: state of the field through quantitative lens
Mislav Juric
A. Sandic
Mario Brčič
18
24
0
12 Feb 2020
Unsupervised and Generic Short-Term Anticipation of Human Body Motions
Unsupervised and Generic Short-Term Anticipation of Human Body Motions
Kristina Enes
Hassan Errami
Moritz Wolter
Tim Krake
B. Eberhardt
A. Weber
Jorg Zimmermann
OOD
19
1
0
13 Dec 2019
Augmented Utilitarianism for AGI Safety
Augmented Utilitarianism for AGI Safety
Nadisha-Marie Aliman
L. Kester
13
8
0
02 Apr 2019
Scalable agent alignment via reward modeling: a research direction
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
28
392
0
19 Nov 2018
AI safety via debate
AI safety via debate
G. Irving
Paul Christiano
Dario Amodei
201
199
0
02 May 2018
A causal framework for explaining the predictions of black-box
  sequence-to-sequence models
A causal framework for explaining the predictions of black-box sequence-to-sequence models
David Alvarez-Melis
Tommi Jaakkola
CML
227
201
0
06 Jul 2017
Reluplex: An Efficient SMT Solver for Verifying Deep Neural Networks
Reluplex: An Efficient SMT Solver for Verifying Deep Neural Networks
Guy Katz
Clark W. Barrett
D. Dill
Kyle D. Julian
Mykel Kochenderfer
AAML
226
1,835
0
03 Feb 2017
1