1802.07228
The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation
20 February 2018
Miles Brundage
S. Avin
Jack Clark
H. Toner
P. Eckersley
Ben Garfinkel
Allan Dafoe
P. Scharre
Thomas Zeitzoff
Bobby Filar
Hyrum S. Anderson
H. Roff
Gregory C. Allen
Jacob Steinhardt
Carrick Flynn
Seán Ó hÉigeartaigh
S. Beard
Haydn Belfield
Sebastian Farquhar
Clare Lyle
Rebecca Crootof
Owain Evans
Michael Page
Joanna J. Bryson
Roman V. Yampolskiy
Dario Amodei
Papers citing
"The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation"
50 / 66 papers shown
The Steganographic Potentials of Language Models
Artem Karpov
Tinuade Adeleke
Seong Hah Cho
Natalia Perez-Campanero
37
0
0
06 May 2025
Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents
Christian Schroeder de Witt
AAML
AI4CE
231
1
0
04 May 2025
The Precautionary Principle and the Innovation Principle: Incompatible Guides for AI Innovation Governance?
Kim Kaivanto
30
0
0
01 May 2025
Graph of Attacks: Improved Black-Box and Interpretable Jailbreaks for LLMs
Mohammad Akbar-Tajari
Mohammad Taher Pilehvar
Mohammad Mahmoody
AAML
51
0
0
26 Apr 2025
AI Awareness
Xianrui Li
Haoyuan Shi
Rongwu Xu
Wei Xu
59
0
0
25 Apr 2025
Predictable Artificial Intelligence
Lexin Zhou
Pablo Antonio Moreno Casares
Fernando Martínez-Plumed
John Burden
Ryan Burnell
...
Seán Ó hÉigeartaigh
Danaja Rutar
Wout Schellaert
Konstantinos Voudouris
José Hernández-Orallo
56
2
0
08 Jan 2025
SoK: Decentralized AI (DeAI)
Zhipeng Wang
Rui Sun
Elizabeth Lui
Vatsal Shah
Xihan Xiong
Jiahao Sun
Davide Crapis
William Knottenbelt
107
1
0
26 Nov 2024
Safeguarding AI Agents: Developing and Analyzing Safety Architectures
Ishaan Domkundwar
Mukunda N S
Ishaan Bhola
Riddhik Kochhar
LLMAG
31
1
0
03 Sep 2024
Are Large Language Models Really Bias-Free? Jailbreak Prompts for Assessing Adversarial Robustness to Bias Elicitation
Riccardo Cantini
Giada Cosenza
A. Orsino
Domenico Talia
AAML
65
5
0
11 Jul 2024
Input Conditioned Graph Generation for Language Agents
Lukas Vierling
Jie Fu
Kai Chen
LLMAG
63
2
0
17 Jun 2024
On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models
Xinpeng Wang
Shitong Duan
Xiaoyuan Yi
Jing Yao
Shanlin Zhou
Zhihua Wei
Peng Zhang
Dongkuan Xu
Maosong Sun
Xing Xie
OffRL
43
16
0
07 Mar 2024
On the Societal Impact of Open Foundation Models
Sayash Kapoor
Rishi Bommasani
Kevin Klyman
Shayne Longpre
Ashwin Ramaswami
...
Victor Storchan
Daniel Zhang
Daniel E. Ho
Percy Liang
Arvind Narayanan
26
54
0
27 Feb 2024
Generative Models are Self-Watermarked: Declaring Model Authentication through Re-Generation
Aditya Desu
Xuanli He
Qiongkai Xu
Wei Lu
WIGM
32
1
0
23 Feb 2024
Understanding Generative AI in Art: An Interview Study with Artists on G-AI from an HCI Perspective
Jingyu Shi
Rahul Jain
Runlin Duan
Karthik Ramani
40
7
0
19 Oct 2023
The Promise and Peril of Artificial Intelligence -- Violet Teaming Offers a Balanced Path Forward
A. Titus
Adam Russell
38
1
0
28 Aug 2023
AI could create a perfect storm of climate misinformation
V. Galaz
Hannah Metzler
Stefan Daume
A. Olsson
B. Lindström
A. Marklund
26
5
0
22 Jun 2023
Model evaluation for extreme risks
Toby Shevlane
Sebastian Farquhar
Ben Garfinkel
Mary Phuong
Jess Whittlestone
...
Vijay Bolina
Jack Clark
Yoshua Bengio
Paul Christiano
Allan Dafoe
ELM
46
152
0
24 May 2023
Fairness in AI and Its Long-Term Implications on Society
Ondrej Bohdal
Timothy M. Hospedales
Philip Torr
Fazl Barez
15
4
0
16 Apr 2023
Learning Personalized Decision Support Policies
Umang Bhatt
Valerie Chen
Katherine M. Collins
Parameswaran Kamalaruban
Emma Kallina
Adrian Weller
Ameet Talwalkar
OffRL
56
10
0
13 Apr 2023
Parents and Children: Distinguishing Multimodal DeepFakes from Natural Images
Roberto Amoroso
Davide Morelli
Marcella Cornia
Lorenzo Baraldi
A. Bimbo
Rita Cucchiara
DiffM
41
29
0
02 Apr 2023
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
Pierre Fernandez
Guillaume Couairon
Hervé Jégou
Matthijs Douze
Teddy Furon
WIGM
31
177
0
27 Mar 2023
Both eyes open: Vigilant Incentives help Regulatory Markets improve AI Safety
Paolo Bova
A. D. Stefano
H. Anh
31
4
0
06 Mar 2023
The Gradient of Generative AI Release: Methods and Considerations
Irene Solaiman
36
98
0
05 Feb 2023
Foundation models in brief: A historical, socio-technical focus
Johannes Schneider
VLM
29
9
0
17 Dec 2022
Analysis of Anomalous Behavior in Network Systems Using Deep Reinforcement Learning with CNN Architecture
Mohammad Hossein Modirrousta
Parisa Forghani
M. A. Shoorehdeli
35
0
0
29 Nov 2022
Self-Destructing Models: Increasing the Costs of Harmful Dual Uses of Foundation Models
Peter Henderson
E. Mitchell
Christopher D. Manning
Dan Jurafsky
Chelsea Finn
25
47
0
27 Nov 2022
The European AI Liability Directives -- Critique of a Half-Hearted Approach and Lessons for the Future
P. Hacker
AILaw
31
61
0
25 Nov 2022
CaloMan: Fast generation of calorimeter showers with density estimation on learned manifolds
Jesse C. Cresswell
Brendan Leigh Ross
G. Loaiza-Ganem
H. Reyes-González
Marco Letizia
Anthony L. Caterini
24
36
0
23 Nov 2022
Sociotechnical Harms of Algorithmic Systems: Scoping a Taxonomy for Harm Reduction
Renee Shelby
Shalaleh Rismani
Kathryn Henne
AJung Moon
Negar Rostamzadeh
...
N'Mah Yilla-Akbari
Jess Gallegos
A. Smart
Emilio Garcia
Gurleen Virk
47
188
0
11 Oct 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
34
91
0
29 Aug 2022
The Fallacy of AI Functionality
Inioluwa Deborah Raji
Indra Elizabeth Kumar
Aaron Horowitz
Andrew D. Selbst
34
180
0
20 Jun 2022
X-Risk Analysis for AI Research
Dan Hendrycks
Mantas Mazeika
38
68
0
13 Jun 2022
Synthetic Disinformation Attacks on Automated Fact Verification Systems
Y. Du
Antoine Bosselut
Christopher D. Manning
AAML
OffRL
36
32
0
18 Feb 2022
Deep Q-Learning based Reinforcement Learning Approach for Network Intrusion Detection
Hooman Alavizadeh
Julian Jang
Hootan Alavizadeh
27
130
0
27 Nov 2021
Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models
Jen-Hao Rick Chang
A. Shrivastava
H. Koppula
Xiaoshuai Zhang
Oncel Tuzel
DiffM
51
16
0
06 Oct 2021
Why and How Governments Should Monitor AI Development
Jess Whittlestone
Jack Clark
35
30
0
28 Aug 2021
Cyber-Security Challenges in Aviation Industry: A Review of Current and Future Trends
Elochukwu A. Ukwandu
M. B. Farah
Hanan Hindy
Miroslav Bures
Robert C. Atkinson
Christos Tachtatzis
X. Bellekens
24
62
0
10 Jul 2021
The Threat of Offensive AI to Organizations
Yisroel Mirsky
Ambra Demontis
J. Kotak
Ram Shankar
Deng Gelei
Liu Yang
Xinming Zhang
Wenke Lee
Yuval Elovici
Battista Biggio
38
81
0
30 Jun 2021
Improved Transformer for High-Resolution GANs
Long Zhao
Zizhao Zhang
Ting Chen
Dimitris N. Metaxas
Han Zhang
ViT
34
95
0
14 Jun 2021
Responsible Disclosure of Generative Models Using Scalable Fingerprinting
Ning Yu
Vladislav Skripniuk
Dingfan Chen
Larry S. Davis
Mario Fritz
WIGM
52
89
0
16 Dec 2020
AI virtues -- The missing link in putting AI ethics into practice
Thilo Hagendorff
24
56
0
25 Nov 2020
Ethical behavior in humans and machines -- Evaluating training data quality for beneficial machine learning
Thilo Hagendorff
21
26
0
26 Aug 2020
Blackbox Trojanising of Deep Learning Models : Using non-intrusive network structure and binary alterations
Jonathan Pan
AAML
9
3
0
02 Aug 2020
Regulating human control over autonomous systems
Mikołaj Firlej
Araz Taeihagh
22
34
0
22 Jul 2020
Online Bayesian Goal Inference for Boundedly-Rational Planning Agents
Zhi-Xuan Tan
Jordyn L. Mann
Tom Silver
J. Tenenbaum
Vikash K. Mansinghka
OffRL
14
89
0
13 Jun 2020
Studying the Transfer of Biases from Programmers to Programs
Christian Johansen
Tore Pedersen
Johanna Johansen
11
7
0
17 May 2020
On the use of Benford's law to detect GAN-generated images
Nicolo Bonettini
Paolo Bestagini
Simone Milani
Stefano Tubaro
GAN
13
38
0
16 Apr 2020
State of the Art on Neural Rendering
A. Tewari
Ohad Fried
Justus Thies
Vincent Sitzmann
Stephen Lombardi
...
Christian Theobalt
Maneesh Agrawala
Eli Shechtman
Dan B. Goldman
Michael Zollhöfer
3DH
3DV
39
466
0
08 Apr 2020
Distance-Based Learning from Errors for Confidence Calibration
Chen Xing
Sercan Ö. Arik
Zizhao Zhang
Tomas Pfister
FedML
23
39
0
03 Dec 2019
Forbidden knowledge in machine learning -- Reflections on the limits of research and publication
Thilo Hagendorff
22
14
0
19 Nov 2019