Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1412.6572
Cited By
v1
v2
v3 (latest)
Explaining and Harnessing Adversarial Examples
20 December 2014
Ian Goodfellow
Jonathon Shlens
Christian Szegedy
AAML
GAN
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Explaining and Harnessing Adversarial Examples"
50 / 8,387 papers shown
Title
Rethinking Invariance Regularization in Adversarial Training to Improve Robustness-Accuracy Trade-off
Futa Waseda
Ching-Chun Chang
Isao Echizen
AAML
145
0
0
22 Feb 2024
Unleashing the Power of Imbalanced Modality Information for Multi-modal Knowledge Graph Completion
Yichi Zhang
Zhuo Chen
Lei Liang
Hua-zeng Chen
Wen Zhang
97
7
0
22 Feb 2024
Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment
Vyas Raina
Adian Liusie
Mark Gales
AAML
ELM
101
63
0
21 Feb 2024
A Simple and Yet Fairly Effective Defense for Graph Neural Networks
Sofiane Ennadir
Yassine Abbahaddou
J. Lutzeyer
Michalis Vazirgiannis
Henrik Bostrom
AAML
87
16
0
21 Feb 2024
AttackGNN: Red-Teaming GNNs in Hardware Security Using Reinforcement Learning
Vasudev Gohil
Satwik Patnaik
D. Kalathil
Jeyavijayan Rajendran
AAML
101
4
0
21 Feb 2024
Robustness of Deep Neural Networks for Micro-Doppler Radar Classification
Mikolaj Czerkawski
C. Clemente
C. Michie
Christos Tachtatzis
OOD
AAML
27
3
0
21 Feb 2024
Adversarial Purification and Fine-tuning for Robust UDC Image Restoration
Zhenbo Song
Zhenyuan Zhang
Kaihao Zhang
Wenhan Luo
Zhaoxin Fan
Jianfeng Lu
AAML
116
0
0
21 Feb 2024
Flexible Physical Camouflage Generation Based on a Differential Approach
Yang Li
Wenyi Tan
Chenxing Zhao
Shuangju Zhou
Xinkai Liang
Quanbiao Pan
AAML
65
6
0
21 Feb 2024
The Wolf Within: Covert Injection of Malice into MLLM Societies via an MLLM Operative
Zhen Tan
Chengshuai Zhao
Raha Moraffah
Yifan Li
Yu Kong
Tianlong Chen
Huan Liu
94
17
0
20 Feb 2024
QuanTest: Entanglement-Guided Testing of Quantum Neural Network Systems
Jinjing Shi
Zimeng Xiao
Heyuan Shi
Yu Jiang
Xuelong Li
AAML
86
1
0
20 Feb 2024
VGMShield: Mitigating Misuse of Video Generative Models
Yan Pang
Yang Zhang
Yang Zhang
Tianhao Wang
130
3
0
20 Feb 2024
Defending Jailbreak Prompts via In-Context Adversarial Game
Yujun Zhou
Yufei Han
Haomin Zhuang
Kehan Guo
Zhenwen Liang
Hongyan Bao
Xiangliang Zhang
LLMAG
AAML
119
15
0
20 Feb 2024
Roadmap on Incentive Compatibility for AI Alignment and Governance in Sociotechnical Systems
Zhaowei Zhang
Fengshuo Bai
Mingzhi Wang
Haoyang Ye
Chengdong Ma
Yaodong Yang
77
6
0
20 Feb 2024
Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
Christian Schlarmann
Naman D. Singh
Francesco Croce
Matthias Hein
VLM
AAML
106
50
0
19 Feb 2024
Query-Based Adversarial Prompt Generation
Jonathan Hayase
Ema Borevkovic
Nicholas Carlini
Florian Tramèr
Milad Nasr
AAML
SILM
104
32
0
19 Feb 2024
Attacks on Node Attributes in Graph Neural Networks
Ying Xu
Michael Lanier
Anindya Sarkar
Yevgeniy Vorobeychik
GNN
AAML
88
3
0
19 Feb 2024
Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training
L. Park
Jaeuk Kim
Myung Gyo Oh
Jaewoo Park
T.-H. Kwon
AAML
138
5
0
19 Feb 2024
Manipulating hidden-Markov-model inferences by corrupting batch data
William N. Caballero
Jose Manuel Camacho
Tahir Ekin
Roi Naveiro
AAML
63
1
0
19 Feb 2024
Robustness and Exploration of Variational and Machine Learning Approaches to Inverse Problems: An Overview
Alexander Auras
Kanchana Vaishnavi Gandikota
Hannah Droege
Michael Moeller
AAML
81
0
0
19 Feb 2024
AICAttack: Adversarial Image Captioning Attack with Attention-Based Optimization
Jiyao Li
Mingze Ni
Yifei Dong
Tianqing Zhu
Wei Liu
AAML
52
3
0
19 Feb 2024
The Effectiveness of Random Forgetting for Robust Generalization
V. Ramkumar
Bahram Zonooz
Elahe Arani
AAML
68
1
0
18 Feb 2024
Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks
Yichen Wang
Shangbin Feng
Abe Bohan Hou
Xiao Pu
Chao Shen
Xiaoming Liu
Yulia Tsvetkov
Tianxing He
DeLMO
121
20
0
18 Feb 2024
Evaluating Adversarial Robustness of Low dose CT Recovery
Kanchana Vaishnavi Gandikota
Paramanand Chandramouli
Hannah Dröge
Michael Moeller
OOD
AAML
67
3
0
18 Feb 2024
A Curious Case of Searching for the Correlation between Training Data and Adversarial Robustness of Transformer Textual Models
Cuong Dang
Dung D. Le
Thai Le
AAML
80
2
0
18 Feb 2024
Neural Networks with (Low-Precision) Polynomial Approximations: New Insights and Techniques for Accuracy Improvement
Chi Zhang
Jingjing Fan
Man Ho Au
Siu-Ming Yiu
107
1
0
17 Feb 2024
DART: A Principled Approach to Adversarially Robust Unsupervised Domain Adaptation
Yunjuan Wang
Hussein Hazimeh
Natalia Ponomareva
Alexey Kurakin
Ibrahim Hammoud
Raman Arora
OOD
AAML
82
0
0
16 Feb 2024
VQAttack: Transferable Adversarial Attacks on Visual Question Answering via Pre-trained Models
Ziyi Yin
Muchao Ye
Tianrong Zhang
Jiaqi Wang
Han Liu
Jinghui Chen
Ting Wang
Fenglong Ma
OOD
AAML
51
2
0
16 Feb 2024
Generalizability of Mixture of Domain-Specific Adapters from the Lens of Signed Weight Directions and its Application to Effective Model Pruning
Tuc Nguyen
Thai Le
MoMe
95
3
0
16 Feb 2024
Theoretical Understanding of Learning from Adversarial Perturbations
Soichiro Kumano
Hiroshi Kera
Toshihiko Yamasaki
AAML
96
3
0
16 Feb 2024
Quantum-Inspired Analysis of Neural Network Vulnerabilities: The Role of Conjugate Variables in System Attacks
Jun-Jie Zhang
Deyu Meng
AAML
83
3
0
16 Feb 2024
Feature Accentuation: Revealing 'What' Features Respond to in Natural Images
Christopher Hamblin
Thomas Fel
Srijani Saha
Talia Konkle
George A. Alvarez
FAtt
103
3
0
15 Feb 2024
Risk-Sensitive Soft Actor-Critic for Robust Deep Reinforcement Learning under Distribution Shifts
Tobias Enders
James Harrison
Maximilian Schiffer
OOD
99
5
0
15 Feb 2024
Camouflage is all you need: Evaluating and Enhancing Language Model Robustness Against Camouflage Adversarial Attacks
Álvaro Huertas-García
Alejandro Martín
Javier Huertas-Tato
David Camacho
AAML
73
0
0
15 Feb 2024
Examining Pathological Bias in a Generative Adversarial Network Discriminator: A Case Study on a StyleGAN3 Model
Alvin Grissom II
Ryan F. Lei
Matt Gusdorff
Jeova Farias Sales Rocha Neto
Bailey Lin
Ryan Trotter
GAN
91
0
0
15 Feb 2024
Reducing Texture Bias of Deep Neural Networks via Edge Enhancing Diffusion
Edgar Heinert
Matthias Rottmann
Kira Maag
Karsten Kahl
70
6
0
14 Feb 2024
Stability and Multigroup Fairness in Ranking with Uncertain Predictions
Siddartha Devic
Aleksandra Korolova
David Kempe
Vatsal Sharan
104
6
0
14 Feb 2024
Only My Model On My Data: A Privacy Preserving Approach Protecting one Model and Deceiving Unauthorized Black-Box Models
Weiheng Chai
Brian Testa
Huantao Ren
Asif Salekin
Senem Velipasalar
42
0
0
14 Feb 2024
Exploring the Adversarial Capabilities of Large Language Models
Lukas Struppek
Minh Hieu Le
Dominik Hintersdorf
Kristian Kersting
ELM
AAML
71
4
0
14 Feb 2024
End-to-End Training Induces Information Bottleneck through Layer-Role Differentiation: A Comparative Analysis with Layer-wise Training
Keitaro Sakamoto
Issei Sato
77
4
0
14 Feb 2024
Detecting Adversarial Spectrum Attacks via Distance to Decision Boundary Statistics
Wenwei Zhao
Xiaowen Li
Shangqing Zhao
Jie Xu
Yao-Hong Liu
Zhuo Lu
AAML
62
1
0
14 Feb 2024
Is my Data in your AI Model? Membership Inference Test with Application to Face Images
Daniel DeAlcala
Aythami Morales
Julian Fierrez
Gonzalo Mancera
Ruben Tolosana
J. Ortega-Garcia
CVBM
124
7
0
14 Feb 2024
Soft Prompt Threats: Attacking Safety Alignment and Unlearning in Open-Source LLMs through the Embedding Space
Leo Schwinn
David Dobre
Sophie Xhonneux
Gauthier Gidel
Stephan Gunnemann
AAML
159
49
0
14 Feb 2024
Adversarially Robust Feature Learning for Breast Cancer Diagnosis
Degan Hao
Dooman Arefan
M. Zuley
Wendie Berg
Shandong Wu
OOD
MedIm
68
1
0
13 Feb 2024
Enhancing Robustness of Indoor Robotic Navigation with Free-Space Segmentation Models Against Adversarial Attacks
Qiyuan An
Christos Sevastopoulos
F. Makedon
60
1
0
13 Feb 2024
Generating Universal Adversarial Perturbations for Quantum Classifiers
Gautham Anil
Vishnu Vinod
Apurva Narayan
AAML
82
5
0
13 Feb 2024
Faster Repeated Evasion Attacks in Tree Ensembles
Lorenzo Cascioli
Laurens Devos
Ondvrej Kuvzelka
Jesse Davis
AAML
62
0
0
13 Feb 2024
Test-Time Backdoor Attacks on Multimodal Large Language Models
Dong Lu
Tianyu Pang
Chao Du
Qian Liu
Xianjun Yang
Min Lin
AAML
167
26
0
13 Feb 2024
Two Tales of Single-Phase Contrastive Hebbian Learning
R. Høier
Christopher Zach
64
1
0
13 Feb 2024
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Xiangming Gu
Xiaosen Zheng
Tianyu Pang
Chao Du
Qian Liu
Ye Wang
Jing Jiang
Min Lin
LLMAG
LM&Ro
52
63
0
13 Feb 2024
Intriguing Differences Between Zero-Shot and Systematic Evaluations of Vision-Language Transformer Models
Shaeke Salman
M. Shams
Xiuwen Liu
Lingjiong Zhu
VLM
60
2
0
13 Feb 2024
Previous
1
2
3
...
28
29
30
...
166
167
168
Next