Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.06083
Cited By
v1
v2
v3
v4 (latest)
Towards Deep Learning Models Resistant to Adversarial Attacks
19 June 2017
Aleksander Madry
Aleksandar Makelov
Ludwig Schmidt
Dimitris Tsipras
Adrian Vladu
SILM
OOD
Re-assign community
ArXiv (abs)
PDF
HTML
Github (752★)
Papers citing
"Towards Deep Learning Models Resistant to Adversarial Attacks"
50 / 6,612 papers shown
Title
Fundamental Limits of Perfect Concept Erasure
Somnath Basu Roy Chowdhury
Avinava Dubey
Ahmad Beirami
Rahul Kidambi
Nicholas Monath
Amr Ahmed
Snigdha Chaturvedi
107
1
0
25 Mar 2025
Bitstream Collisions in Neural Image Compression via Adversarial Perturbations
Jordan Madden
Lhamo Dorje
Xiaohua Li
AAML
79
0
0
25 Mar 2025
Quality-focused Active Adversarial Policy for Safe Grasping in Human-Robot Interaction
Chenghao Li
Razvan Beuran
Nak Young Chong
AAML
135
0
0
25 Mar 2025
LLaVAction: evaluating and training multi-modal large language models for action recognition
Shaokai Ye
Haozhe Qi
Alexander Mathis
Mackenzie W. Mathis
134
1
0
24 Mar 2025
STShield: Single-Token Sentinel for Real-Time Jailbreak Detection in Large Language Models
Xunguang Wang
Wenxuan Wang
Zhenlan Ji
Zongjie Li
Pingchuan Ma
Daoyuan Wu
Shuai Wang
103
3
0
23 Mar 2025
Opportunities and Challenges of Frontier Data Governance With Synthetic Data
Madhavendra Thakur
Jason Hausenloy
91
0
0
21 Mar 2025
EasyRobust: A Comprehensive and Easy-to-use Toolkit for Robust and Generalized Vision
Xiaofeng Mao
YueFeng Chen
Rong Zhang
Hui Xue
Zhao Li
Hang Su
AAML
VLM
81
0
0
21 Mar 2025
Principal Eigenvalue Regularization for Improved Worst-Class Certified Robustness of Smoothed Classifiers
Gaojie Jin
Tianjin Huang
Ronghui Mu
Xiaowei Huang
AAML
77
0
0
21 Mar 2025
Robustness of deep learning classification to adversarial input on GPUs: asynchronous parallel accumulation is a source of vulnerability
Sanjif Shanmugavelu
Mathieu Taillefumier
Christopher Culver
Vijay Ganesh
Oscar Hernandez
Ada Sedova
AAML
59
1
0
21 Mar 2025
Beyond Accuracy: What Matters in Designing Well-Behaved Models?
Robin Hesse
Doğukan Bağcı
Bernt Schiele
Simone Schaub-Meyer
Stefan Roth
VLM
112
0
0
21 Mar 2025
SpiLiFormer: Enhancing Spiking Transformers with Lateral Inhibition
Zeqi Zheng
Yanchen Huang
Yingchao Yu
Zizheng Zhu
Junfeng Tang
Zhaofei Yu
Yaochu Jin
88
0
0
20 Mar 2025
REVAL: A Comprehension Evaluation on Reliability and Values of Large Vision-Language Models
Jie M. Zhang
Zheng Yuan
Ziyi Wang
Bei Yan
Sibo Wang
Xiangkui Cao
Zonghui Guo
Shiguang Shan
Xilin Chen
ELM
137
0
0
20 Mar 2025
Rethinking Robustness in Machine Learning: A Posterior Agreement Approach
João B. S. Carvalho
Alessandro Torcinovich
Victor Jimenez Rodriguez
Antonio Emanuele Cinà
Carlos Cotrini
Lea Schönherr
J. M. Buhmann
OOD
111
0
0
20 Mar 2025
Narrowing Class-Wise Robustness Gaps in Adversarial Training
Fatemeh Amerehi
Patrick Healy
101
0
0
20 Mar 2025
On the Robustness Tradeoff in Fine-Tuning
Kunyang Li
Jean-Charles Noirot Ferrand
Ryan Sheatsley
Blaine Hoak
Yohan Beugin
Eric Pauley
Patrick McDaniel
91
0
0
19 Mar 2025
Unified Enhancement of the Generalization and Robustness of Language Models via Bi-Stage Optimization
Yizhou Sun
Juan Yin
Juan Zhao
Fan Zhang
Yongheng Liu
Hongji Chen
62
0
0
19 Mar 2025
AIGVE-Tool: AI-Generated Video Evaluation Toolkit with Multifaceted Benchmark
Xinhao Xiang
Xiao Liu
Zizhong Li
Zhuosheng Liu
Jiawei Zhang
91
0
0
18 Mar 2025
Aligning Multimodal LLM with Human Preference: A Survey
Tao Yu
Yize Zhang
Chaoyou Fu
Junkang Wu
Jinda Lu
...
Qingsong Wen
Zheng Zhang
Yan Huang
Liang Wang
Tieniu Tan
439
4
0
18 Mar 2025
LipShiFT: A Certifiably Robust Shift-based Vision Transformer
Rohan Menon
Nicola Franco
Stephan Günnemann
80
0
0
18 Mar 2025
TarPro: Targeted Protection against Malicious Image Editing
Kaixin Shen
Ruijie Quan
Jiaxu Miao
Jun Xiao
Yi Yang
111
1
0
18 Mar 2025
RAT: Boosting Misclassification Detection Ability without Extra Data
Ge Yan
Tsui-Wei Weng
AAML
140
0
0
18 Mar 2025
Safeguarding LLM Embeddings in End-Cloud Collaboration via Entropy-Driven Perturbation
Shuaifan Jin
Xiaoyi Pang
Peng Kuang
He Wang
Jiacheng Du
Jiahui Hu
Kui Ren
SILM
AAML
132
0
0
17 Mar 2025
Improving Generalization of Universal Adversarial Perturbation via Dynamic Maximin Optimization
Yize Zhang
Yingzhe Xu
Junyu Shi
L. Zhang
Shengshan Hu
Minghui Li
Yanjun Zhang
AAML
139
2
0
17 Mar 2025
Securing Virtual Reality Experiences: Unveiling and Tackling Cybersickness Attacks with Explainable AI
Ripan Kumar Kundu
Matthew Denton
Genova Mongalo
Prasad Calyam
K. A. Hoque
AAML
84
0
0
17 Mar 2025
GSBA
K
^K
K
:
t
o
p
top
t
o
p
-
K
K
K
Geometric Score-based Black-box Attack
Md. Farhamdur Reza
Richeng Jin
Tianfu Wu
H. Dai
AAML
114
0
0
17 Mar 2025
Robust Dataset Distillation by Matching Adversarial Trajectories
Wei Lai
Tianyu Ding
ren dongdong
Lei Wang
Jing Huo
Yang Gao
Wenbin Li
AAML
DD
102
0
0
15 Mar 2025
Tit-for-Tat: Safeguarding Large Vision-Language Models Against Jailbreak Attacks via Adversarial Defense
Shuyang Hao
Yijiao Wang
Bryan Hooi
Ming Yang
Qingbin Liu
Chengcheng Tang
Zi Huang
Yujun Cai
AAML
97
1
0
14 Mar 2025
Are Deep Speech Denoising Models Robust to Adversarial Noise?
Will Schwarzer
Philip S. Thomas
Andrea Fanelli
Xiaoyu Liu
75
0
0
14 Mar 2025
Weakly Supervised Contrastive Adversarial Training for Learning Robust Features from Semi-supervised Data
Lilin Zhang
Chengpei Wu
Ning Yang
101
0
0
14 Mar 2025
Provenance Detection for AI-Generated Images: Combining Perceptual Hashing, Homomorphic Encryption, and AI Detection Models
Shree Singhi
Aayan Yadav
Aayush Gupta
Shariar Ebrahimi
Parisa Hassanizadeh
88
1
0
14 Mar 2025
Align in Depth: Defending Jailbreak Attacks via Progressive Answer Detoxification
Yingjie Zhang
Tong Liu
Zhe Zhao
Guozhu Meng
Kai Chen
AAML
105
1
0
14 Mar 2025
Stabilizing Quantization-Aware Training by Implicit-Regularization on Hessian Matrix
Junbiao Pang
Tianyang Cai
132
1
0
14 Mar 2025
Robustness Tokens: Towards Adversarial Robustness of Transformers
Brian Pulfer
Yury Belousov
S. Voloshynovskiy
AAML
85
0
0
13 Mar 2025
Identifying Trustworthiness Challenges in Deep Learning Models for Continental-Scale Water Quality Prediction
Xiaobo Xia
Xiaofeng Liu
Jiale Liu
K. Fang
Lu Lu
Samet Oymak
William S. Currie
Tongliang Liu
130
0
0
13 Mar 2025
Enhancing Facial Privacy Protection via Weakening Diffusion Purification
Ali Salar
Qing Liu
Yingli Tian
Guoying Zhao
DiffM
92
0
0
13 Mar 2025
Attacking Multimodal OS Agents with Malicious Image Patches
Lukas Aichberger
Alasdair Paren
Y. Gal
Philip Torr
Adel Bibi
AAML
121
5
0
13 Mar 2025
Learning Interpretable Logic Rules from Deep Vision Models
Chuqin Geng
Yuhe Jiang
Ziyu Zhao
Haolin Ye
Zhaoyue Wang
X. Si
NAI
FAtt
VLM
112
1
0
13 Mar 2025
AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption
Joonsung Jeon
Woo Jae Kim
Suhyeon Ha
Sooel Son
Sung-eui Yoon
DiffM
AAML
144
2
0
13 Mar 2025
A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1
Zhaoyi Li
Xiaohan Zhao
Dong-Dong Wu
Jiacheng Cui
Zhiqiang Shen
AAML
VLM
144
3
0
13 Mar 2025
Revisiting Backdoor Attacks on Time Series Classification in the Frequency Domain
Yuanmin Huang
Mi Zhang
Zhaoxiang Wang
Wenxuan Li
Min Yang
AAML
AI4TS
100
1
0
12 Mar 2025
Enhancing Adversarial Example Detection Through Model Explanation
Qian Ma
Ziping Ye
AAML
100
0
0
12 Mar 2025
AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks
Jin Li
Ziqiang He
Anwei Luo
Jian-Fang Hu
Zhong Wang
Xiangui Kang
DiffM
123
0
0
12 Mar 2025
All Your Knowledge Belongs to Us: Stealing Knowledge Graphs via Reasoning APIs
Zhaohan Xi
93
0
0
12 Mar 2025
Seal Your Backdoor with Variational Defense
Ivan Sabolić
Matej Grcić
Sinisa Segvic
AAML
460
0
0
11 Mar 2025
Adv-CPG: A Customized Portrait Generation Framework with Facial Adversarial Attacks
Junying Wang
Hongyuan Zhang
Yuan Yuan
AAML
PICV
137
2
0
11 Mar 2025
Efficient and Accurate Estimation of Lipschitz Constants for Hybrid Quantum-Classical Decision Models
Sajjad Hashemian
Mohammad Saeed Arvenaghi
92
0
0
11 Mar 2025
Tangentially Aligned Integrated Gradients for User-Friendly Explanations
Lachlan Simpson
Federico Costanza
Kyle Millar
A. Cheng
Cheng-Chew Lim
Hong-Gunn Chew
FAtt
146
2
0
11 Mar 2025
FairDeFace: Evaluating the Fairness and Adversarial Robustness of Face Obfuscation Methods
Seyyed Mohammad Sadegh Moosavi Khorzooghi
Poojitha Thota
Mohit Singhal
Abolfazl Asudeh
Gautam Das
Shirin Nilizadeh
AAML
73
0
0
11 Mar 2025
Non-vacuous Generalization Bounds for Deep Neural Networks without any modification to the trained models
Khoat Than
Dat Phan
BDL
AAML
VLM
102
0
0
10 Mar 2025
Utilizing Jailbreak Probability to Attack and Safeguard Multimodal LLMs
Wenzhuo Xu
Zhipeng Wei
Xiongtao Sun
Deyue Zhang
Dongdong Yang
Quanchen Zou
Xinming Zhang
AAML
90
0
0
10 Mar 2025
Previous
1
2
3
...
5
6
7
...
131
132
133
Next