1706.06083
Cited By
Towards Deep Learning Models Resistant to Adversarial Attacks
19 June 2017
Aleksander Madry
Aleksandar Makelov
Ludwig Schmidt
Dimitris Tsipras
Adrian Vladu
SILM
OOD
ArXiv (abs)
PDF
HTML
Github (752★)
Papers citing
"Towards Deep Learning Models Resistant to Adversarial Attacks"
50 / 6,612 papers shown
Title
Understanding Pre-training and Fine-tuning from Loss Landscape Perspectives
Huanran Chen
Yinpeng Dong
Zeming Wei
Yao Huang
Yichi Zhang
Hang Su
Jun Zhu
MoMe
92
1
0
23 May 2025
Towards more transferable adversarial attack in black-box manner
Chun Tong Lei
Zhongliang Guo
Hon Chung Lee
Minh Quoc Duong
Chun Pong Lau
DiffM
AAML
513
0
0
23 May 2025
Temporal Consistency Constrained Transferable Adversarial Attacks with Background Mixup for Action Recognition
Ping Li
Jianan Ni
Bo Pang
AAML
250
0
0
23 May 2025
Adversarial Robustness of Nonparametric Regression
Parsa Moradi
Hanzaleh Akabrinodehi
M. Maddah-ali
AAML
76
0
0
23 May 2025
MixAT: Combining Continuous and Discrete Adversarial Training for LLMs
Csaba Dékány
Stefan Balauca
Robin Staab
Dimitar I. Dimitrov
Martin Vechev
AAML
55
0
0
22 May 2025
When Safety Detectors Aren't Enough: A Stealthy and Effective Jailbreak Attack on LLMs via Steganographic Techniques
Jianing Geng
Biao Yi
Zekun Fei
Tongxi Wu
Lihai Nie
Zheli Liu
AAML
40
0
0
22 May 2025
TRAIL: Transferable Robust Adversarial Images via Latent diffusion
Yuhao Xue
Zhifei Zhang
Xinyang Jiang
Yifei Shen
Junyao Gao
Wentao Gu
Jiale Zhao
Miaojing Shi
Cairong Zhao
AAML
67
0
0
22 May 2025
SuperPure: Efficient Purification of Localized and Distributed Adversarial Patches via Super-Resolution GAN Models
Hossein Khalili
Seongbin Park
Venkat Bollapragada
Nader Sehatbakhsh
AAML
220
0
0
22 May 2025
Accidental Misalignment: Fine-Tuning Language Models Induces Unexpected Vulnerability
Punya Syon Pandey
Samuel Simko
Kellin Pelrine
Zhijing Jin
AAML
52
0
0
22 May 2025
GAMA++: Disentangled Geometric Alignment with Adaptive Contrastive Perturbation for Reliable Domain Transfer
Kim Yun
Hana Satou
F Monkey
68
0
0
21 May 2025
My Face Is Mine, Not Yours: Facial Protection Against Diffusion Model Face Swapping
Hon Ming Yam
Zhongliang Guo
Chun Pong Lau
DiffM
AAML
60
0
0
21 May 2025
Enhancing Certified Robustness via Block Reflector Orthogonal Layers and Logit Annealing Loss
Bo-Han Lai
Pin-Han Huang
Bo-Han Kung
Shang-Tse Chen
70
0
0
21 May 2025
GAMA: Geometry-Aware Manifold Alignment via Structured Adversarial Perturbations for Robust Domain Adaptation
Hana Satou
F Monkey
70
0
0
21 May 2025
Geometrically Regularized Transfer Learning with On-Manifold and Off-Manifold Perturbation
Hana Satou
Alan Mitkiy
F Monkey
AAML
59
0
0
21 May 2025
Beyond Classification: Evaluating Diffusion Denoised Smoothing for Security-Utility Trade off
Yury Belousov
Brian Pulfer
Vitaliy Kinakh
Slava Voloshynovskiy
DiffM
51
0
0
21 May 2025
Few-Shot Adversarial Low-Rank Fine-Tuning of Vision-Language Models
Sajjad Ghiasvand
Haniyeh Ehsani Oskouie
Mahnoosh Alizadeh
Ramtin Pedarsani
AAML
VLM
62
0
0
21 May 2025
AudioJailbreak: Jailbreak Attacks against End-to-End Large Audio-Language Models
Guangke Chen
Fu Song
Zhe Zhao
Xiaojun Jia
Yang Liu
Yanchen Qiao
Weizhe Zhang
AuLLM
AAML
113
1
0
20 May 2025
SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment
Wonje Jeung
Sangyeon Yoon
Minsuk Kahng
Albert No
LRM
LLMSV
198
1
0
20 May 2025
Safety2Drive: Safety-Critical Scenario Benchmark for the Evaluation of Autonomous Driving
Jingzheng Li
Tiancheng Wang
Xingyu Peng
Jiasi Chen
Zhijun Chen
Bing Li
Xianglong Liu
ELM
78
0
0
20 May 2025
Adversarially Pretrained Transformers may be Universally Robust In-Context Learners
Soichiro Kumano
Hiroshi Kera
Toshihiko Yamasaki
AAML
127
0
0
20 May 2025
Symmetry-Breaking Descent for Invariant Cost Functionals
Mikhail Osipov
64
0
0
19 May 2025
Spiking Neural Network: a low power solution for physical layer authentication
Jung Hoon Lee
Sujith Vijayan
68
0
0
19 May 2025
Causality-Inspired Robustness for Nonlinear Models via Representation Learning
Marin Šola
Peter Bühlmann
Xinwei Shen
OOD
91
0
0
19 May 2025
Benchmarking Unified Face Attack Detection via Hierarchical Prompt Tuning
Ajian Liu
Haocheng Yuan
Xiao Guo
Hui Ma
Wanyi Zhuang
...
Yanyan Liang
Weiqiang Wang
Jun Wan
Xiaoming Liu
Zhen Lei
AAML
CVBM
81
0
0
19 May 2025
Two out of Three (ToT): using self-consistency to make robust predictions
Jung Hoon Lee
Sujith Vijayan
OOD
64
0
0
19 May 2025
On the Mechanisms of Adversarial Data Augmentation for Robust and Adaptive Transfer Learning
Hana Satou
Alan Mitkiy
AAML
83
0
0
19 May 2025
Counter-Inferential Behavior in Natural and Artificial Cognitive Systems
Serge Dolgikh
67
0
0
19 May 2025
FlowPure: Continuous Normalizing Flows for Adversarial Purification
Elias Collaert
Abel Rodríguez
Sander Joos
Lieven Desmet
Vera Rimmer
AAML
67
0
0
19 May 2025
SPIRIT: Patching Speech Language Models against Jailbreak Attacks
Amirbek Djanibekov
Nurdaulet Mukhituly
Kentaro Inui
Hanan Aldarmaki
Nils Lukas
AAML
87
0
0
18 May 2025
Fixed Point Explainability
Emanuele La Malfa
Jon Vadillo
Marco Molinari
Michael Wooldridge
153
0
0
18 May 2025
Beyond Single-Point Judgment: Distribution Alignment for LLM-as-a-Judge
Luyu Chen
Zeyu Zhang
Haoran Tan
Quanyu Dai
Hao-ran Yang
Zhenhua Dong
Xu Chen
52
0
0
18 May 2025
Black-box Adversaries from Latent Space: Unnoticeable Attacks on Human Pose and Shape Estimation
Zhiying Li
Guanggang Geng
Yeying Jin
Zhizhi Guo
Bruce Gu
Jidong Huo
Zhaoxin Fan
Wenjun Wu
AAML
68
0
0
17 May 2025
EnvInjection: Environmental Prompt Injection Attack to Multi-modal Web Agents
Xilong Wang
John Bloch
Zedian Shao
Yuepeng Hu
Shuyan Zhou
Neil Zhenqiang Gong
AAML
LLMAG
108
0
0
16 May 2025
Anti-Sensing: Defense against Unauthorized Radar-based Human Vital Sign Sensing with Physically Realizable Wearable Oscillators
Md Farhan Tasnim Oshim
Nigel Doering
Bashima Islam
Tsui-Wei Weng
Tauhidur Rahman
43
0
0
16 May 2025
Adversarially Robust Spiking Neural Networks with Sparse Connectivity
Mathias Schmolli
Maximilian Baronig
Robert Legenstein
Ozan Özdenizci
AAML
45
0
0
16 May 2025
Adversarial Suffix Filtering: a Defense Pipeline for LLMs
David Khachaturov
Robert D. Mullins
AAML
69
0
0
14 May 2025
Evaluating the Robustness of Adversarial Defenses in Malware Detection Systems
Mostafa Jafari
Alireza Shameli-Sendi
AAML
51
0
0
14 May 2025
DArFace: Deformation Aware Robustness for Low Quality Face Recognition
Sadaf Gulshad
Abdullah Aldahlawi Thakaa
CVBM
105
0
0
13 May 2025
Visual Watermarking in the Era of Diffusion Models: Advances and Challenges
Junxian Duan
Jiyang Guan
Wenkui Yang
Ran He
WIGM
127
0
0
13 May 2025
Wasserstein Distributionally Robust Nonparametric Regression
Changyu Liu
Yuling Jiao
Junhui Wang
Jian Huang
OOD
69
0
0
12 May 2025
Convergence of Time-Averaged Mean Field Gradient Descent Dynamics for Continuous Multi-Player Zero-Sum Games
Yulong Lu
Pierre Monmarché
MLT
51
1
0
12 May 2025
A Formally Verified Robustness Certifier for Neural Networks (Extended Version)
James Tobler
Hira Taqdees Syeda
Toby Murray
AAML
58
0
0
11 May 2025
A stochastic gradient method for trilevel optimization
Tommaso Giovannelli
G. Kent
Luis Nunes Vicente
72
0
0
11 May 2025
TAROT: Towards Essentially Domain-Invariant Robustness with Theoretical Justification
Dongyoon Yang
Jihu Lee
Yongdai Kim
99
0
0
10 May 2025
Dynamic Domain Information Modulation Algorithm for Multi-domain Sentiment Analysis
Chunyi Yue
Ang Li
67
0
0
10 May 2025
Engineering Risk-Aware, Security-by-Design Frameworks for Assurance of Large-Scale Autonomous AI Models
Krti Tallam
61
2
0
09 May 2025
X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP
Hanxun Huang
Sarah Monazam Erfani
Yige Li
Xingjun Ma
James Bailey
AAML
155
1
0
08 May 2025
Revealing Weaknesses in Text Watermarking Through Self-Information Rewrite Attacks
Yixin Cheng
Hongcheng Guo
Yangming Li
Leonid Sigal
AAML
WaLM
223
1
0
08 May 2025
MTL-UE: Learning to Learn Nothing for Multi-Task Learning
Yi Yu
Song Xia
Siyuan Yang
Chenqi Kong
Wenhan Yang
Shijian Lu
Yap-Peng Tan
Alex Chichung Kot
134
1
0
08 May 2025
Input-Specific and Universal Adversarial Attack Generation for Spiking Neural Networks in the Spiking Domain
Spyridon Raptis
Haralampos-G. Stratigopoulos
AAML
72
0
0
07 May 2025