Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.06083
Cited By
v1
v2
v3
v4 (latest)
Towards Deep Learning Models Resistant to Adversarial Attacks
19 June 2017
Aleksander Madry
Aleksandar Makelov
Ludwig Schmidt
Dimitris Tsipras
Adrian Vladu
SILM
OOD
Re-assign community
ArXiv (abs)
PDF
HTML
Github (752★)
Papers citing
"Towards Deep Learning Models Resistant to Adversarial Attacks"
50 / 6,613 papers shown
Title
MEAT: Median-Ensemble Adversarial Training for Improving Robustness and Generalization
Zhaozhe Hu
Jia-Li Yin
Bin Chen
Luojun Lin
Bo-Hao Chen
Ximeng Liu
AAML
132
0
0
20 Jun 2024
Enhancing robustness of data-driven SHM models: adversarial training with circle loss
Xiangli Yang
Xijie Deng
Hanwei Zhang
Yang Zou
Jianxi Yang
AAML
62
0
0
20 Jun 2024
Exploring Layerwise Adversarial Robustness Through the Lens of t-SNE
Inês Valentim
Nuno Antunes
Nuno Lourenço
AAML
71
1
0
20 Jun 2024
Elliptical Attention
Stefan K. Nielsen
Laziz U. Abdullaev
R. Teo
Tan M. Nguyen
87
4
0
19 Jun 2024
Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis
R. Teo
Tan M. Nguyen
91
4
0
19 Jun 2024
Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators
Matéo Mahaut
Laura Aina
Paula Czarnowska
Momchil Hardalov
Thomas Müller
Lluís Marquez
HILM
99
24
0
19 Jun 2024
Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens
Xikang Yang
Xuehai Tang
Fuqing Zhu
Jizhong Han
Songlin Hu
VLM
AAML
79
1
0
19 Jun 2024
Large-Scale Dataset Pruning in Adversarial Training through Data Importance Extrapolation
Bjorn Nieth
Thomas Altstidl
Leo Schwinn
Björn Eskofier
AAML
109
3
0
19 Jun 2024
Towards Trustworthy Unsupervised Domain Adaptation: A Representation Learning Perspective for Enhancing Robustness, Discrimination, and Generalization
Jia-Li Yin
Haoyuan Zheng
Ximeng Liu
AAML
70
0
0
19 Jun 2024
MaskPure: Improving Defense Against Text Adversaries with Stochastic Purification
Harrison Gietz
Jugal Kalita
AAML
63
1
0
18 Jun 2024
Harmonizing Feature Maps: A Graph Convolutional Approach for Enhancing Adversarial Robustness
Kejia Zhang
Juanjuan Weng
Junwei Wu
Guoqing Yang
Shaozi Li
Shaozi Li
AAML
100
1
0
17 Jun 2024
Adversaries With Incentives: A Strategic Alternative to Adversarial Robustness
Maayan Ehrenberg
Roy Ganz
Nir Rosenfeld
AAML
129
1
0
17 Jun 2024
Adversarial Style Augmentation via Large Language Model for Robust Fake News Detection
Sungwon Park
Sungwon Han
Xing Xie
Jae-Gil Lee
Meeyoung Cha
158
1
0
17 Jun 2024
HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model
Di Wang
Meiqi Hu
Yao Jin
Yuchun Miao
Jiaqi Yang
...
Lefei Zhang
Chen Wu
Di Lin
Dacheng Tao
Liangpei Zhang
164
27
0
17 Jun 2024
Improving Adversarial Robustness via Decoupled Visual Representation Masking
Decheng Liu
Tao Chen
Chunlei Peng
Nannan Wang
Ruimin Hu
Xinbo Gao
AAML
73
1
0
16 Jun 2024
Imperceptible Face Forgery Attack via Adversarial Semantic Mask
Decheng Liu
Qixuan Su
Chunlei Peng
Nannan Wang
Xinbo Gao
AAML
88
1
0
16 Jun 2024
IG2: Integrated Gradient on Iterative Gradient Path for Feature Attribution
Yue Zhuo
Zhiqiang Ge
62
9
0
16 Jun 2024
NBA: defensive distillation for backdoor removal via neural behavior alignment
Zonghao Ying
Bin Wu
AAML
55
10
0
16 Jun 2024
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models
Carson E. Denison
M. MacDiarmid
Fazl Barez
David Duvenaud
Shauna Kravec
...
Jared Kaplan
Buck Shlegeris
Samuel R. Bowman
Ethan Perez
Evan Hubinger
132
44
0
14 Jun 2024
Over-parameterization and Adversarial Robustness in Neural Networks: An Overview and Empirical Analysis
Zhang Chen
Christian Scano
Srishti Gupta
Xiaoyi Feng
Zhaoqiang Xia
...
Maura Pintor
Luca Oneto
Ambra Demontis
Battista Biggio
Fabio Roli
AAML
92
2
0
14 Jun 2024
PID: Prompt-Independent Data Protection Against Latent Diffusion Models
Ang Li
Yichuan Mo
Mingjie Li
Yisen Wang
AAML
79
2
0
14 Jun 2024
Watch the Watcher! Backdoor Attacks on Security-Enhancing Diffusion Models
Changjiang Li
Ren Pang
Bochuan Cao
Jinghui Chen
Fenglong Ma
Shouling Ji
Ting Wang
DiffM
73
4
0
14 Jun 2024
Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs
Zhao Xu
Fan Liu
Hao Liu
AAML
126
16
0
13 Jun 2024
MirrorCheck: Efficient Adversarial Defense for Vision-Language Models
Samar Fares
Klea Ziu
Toluwani Aremu
Nikita Durasov
Martin Takáč
Pascal Fua
Karthik Nandakumar
Ivan Laptev
VLM
AAML
99
5
0
13 Jun 2024
Improving Adversarial Robustness via Feature Pattern Consistency Constraint
Jiacong Hu
Jingwen Ye
Zunlei Feng
Jiazhen Yang
Shunyu Liu
Xiaotian Yu
Lingxiang Jia
Mingli Song
AAML
88
2
0
13 Jun 2024
On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models
H. Malik
Numan Saeed
Asif Hanif
Muzammal Naseer
Mohammad Yaqub
Salman Khan
Fahad Shahbaz Khan
110
1
0
12 Jun 2024
WMAdapter: Adding WaterMark Control to Latent Diffusion Models
Hai Ci
Yiren Song
Pei Yang
Jinheng Xie
Mike Zheng Shou
WIGM
68
15
0
12 Jun 2024
Genetic Column Generation for Computing Lower Bounds for Adversarial Classification
Maximilian Penka
57
0
0
12 Jun 2024
AdaNCA: Neural Cellular Automata As Adaptors For More Robust Vision Transformer
Yitao Xu
Tong Zhang
Sabine Süsstrunk
ViT
86
1
0
12 Jun 2024
Decoupling the Class Label and the Target Concept in Machine Unlearning
Jianing Zhu
Bo Han
Jiangchao Yao
Jianliang Xu
Gang Niu
Masashi Sugiyama
CLL
MU
62
4
0
12 Jun 2024
Adversarial Patch for 3D Local Feature Extractor
Yu Wen Pao
Li Chang Lai
Hong-Yi Lin
AAML
33
0
0
12 Jun 2024
I Don't Know You, But I Can Catch You: Real-Time Defense against Diverse Adversarial Patches for Object Detectors
Zijin Lin
Yue Zhao
Kai Chen
Jinwen He
AAML
65
1
0
12 Jun 2024
Understanding Visual Concepts Across Models
Brandon Trabucco
Max Gurinas
Kyle Doherty
Ruslan Salakhutdinov
VLM
70
0
0
11 Jun 2024
Merging Improves Self-Critique Against Jailbreak Attacks
Victor Gallego
AAML
MoMe
93
4
0
11 Jun 2024
Failures Are Fated, But Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-Scale Vision and Language Models
Som Sagar
Aditya Taparia
Ransalu Senanayake
91
10
0
11 Jun 2024
Fast White-Box Adversarial Streaming Without a Random Oracle
Ying Feng
Aayush Jain
David P. Woodruff
AAML
82
1
0
10 Jun 2024
Texture Re-scalable Universal Adversarial Perturbation
Yihao Huang
Qing Guo
Felix Juefei-Xu
Ming Hu
Xiaojun Jia
Xiaochun Cao
Geguang Pu
Yang Liu
AAML
82
8
0
10 Jun 2024
MeanSparse: Post-Training Robustness Enhancement Through Mean-Centered Feature Sparsification
Sajjad Amini
Mohammadreza Teymoorianfard
Shiqing Ma
Amir Houmansadr
OOD
AAML
96
10
0
09 Jun 2024
Stealthy Targeted Backdoor Attacks against Image Captioning
Wenshu Fan
Hongwei Li
Wenbo Jiang
Meng Hao
Shui Yu
Xiao Zhang
DiffM
68
6
0
09 Jun 2024
Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks
Zhiyuan Cheng
Cheng Han
James Liang
Qifan Wang
Xiangyu Zhang
Dongfang Liu
AAML
79
5
0
09 Jun 2024
ProFeAT: Projected Feature Adversarial Training for Self-Supervised Learning of Robust Representations
Sravanti Addepalli
Priyam Dey
R. Venkatesh Babu
96
0
0
09 Jun 2024
Perturbation Towards Easy Samples Improves Targeted Adversarial Transferability
Junqi Gao
Biqing Qi
Yao Li
Zhichang Guo
Dong Li
Yuming Xing
Dazhi Zhang
AAML
75
7
0
08 Jun 2024
Exploring Adversarial Robustness of Deep State Space Models
Biqing Qi
Yang Luo
Junqi Gao
Pengfei Li
Kai Tian
Zhiyuan Ma
Bowen Zhou
AAML
65
1
0
08 Jun 2024
Enhancing Adversarial Transferability via Information Bottleneck Constraints
Biqing Qi
Junqi Gao
Jianxing Liu
Ligang Wu
Bowen Zhou
AAML
71
2
0
08 Jun 2024
One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models
Hao Fang
Jiawei Kong
Wenbo Yu
Bin Chen
Jiawei Li
Hao Wu
Ke Xu
Ke Xu
AAML
VLM
133
14
0
08 Jun 2024
Compositional Curvature Bounds for Deep Neural Networks
Taha Entesari
Sina Sharifi
Mahyar Fazlyab
AAML
78
1
0
07 Jun 2024
Adversarial Tuning: Defending Against Jailbreak Attacks for LLMs
Fan Liu
Zhao Xu
Hao Liu
AAML
130
13
0
07 Jun 2024
The Price of Implicit Bias in Adversarially Robust Generalization
Nikolaos Tsilivis
Natalie Frank
Nathan Srebro
Julia Kempe
111
4
0
07 Jun 2024
HateDebias: On the Diversity and Variability of Hate Speech Debiasing
Nankai Lin
Hongyan Wu
Zhengming Chen
Zijian Li
Lianxi Wang
Shengyi Jiang
Dong Zhou
Aimin Yang
95
0
0
07 Jun 2024
A Survey of Fragile Model Watermarking
Zhenzhe Gao
Yu Cheng
Zhaoxia Yin
AAML
64
0
0
07 Jun 2024
Previous
1
2
3
...
18
19
20
...
131
132
133
Next