Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.00420
Cited By
v1
v2
v3
v4 (latest)
Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples
1 February 2018
Anish Athalye
Nicholas Carlini
D. Wagner
AAML
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples"
50 / 1,929 papers shown
Title
LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet
Nathaniel Li
Ziwen Han
Ian Steneker
Willow Primack
Riley Goodside
Hugh Zhang
Zifan Wang
Cristina Menghini
Summer Yue
AAML
MU
105
57
0
27 Aug 2024
Iterative Window Mean Filter: Thwarting Diffusion-based Adversarial Purification
Hanrui Wang
Ruoxi Sun
Cunjian Chen
Minhui Xue
Lay-Ki Soon
Shuo Wang
Zhe Jin
DiffM
AAML
92
2
0
20 Aug 2024
Robust Image Classification: Defensive Strategies against FGSM and PGD Adversarial Attacks
Hetvi Waghela
Jaydip Sen
Sneha Rakshit
AAML
29
5
0
20 Aug 2024
Criticality Leveraged Adversarial Training (CLAT) for Boosted Performance via Parameter Efficiency
Bhavna Gopal
Huanrui Yang
Jingyang Zhang
Mark Horton
Yiran Chen
AAML
90
0
0
19 Aug 2024
Classifier Guidance Enhances Diffusion-based Adversarial Purification by Preserving Predictive Information
Mingkun Zhang
Jianing Li
Wei Chen
Jiafeng Guo
Xueqi Cheng
96
6
0
12 Aug 2024
Adversarially Robust Industrial Anomaly Detection Through Diffusion Model
Yuanpu Cao
Lu Lin
Jinghui Chen
DiffM
78
1
0
09 Aug 2024
Label Augmentation for Neural Networks Robustness
Fatemeh Amerehi
Patrick Healy
AAML
90
1
0
04 Aug 2024
Deepfake Media Forensics: State of the Art and Challenges Ahead
Irene Amerini
Mauro Barni
Sebastiano Battiato
Paolo Bestagini
Giulia Boato
...
Davide Salvi
Stefano Tubaro
Claudia Melis Tonti
Massimo Villari
D. Vitulano
AAML
102
7
0
01 Aug 2024
OTAD: An Optimal Transport-Induced Robust Model for Agnostic Adversarial Attack
Kuo Gai
Sicong Wang
Shihua Zhang
AAML
83
0
0
01 Aug 2024
ADBM: Adversarial diffusion bridge model for reliable adversarial purification
Xiao-Li Li
Wenxuan Sun
Huanran Chen
Qiongxiu Li
Yining Liu
Yingzhe He
Jie Shi
Xiaolin Hu
AAML
173
12
0
01 Aug 2024
Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks
Yunfeng Diao
Na Zhai
Changtao Miao
Xun Yang
Meng Wang
Xun Yang
Meng Wang
AAML
162
2
0
30 Jul 2024
RSC-SNN: Exploring the Trade-off Between Adversarial Robustness and Accuracy in Spiking Neural Networks via Randomized Smoothing Coding
Keming Wu
Man Yao
Yuhong Chou
Xuerui Qiu
Rui Yang
Boxing Xu
Guoqi Li
AAML
62
4
0
29 Jul 2024
Adversarial Robustification via Text-to-Image Diffusion Models
Daewon Choi
Jongheon Jeong
Huiwon Jang
Jinwoo Shin
DiffM
111
2
0
26 Jul 2024
Sparse vs Contiguous Adversarial Pixel Perturbations in Multimodal Models: An Empirical Analysis
Cristian-Alexandru Botocan
Raphael Meier
Ljiljana Dolamic
AAML
61
0
0
25 Jul 2024
Representation Magnitude has a Liability to Privacy Vulnerability
Xingli Fang
Jung-Eun Kim
53
1
0
23 Jul 2024
Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders
Senthooran Rajamanoharan
Tom Lieberum
Nicolas Sonnerat
Arthur Conmy
Vikrant Varma
János Kramár
Neel Nanda
85
105
0
19 Jul 2024
Relaxing Graph Transformers for Adversarial Attacks
Philipp Foth
Lukas Gosch
Simon Geisler
Leo Schwinn
Stephan Günnemann
AAML
154
1
0
16 Jul 2024
PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition
Xiao-Li Li
Yining Liu
Na Dong
Sitian Qin
Xiaolin Hu
85
4
0
15 Jul 2024
Mitigating Low-Frequency Bias: Feature Recalibration and Frequency Attention Regularization for Adversarial Robustness
Kejia Zhang
Juanjuan Weng
Yuanzheng Cai
Zhiming Luo
Shaozi Li
AAML
177
0
0
04 Jul 2024
L
p
L_p
L
p
-norm Distortion-Efficient Adversarial Attack
Chao Zhou
Yuan-Gen Wang
Zi-Jia Wang
Xiangui Kang
72
0
0
03 Jul 2024
MALT Powers Up Adversarial Attacks
Odelia Melamed
Gilad Yehudai
Adi Shamir
AAML
51
0
0
02 Jul 2024
Query-Efficient Hard-Label Black-Box Attack against Vision Transformers
Chao Zhou
Xiaowen Shi
Yuan-Gen Wang
ViT
AAML
79
0
0
29 Jun 2024
Data-Driven Lipschitz Continuity: A Cost-Effective Approach to Improve Adversarial Robustness
Erh-Chung Chen
Pin-Yu Chen
I-Hsin Chung
Che-Rung Lee
80
3
0
28 Jun 2024
DataFreeShield: Defending Adversarial Attacks without Training Data
Hyeyoon Lee
Kanghyun Choi
Dain Kwon
Sunjong Park
Mayoore S. Jaiswal
Noseong Park
Jonghyun Choi
Jinho Lee
78
0
0
21 Jun 2024
Deciphering the Definition of Adversarial Robustness for post-hoc OOD Detectors
Peter Lorenz
Mario Fernandez
Jens Müller
Ullrich Kothe
AAML
244
1
0
21 Jun 2024
Improving Adversarial Robustness via Decoupled Visual Representation Masking
Decheng Liu
Tao Chen
Chunlei Peng
Nannan Wang
Ruimin Hu
Xinbo Gao
AAML
73
1
0
16 Jun 2024
Adaptive Randomized Smoothing: Certifying Multi-Step Defences against Adversarial Examples
Saiyue Lyu
Shadab Shaikh
Frederick Shpilevskiy
Evan Shelhamer
Mathias Lécuyer
AAML
65
0
0
14 Jun 2024
Over-parameterization and Adversarial Robustness in Neural Networks: An Overview and Empirical Analysis
Zhang Chen
Christian Scano
Srishti Gupta
Xiaoyi Feng
Zhaoqiang Xia
...
Maura Pintor
Luca Oneto
Ambra Demontis
Battista Biggio
Fabio Roli
AAML
87
2
0
14 Jun 2024
MirrorCheck: Efficient Adversarial Defense for Vision-Language Models
Samar Fares
Klea Ziu
Toluwani Aremu
Nikita Durasov
Martin Takáč
Pascal Fua
Karthik Nandakumar
Ivan Laptev
VLM
AAML
99
5
0
13 Jun 2024
Improving Adversarial Robustness via Feature Pattern Consistency Constraint
Jiacong Hu
Jingwen Ye
Zunlei Feng
Jiazhen Yang
Shunyu Liu
Xiaotian Yu
Lingxiang Jia
Mingli Song
AAML
84
2
0
13 Jun 2024
AdaNCA: Neural Cellular Automata As Adaptors For More Robust Vision Transformer
Yitao Xu
Tong Zhang
Sabine Süsstrunk
ViT
86
1
0
12 Jun 2024
ProFeAT: Projected Feature Adversarial Training for Self-Supervised Learning of Robust Representations
Sravanti Addepalli
Priyam Dey
R. Venkatesh Babu
91
0
0
09 Jun 2024
DMS: Addressing Information Loss with More Steps for Pragmatic Adversarial Attacks
Zhiyu Zhu
Jiayu Zhang
Xinyi Wang
Zhibo Jin
Huaming Chen
AAML
56
1
0
09 Jun 2024
The Price of Implicit Bias in Adversarially Robust Generalization
Nikolaos Tsilivis
Natalie Frank
Nathan Srebro
Julia Kempe
106
4
0
07 Jun 2024
CTBENCH: A Library and Benchmark for Certified Training
Yuhao Mao
Stefan Balauca
Martin Vechev
OOD
133
5
0
07 Jun 2024
ZeroPur: Succinct Training-Free Adversarial Purification
Xiuli Bi
Zonglin Yang
Bo Liu
Xiaodong Cun
Chi-Man Pun
121
0
0
05 Jun 2024
Effects of Exponential Gaussian Distribution on (Double Sampling) Randomized Smoothing
Youwei Shu
Xi Xiao
Derui Wang
Yuxin Cao
Siji Chen
Jason Xue
Linyi Li
Yue Liu
80
2
0
04 Jun 2024
Position: Cracking the Code of Cascading Disparity Towards Marginalized Communities
G. Farnadi
Mohammad Havaei
Negar Rostamzadeh
82
3
0
03 Jun 2024
Towards General Robustness Verification of MaxPool-based Convolutional Neural Networks via Tightening Linear Approximation
Yuan Xiao
Shiqing Ma
Juan Zhai
Chunrong Fang
Jinyuan Jia
Zhenyu Chen
AAML
82
1
0
02 Jun 2024
Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training
Jiacheng Zhang
Feng Liu
Dawei Zhou
Jingfeng Zhang
Tongliang Liu
AAML
63
4
0
02 Jun 2024
Certifying Global Robustness for Deep Neural Networks
You Li
Guannan Zhao
Shuyu Kong
Yunqi He
Hai Zhou
AAML
56
0
0
31 May 2024
AI Risk Management Should Incorporate Both Safety and Security
Xiangyu Qi
Yangsibo Huang
Yi Zeng
Edoardo Debenedetti
Jonas Geiping
...
Chaowei Xiao
Yue Liu
Dawn Song
Peter Henderson
Prateek Mittal
AAML
117
12
0
29 May 2024
Spectral regularization for adversarially-robust representation learning
Sheng Yang
Jacob A. Zavatone-Veth
Cengiz Pehlevan
AAML
OOD
110
0
0
27 May 2024
Layer-Aware Analysis of Catastrophic Overfitting: Revealing the Pseudo-Robust Shortcut Dependency
Runqi Lin
Chaojian Yu
Bo Han
Hang Su
Tongliang Liu
AAML
128
4
0
25 May 2024
Robust width: A lightweight and certifiable adversarial defense
Jonathan Peck
Bart Goossens
AAML
76
2
0
24 May 2024
Can Implicit Bias Imply Adversarial Robustness?
Hancheng Min
Rene Vidal
90
3
0
24 May 2024
Certifiably Robust RAG against Retrieval Corruption
Chong Xiang
Tong Wu
Zexuan Zhong
David Wagner
Danqi Chen
Prateek Mittal
SILM
99
58
0
24 May 2024
Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models
Yimeng Zhang
Xin Chen
Jinghan Jia
Yihua Zhang
Chongyu Fan
Jiancheng Liu
Mingyi Hong
Ke Ding
Sijia Liu
DiffM
119
68
0
24 May 2024
Eidos: Efficient, Imperceptible Adversarial 3D Point Clouds
Hanwei Zhang
Luo Cheng
Qisong He
Wei Huang
Renjue Li
R. Sicre
Xiaowei Huang
Holger Hermanns
Lijun Zhang
AAML
64
1
0
23 May 2024
Robust Deep Reinforcement Learning with Adaptive Adversarial Perturbations in Action Space
Qian Liu
Yufei Kuang
Jie Wang
AAML
42
2
0
20 May 2024
Previous
1
2
3
4
5
6
...
37
38
39
Next