2206.12654
BackdoorBench: A Comprehensive Benchmark of Backdoor Learning
25 June 2022
Baoyuan Wu
Hongrui Chen
Ruotong Wang
Zihao Zhu
Shaokui Wei
Danni Yuan
Chaoxiao Shen
ELM
AAML
Papers citing
"BackdoorBench: A Comprehensive Benchmark of Backdoor Learning"
50 / 91 papers shown
Title
Circumventing Backdoor Space via Weight Symmetry
Jie Peng
Hongwei Yang
Jing Zhao
Hengji Dong
Hui He
Weizhe Zhang
Haoyu He
AAML
17
0
0
09 Jun 2025
Variance-Based Defense Against Blended Backdoor Attacks
Sujeevan Aseervatham
Achraf Kerzazi
Younès Bennani
AAML
65
0
0
02 Jun 2025
Trojan Horse Hunt in Time Series Forecasting for Space Operations
Krzysztof Kotowski
Ramez Shendy
J. Nalepa
P. Biecek
Piotr Wilczyński
Agata Kaczmarek
Dawid Płudowski
Artur Janicki
Evridiki Vasileia Ntagiou
72
0
0
02 Jun 2025
Wolf Hidden in Sheep's Conversations: Toward Harmless Data-Based Backdoor Attacks for Jailbreaking Large Language Models
Jiawei Kong
Hao Fang
Xiaochen Yang
Kuofeng Gao
Bin Chen
Shu-Tao Xia
Yaowei Wang
Min Zhang
AAML
74
0
0
23 May 2025
Where the Devil Hides: Deepfake Detectors Can No Longer Be Trusted
Shuaiwei Yuan
Junyu Dong
Yuezun Li
AAML
117
0
0
13 May 2025
MergeGuard: Efficient Thwarting of Trojan Attacks in Machine Learning Models
Soheil Zibakhsh Shabgahi
Yaman Jandali
F. Koushanfar
MoMe
AAML
106
0
0
06 May 2025
The Ultimate Cookbook for Invisible Poison: Crafting Subtle Clean-Label Text Backdoors with Style Attributes
Wencong You
Daniel Lowd
97
0
0
24 Apr 2025
Prototype Guided Backdoor Defense
Venkat Adithya Amula
Sunayana Samavedam
Saurabh Saini
Avani Gupta
Narayanan P J
AAML
78
0
0
26 Mar 2025
BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models
Zenghui Yuan
Jiawen Shi
Pan Zhou
Neil Zhenqiang Gong
Lichao Sun
AAML
163
3
0
20 Mar 2025
Class-Conditional Neural Polarizer: A Lightweight and Effective Backdoor Defense by Purifying Poisoned Features
Mingli Zhu
Shaokui Wei
Hongyuan Zha
Baoyuan Wu
AAML
126
0
0
23 Feb 2025
BackdoorDM: A Comprehensive Benchmark for Backdoor Learning in Diffusion Model
Weilin Lin
Nanjun Zhou
Yijiao Wang
Jianze Li
Hui Xiong
Li Liu
AAML
DiffM
446
1
0
17 Feb 2025
Scanning Trojaned Models Using Out-of-Distribution Samples
Hossein Mirzaei
Ali Ansari
Bahar Dibaei Nia
Mojtaba Nafez
Moein Madadi
...
Kian Shamsaie
Mahdi Hajialilue
Jafar Habibi
Mohammad Sabokrou
M. Rohban
OODD
147
3
0
28 Jan 2025
Cut the Deadwood Out: Post-Training Model Purification with Selective Module Substitution
Yao Tong
Weijun Li
Xuanli He
Haolan Zhan
Xingliang Yuan
AAML
92
1
0
31 Dec 2024
Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retraining
Zongru Wu
Pengzhou Cheng
Lingyong Fang
Zhuosheng Zhang
Gongshen Liu
AAML
SILM
131
1
0
03 Dec 2024
Reliable Poisoned Sample Detection against Backdoor Attacks Enhanced by Sharpness Aware Minimization
Ruotong Wang
Mingli Zhu
Zihao Zhu
Baoyuan Wu
AAML
156
2
0
18 Nov 2024
BackdoorMBTI: A Backdoor Learning Multimodal Benchmark Tool Kit for Backdoor Defense Evaluation
Haiyang Yu
Tian Xie
Jiaping Gui
Pengyang Wang
P. Yi
Yue Wu
134
1
0
17 Nov 2024
Expose Before You Defend: Unifying and Enhancing Backdoor Defenses via Exposed Models
Yige Li
Hanxun Huang
Jiaming Zhang
Xingjun Ma
Yu-Gang Jiang
AAML
66
2
0
25 Oct 2024
Mitigating the Backdoor Effect for Multi-Task Model Merging via Safety-Aware Subspace
Jinluan Yang
Anke Tang
Didi Zhu
Zhengyu Chen
Li Shen
Leilei Gan
MoMe
AAML
166
7
0
17 Oct 2024
Long-Tailed Backdoor Attack Using Dynamic Data Augmentation Operations
Lu Pang
Tao Sun
Weimin Lyu
Haibin Ling
Chong Chen
AAML
68
0
0
16 Oct 2024
Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor Defense
Rui Min
Zeyu Qin
Nevin L. Zhang
Li Shen
Minhao Cheng
AAML
91
4
0
13 Oct 2024
PAD-FT: A Lightweight Defense for Backdoor Attacks via Data Purification and Fine-Tuning
Yukai Xu
Yujie Gu
Kouichi Sakurai
AAML
46
0
0
18 Sep 2024
TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors
Yichuan Mo
Hui Huang
Mingjie Li
Ang Li
Yisen Wang
AAML
DiffM
87
16
0
09 Sep 2024
Understanding Data Importance in Machine Learning Attacks: Does Valuable Data Pose Greater Harm?
Rui Wen
Michael Backes
Yang Zhang
TDI
AAML
85
2
0
05 Sep 2024
Fisher Information guided Purification against Backdoor Attacks
Nazmul Karim
Abdullah Al Arafat
Adnan Siraj Rakin
Zhishan Guo
Nazanin Rahnavard
AAML
117
2
0
01 Sep 2024
Rethinking Backdoor Detection Evaluation for Language Models
Jun Yan
Wenjie Jacky Mo
Xiang Ren
Robin Jia
ELM
106
3
0
31 Aug 2024
Fusing Pruned and Backdoored Models: Optimal Transport-based Data-free Backdoor Mitigation
Weilin Lin
Li Liu
Jianze Li
Hui Xiong
AAML
109
1
0
28 Aug 2024
BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models
Yige Li
Hanxun Huang
Yunhan Zhao
Xingjun Ma
Jun Sun
AAML
SILM
113
1
0
23 Aug 2024
PADetBench: Towards Benchmarking Physical Attacks against Object Detection
Jiawei Lian
Jianhong Pan
L. Wang
Yi Wang
Lap-Pui Chau
Shaohui Mei
AAML
106
0
0
17 Aug 2024
Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning
Xiaoting Lyu
Yufei Han
Wei Wang
Jingkai Liu
Yongsheng Zhu
Guangquan Xu
Jiqiang Liu
Xiangliang Zhang
AAML
FedML
105
7
0
10 Jun 2024
Unveiling and Mitigating Backdoor Vulnerabilities based on Unlearning Weight Changes and Backdoor Activeness
Weilin Lin
Li Liu
Shaokui Wei
Jianze Li
Hui Xiong
AAML
97
2
0
30 May 2024
Breaking the False Sense of Security in Backdoor Defense through Re-Activation Attack
Mingli Zhu
Siyuan Liang
Baoyuan Wu
AAML
122
18
0
25 May 2024
Mitigating Backdoor Attack by Injecting Proactive Defensive Backdoor
Shaokui Wei
Hongyuan Zha
Baoyuan Wu
AAML
99
3
0
25 May 2024
Unified Neural Backdoor Removal with Only Few Clean Samples through Unlearning and Relearning
Nay Myat Min
Long H. Pham
Jun Sun
MU
AAML
122
0
0
23 May 2024
Nearest is Not Dearest: Towards Practical Defense against Quantization-conditioned Backdoor Attacks
Boheng Li
Yishuo Cai
Haowei Li
Feng Xue
Zhifeng Li
Yiming Li
MQ
AAML
89
21
0
21 May 2024
Towards Robust Physical-world Backdoor Attacks on Lane Detection
Xinwei Zhang
Aishan Liu
Tianyuan Zhang
Siyuan Liang
Xianglong Liu
AAML
125
13
0
09 May 2024
Unlearning Backdoor Attacks through Gradient-Based Model Pruning
Kealan Dunnett
Reza Arablouei
Dimity Miller
Volkan Dedeoglu
Raja Jurdak
AAML
80
1
0
07 May 2024
Mechanistic Interpretability for AI Safety -- A Review
Leonard Bereska
E. Gavves
AI4CE
139
158
0
22 Apr 2024
Reliable Model Watermarking: Defending Against Theft without Compromising on Evasion
Markus Frey
Sichu Liang
Wentao Hu
Matthias Nau
Ju Jia
Shilin Wang
AAML
89
4
0
21 Apr 2024
Defending Against Unforeseen Failure Modes with Latent Adversarial Training
Stephen Casper
Lennart Schulze
Oam Patel
Dylan Hadfield-Menell
AAML
124
40
0
08 Mar 2024
A general approach to enhance the survivability of backdoor attacks by decision path coupling
Yufei Zhao
Dingji Wang
Bihuan Chen
Ziqian Chen
Xin Peng
AAML
76
0
0
05 Mar 2024
Here's a Free Lunch: Sanitizing Backdoored Models with Model Merge
Ansh Arora
Xuanli He
Maximilian Mozes
Srinibas Swain
Mark Dras
Xingliang Yuan
SILM
MoMe
AAML
121
14
0
29 Feb 2024
Model X-ray: Detect Backdoored Models via Decision Boundary
Yanghao Su
Jie Zhang
Ting Xu
Tianwei Zhang
Weiming Zhang
Neng H. Yu
AAML
161
1
0
27 Feb 2024
On the (In)feasibility of ML Backdoor Detection as an Hypothesis Testing Problem
Georg Pichler
Marco Romanelli
Divya Prakash Manivannan
Prashanth Krishnamurthy
Farshad Khorrami
Siddharth Garg
62
3
0
26 Feb 2024
VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models
Jiawei Liang
Siyuan Liang
Man Luo
Aishan Liu
Dongchen Han
Ee-Chien Chang
Xiaochun Cao
105
47
0
21 Feb 2024
Testing autonomous vehicles and AI: perspectives and challenges from cybersecurity, transparency, robustness and fairness
David Fernández Llorca
Ronan Hamon
Henrik Junklewitz
Kathrin Grosse
Lars Kunze
...
Nick Reed
Alexandre Alahi
Emilia Gómez
Ignacio E. Sánchez
Á. Kriston
113
5
0
21 Feb 2024
Acquiring Clean Language Models from Backdoor Poisoned Datasets by Downscaling Frequency Space
Zongru Wu
Zhuosheng Zhang
Pengzhou Cheng
Gongshen Liu
AAML
125
6
0
19 Feb 2024
Poisoned Forgery Face: Towards Backdoor Attacks on Face Forgery Detection
Jiawei Liang
Siyuan Liang
Aishan Liu
Xiaojun Jia
Junhao Kuang
Xiaochun Cao
AAML
65
24
0
18 Feb 2024
BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning
Baoyuan Wu
Hongrui Chen
Ruotong Wang
Zihao Zhu
Shaokui Wei
Danni Yuan
Mingli Zhu
Ke Xu
Li Liu
Chaoxiao Shen
AAML
ELM
131
11
0
26 Jan 2024
Black-Box Access is Insufficient for Rigorous AI Audits
Stephen Casper
Carson Ezell
Charlotte Siegmann
Noam Kolt
Taylor Lynn Curtis
...
Michael Gerovitch
David Bau
Max Tegmark
David M. Krueger
Dylan Hadfield-Menell
AAML
154
95
0
25 Jan 2024
WPDA: Frequency-based Backdoor Attack with Wavelet Packet Decomposition
Zhengyao Song
Yongqiang Li
Danni Yuan
Li Liu
Shaokui Wei
Baoyuan Wu
AAML
92
4
0
24 Jan 2024