Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.13682
Cited By
Threats, Attacks, and Defenses in Machine Unlearning: A Survey
20 March 2024
Ziyao Liu
Huanyi Ye
Chen Chen
Yongsen Zheng
K. Lam
AAML
MU
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Threats, Attacks, and Defenses in Machine Unlearning: A Survey"
50 / 117 papers shown
Title
Precise In-Parameter Concept Erasure in Large Language Models
Yoav Gur-Arieh
Clara Suslik
Yihuai Hong
Fazl Barez
Mor Geva
KELM
MU
63
0
0
28 May 2025
Leveraging Per-Instance Privacy for Machine Unlearning
N. Sepahvand
Anvith Thudi
Berivan Isik
Ashmita Bhattacharyya
Nicolas Papernot
Eleni Triantafillou
Daniel M. Roy
Gintare Karolina Dziugaite
MU
FedML
19
0
0
24 May 2025
LLM Security: Vulnerabilities, Attacks, Defenses, and Countermeasures
Francisco Aguilera-Martínez
Fernando Berzal
PILM
88
0
0
02 May 2025
Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions
Yiming Du
Wenyu Huang
Danna Zheng
Zhaowei Wang
Sébastien Montella
Mirella Lapata
Kam-Fai Wong
Jeff Z. Pan
KELM
MU
154
3
0
01 May 2025
A Survey on Unlearnable Data
Jiahao Li
Yiqiang Chen
Yunbing Xing
Yang Gu
Xiangyuan Lan
AAML
75
0
0
30 Mar 2025
How Secure is Forgetting? Linking Machine Unlearning to Machine Learning Attacks
M. Prabhakaran
S. Nicolazzo
Antonino Nocera
Vinod Puthuvath
AAML
MU
108
0
0
26 Mar 2025
NoT: Federated Unlearning via Weight Negation
Yasser H. Khalil
Leo Maxime Brunswic
Soufiane Lamghari
Xu Li
Mahdi Beitollahi
Xi Chen
MU
103
2
0
07 Mar 2025
Go Beyond Your Means: Unlearning with Per-Sample Gradient Orthogonalization
Aviv Shamsian
E. Shaar
Aviv Navon
Gal Chechik
Ethan Fetaya
MU
110
0
0
04 Mar 2025
Model Tampering Attacks Enable More Rigorous Evaluations of LLM Capabilities
Zora Che
Stephen Casper
Robert Kirk
Anirudh Satheesh
Stewart Slocum
...
Zikui Cai
Bilal Chughtai
Y. Gal
Furong Huang
Dylan Hadfield-Menell
MU
AAML
ELM
113
3
0
03 Feb 2025
FedUHB: Accelerating Federated Unlearning via Polyak Heavy Ball Method
Yu Jiang
Chee Wei Tan
K. Lam
FedML
MU
66
1
0
17 Nov 2024
Survey of Security and Data Attacks on Machine Unlearning In Financial and E-Commerce
Carl E. J. Brodzinski
AAML
66
1
0
29 Sep 2024
Machine Unlearning in Generative AI: A Survey
Zheyuan Liu
Guangyao Dou
Zhaoxuan Tan
Yijun Tian
Meng Jiang
MU
77
16
0
30 Jul 2024
Knowledge Mechanisms in Large Language Models: A Survey and Perspective
Meng Wang
Yunzhi Yao
Ziwen Xu
Shuofei Qiao
Shumin Deng
...
Yong Jiang
Pengjun Xie
Fei Huang
Huajun Chen
Ningyu Zhang
81
31
0
22 Jul 2024
Releasing Malevolence from Benevolence: The Menace of Benign Data on Machine Unlearning
Binhao Ma
Tianhang Zheng
Hongsheng Hu
Di Wang
Shuo Wang
Zhongjie Ba
Zhan Qin
Kui Ren
AAML
65
3
0
06 Jul 2024
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models
Zhuoran Jin
Pengfei Cao
Chenhao Wang
Zhitao He
Hongbang Yuan
Jiachun Li
Yubo Chen
Kang Liu
Jun Zhao
KELM
MU
88
16
0
16 Jun 2024
A Survey on Machine Unlearning: Techniques and New Emerged Privacy Risks
Hengzhu Liu
Ping Xiong
Tianqing Zhu
Philip S. Yu
70
7
0
10 Jun 2024
Language Models Resist Alignment
Yalan Qin
Kaile Wang
Tianyi Qiu
Boyuan Chen
Jiayi Zhou
Changye Li
Hantao Lou
Yaodong Yang
62
1
0
10 Jun 2024
Guaranteeing Data Privacy in Federated Unlearning with Dynamic User Participation
Ziyao Liu
Yu Jiang
Weifeng Jiang
Jiale Guo
Jun Zhao
Kwok-Yan Lam
MU
FedML
86
6
0
03 Jun 2024
Reconstruction Attacks on Machine Unlearning: Simple Models are Vulnerable
Martín Bertrán
Shuai Tang
Michael Kearns
Jamie Morgenstern
Aaron Roth
Zhiwei Steven Wu
AAML
51
7
0
30 May 2024
Exploring Fairness in Educational Data Mining in the Context of the Right to be Forgotten
Wei Qian
Aobo Chen
Chenxu Zhao
Yangyi Li
Mengdi Huai
MU
75
1
0
27 May 2024
Erase to Enhance: Data-Efficient Machine Unlearning in MRI Reconstruction
Yuyang Xue
Jingshuai Liu
Jingyu Sun
Sotirios A. Tsaftaris
MU
46
2
0
24 May 2024
Machine Unlearning: A Comprehensive Survey
Weiqi Wang
Zhiyi Tian
Chenhan Zhang
Shui Yu
MU
AILaw
67
15
0
13 May 2024
Privacy-Preserving Federated Unlearning with Certified Client Removal
Ziyao Liu
Huanyi Ye
Yu Jiang
Jiyuan Shen
Jiale Guo
Ivan Tjuawinata
Kwok-Yan Lam
MU
52
5
0
15 Apr 2024
Unbridled Icarus: A Survey of the Potential Perils of Image Inputs in Multimodal Large Language Model Security
Yihe Fan
Yuxin Cao
Ziyu Zhao
Ziyao Liu
Shaofeng Li
45
13
0
08 Apr 2024
Learn What You Want to Unlearn: Unlearning Inversion Attacks against Machine Unlearning
Hongsheng Hu
Shuo Wang
Tian Dong
Minhui Xue
AAML
64
19
0
04 Apr 2024
Steganographic Passport: An Owner and User Verifiable Credential for Deep Model IP Protection Without Retraining
Qi Cui
Ruohan Meng
Chaohui Xu
Chip-Hong Chang
57
3
0
03 Apr 2024
LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model
Yuxin Cao
Jinghao Li
Xi Xiao
Derui Wang
Minhui Xue
Hao Ge
Wei Liu
Guangwu Hu
AAML
54
1
0
18 Mar 2024
Machine Unlearning: Taxonomy, Metrics, Applications, Challenges, and Prospects
Na Li
Chunyi Zhou
Yansong Gao
Hui Chen
Anmin Fu
Zhi-Li Zhang
Yu Shui
MU
52
9
0
13 Mar 2024
Guardrail Baselines for Unlearning in LLMs
Pratiksha Thaker
Yash Maurya
Shengyuan Hu
Zhiwei Steven Wu
Virginia Smith
MU
68
43
0
05 Mar 2024
The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Nathaniel Li
Alexander Pan
Anjali Gopal
Summer Yue
Daniel Berrios
...
Yan Shoshitaishvili
Jimmy Ba
K. Esvelt
Alexandr Wang
Dan Hendrycks
ELM
80
157
0
05 Mar 2024
Eight Methods to Evaluate Robust Unlearning in LLMs
Aengus Lynch
Phillip Guo
Aidan Ewart
Stephen Casper
Dylan Hadfield-Menell
ELM
MU
84
67
0
26 Feb 2024
Corrective Machine Unlearning
Shashwat Goel
Ameya Prabhu
Philip Torr
Ponnurangam Kumaraguru
Amartya Sanyal
OnRL
65
16
0
21 Feb 2024
Unlink to Unlearn: Simplifying Edge Unlearning in GNNs
Jiajun Tan
Fei Sun
Ruichen Qiu
Du Su
Huawei Shen
MU
75
8
0
16 Feb 2024
Towards Safer Large Language Models through Machine Unlearning
Zheyuan Liu
Guangyao Dou
Zhaoxuan Tan
Yijun Tian
Meng Jiang
KELM
MU
62
75
0
15 Feb 2024
Soft Prompt Threats: Attacking Safety Alignment and Unlearning in Open-Source LLMs through the Embedding Space
Leo Schwinn
David Dobre
Sophie Xhonneux
Gauthier Gidel
Stephan Gunnemann
AAML
72
39
0
14 Feb 2024
Rethinking Machine Unlearning for Large Language Models
Sijia Liu
Yuanshun Yao
Jinghan Jia
Stephen Casper
Nathalie Baracaldo
...
Hang Li
Kush R. Varshney
Mohit Bansal
Sanmi Koyejo
Yang Liu
AILaw
MU
108
92
0
13 Feb 2024
Selective Forgetting: Advancing Machine Unlearning Techniques and Evaluation in Language Models
Lingzhi Wang
Xingshan Zeng
Jinsong Guo
Kam-Fai Wong
Georg Gottlob
MU
AAML
KELM
29
14
0
08 Feb 2024
Federated Unlearning: a Perspective of Stability and Fairness
Jiaqi Shao
Tao Lin
Xuanyu Cao
Bing Luo
MU
46
7
0
02 Feb 2024
Scalable Federated Unlearning via Isolated and Coded Sharding
Yi-Lan Lin
Zhipeng Gao
Hongyang Du
Dusit Niyato
Gui Gui
Shuguang Cui
Jinke Ren
FedML
51
4
0
29 Jan 2024
Blockchain-enabled Trustworthy Federated Unlearning
Yi-Lan Lin
Zhipeng Gao
Hongyang Du
Jinke Ren
Zhiqiang Xie
Dusit Niyato
MU
50
5
0
29 Jan 2024
Unlearning Traces the Influential Training Data of Language Models
Masaru Isonuma
Ivan Titov
MU
65
7
0
26 Jan 2024
Towards Efficient and Certified Recovery from Poisoning Attacks in Federated Learning
Yu Jiang
Jiyuan Shen
Ziyao Liu
Chee Wei Tan
Kwok-Yan Lam
AAML
FedML
76
5
0
16 Jan 2024
Federated Unlearning: A Survey on Methods, Design Guidelines, and Evaluation Metrics
Nicolò Romandini
Alessio Mora
Carlo Mazzocca
R. Montanari
Paolo Bellavista
FedML
MU
74
22
0
10 Jan 2024
Digger: Detecting Copyright Content Mis-usage in Large Language Model Training
Haodong Li
Gelei Deng
Yi Liu
Kailong Wang
Yuekang Li
Tianwei Zhang
Yang Liu
Guoai Xu
Guosheng Xu
Haoyu Wang
60
25
0
01 Jan 2024
SAME: Sample Reconstruction against Model Extraction Attacks
Yi Xie
Jie Zhang
Shiqian Zhao
Tianwei Zhang
Xiaofeng Chen
AAML
MIACV
72
4
0
17 Dec 2023
LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer
Yuxin Cao
Ziyu Zhao
Xi Xiao
Derui Wang
Minhui Xue
Jin Lu
AAML
55
4
0
15 Dec 2023
FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs
S. Kadhe
Anisa Halimi
Ambrish Rawat
Nathalie Baracaldo
MU
40
7
0
12 Dec 2023
Knowledge Unlearning for LLMs: Tasks, Methods, and Challenges
Nianwen Si
Hao Zhang
Heyu Chang
Wenlin Zhang
Dan Qu
Weiqiang Zhang
KELM
MU
119
27
0
27 Nov 2023
Ethics and Responsible AI Deployment
P. Radanliev
Omar Santos
SILM
50
33
0
12 Nov 2023
Bounded and Unbiased Composite Differential Privacy
Kai Zhang
Yanjun Zhang
Ruoxi Sun
Pei-Wei Tsai
M. Hassan
Xingliang Yuan
Minhui Xue
Jinjun Chen
56
32
0
04 Nov 2023
1
2
3
Next