Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.17216
Cited By
Machine Unlearning Fails to Remove Data Poisoning Attacks
25 June 2024
Martin Pawelczyk
Jimmy Z. Di
Yiwei Lu
Gautam Kamath
Ayush Sekhari
Seth Neel
AAML
MU
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Machine Unlearning Fails to Remove Data Poisoning Attacks"
10 / 10 papers shown
Title
Certified Data Removal Under High-dimensional Settings
Haolin Zou
Arnab Auddy
Yongchan Kwon
Kamiar Rahnama Rad
A. Maleki
MU
39
0
0
12 May 2025
Erasing Without Remembering: Implicit Knowledge Forgetting in Large Language Models
Huazheng Wang
Yongcheng Jing
Haifeng Sun
Yingjie Wang
J. Wang
Jianxin Liao
Dacheng Tao
KELM
MU
47
0
0
27 Feb 2025
Delta-Influence: Unlearning Poisons via Influence Functions
Wenjie Li
Jiawei Li
Christian Schroeder de Witt
Ameya Prabhu
Amartya Sanyal
TDI
MU
97
0
0
20 Nov 2024
Attribute-to-Delete: Machine Unlearning via Datamodel Matching
Kristian Georgiev
Roy Rinberg
Sung Min Park
Shivam Garg
Andrew Ilyas
Aleksander Madry
Seth Neel
MU
49
3
0
30 Oct 2024
Data Deletion for Linear Regression with Noisy SGD
Zhangjie Xia
Chi-Hua Wang
Guang Cheng
30
2
0
12 Oct 2024
An Adversarial Perspective on Machine Unlearning for AI Safety
Jakub Łucki
Boyi Wei
Yangsibo Huang
Peter Henderson
F. Tramèr
Javier Rando
MU
AAML
73
32
0
26 Sep 2024
Poisoning Language Models During Instruction Tuning
Alexander Wan
Eric Wallace
Sheng Shen
Dan Klein
SILM
94
124
0
01 May 2023
Exploring the Limits of Model-Targeted Indiscriminate Data Poisoning Attacks
Yiwei Lu
Gautam Kamath
Yaoliang Yu
AAML
39
18
0
07 Mar 2023
Knowledge Unlearning for Mitigating Privacy Risks in Language Models
Joel Jang
Dongkeun Yoon
Sohee Yang
Sungmin Cha
Moontae Lee
Lajanugen Logeswaran
Minjoon Seo
KELM
PILM
MU
147
191
0
04 Oct 2022
Linear Adversarial Concept Erasure
Shauli Ravfogel
Michael Twiton
Yoav Goldberg
Ryan Cotterell
KELM
81
57
0
28 Jan 2022
1