Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.20150
Cited By
Unlearn What You Want to Forget: Efficient Unlearning for LLMs
31 October 2023
Jiaao Chen
Diyi Yang
MU
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Unlearn What You Want to Forget: Efficient Unlearning for LLMs"
50 / 115 papers shown
Title
Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation
Stefan Vasilev
Christian Herold
Baohao Liao
Seyyed Hadi Hashemi
Shahram Khadivi
Christof Monz
MU
138
0
0
09 May 2025
Unlearning vs. Obfuscation: Are We Truly Removing Knowledge?
Guangzhi Sun
Potsawee Manakul
Xiao Zhan
Mark J. F. Gales
MU
56
0
0
05 May 2025
Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions
Yiming Du
Wenyu Huang
Danna Zheng
Zhaowei Wang
Sébastien Montella
Mirella Lapata
Kam-Fai Wong
Jeff Z. Pan
KELM
MU
78
2
0
01 May 2025
SHA256 at SemEval-2025 Task 4: Selective Amnesia -- Constrained Unlearning for Large Language Models via Knowledge Isolation
Saransh Agrawal
Kuan-Hao Huang
MU
KELM
54
0
0
17 Apr 2025
GRAIL: Gradient-Based Adaptive Unlearning for Privacy and Copyright in LLMs
Kun-Woo Kim
Ji-Hoon Park
Ju-Min Han
Seong-Whan Lee
MU
PILM
62
0
0
17 Apr 2025
SUV: Scalable Large Language Model Copyright Compliance with Regularized Selective Unlearning
Tianyang Xu
Xiaoze Liu
Feijie Wu
Xiaoqian Wang
Jing Gao
MU
56
0
0
29 Mar 2025
ZJUKLAB at SemEval-2025 Task 4: Unlearning via Model Merging
Haoming Xu
Shuxun Wang
Yanqiu Zhao
Yi Zhong
Ziyan Jiang
Ningyuan Zhao
Shumin Deng
H. Chen
N. Zhang
MoMe
MU
69
0
0
27 Mar 2025
Effective Skill Unlearning through Intervention and Abstention
Yongce Li
Chung-En Sun
Tsui-Wei Weng
MU
149
0
0
27 Mar 2025
UIPE: Enhancing LLM Unlearning by Removing Knowledge Related to Forgetting Targets
Wenyu Wang
M. Zhang
Xiaotian Ye
Z. Z. Ren
Z. Chen
Pengjie Ren
MU
KELM
168
0
0
06 Mar 2025
Erasing Without Remembering: Safeguarding Knowledge Forgetting in Large Language Models
Huazheng Wang
Yongcheng Jing
Haifeng Sun
Yingjie Wang
J. Wang
Jianxin Liao
Dacheng Tao
KELM
MU
47
0
0
27 Feb 2025
Holistic Audit Dataset Generation for LLM Unlearning via Knowledge Graph Traversal and Redundancy Removal
Weipeng Jiang
Juan Zhai
Shiqing Ma
Ziyan Lei
Xiaofei Xie
Yige Wang
Chao Shen
MU
66
0
0
26 Feb 2025
Proactive Privacy Amnesia for Large Language Models: Safeguarding PII with Negligible Impact on Model Utility
Martin Kuo
Jingyang Zhang
Jianyi Zhang
Minxue Tang
Louis DiValentin
...
William Chen
Amin Hass
Tianlong Chen
Y. Chen
H. Li
MU
KELM
42
2
0
24 Feb 2025
UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning
Vaidehi Patil
Elias Stengel-Eskin
Mohit Bansal
MU
CLL
75
2
0
20 Feb 2025
CoME: An Unlearning-based Approach to Conflict-free Model Editing
Dahyun Jung
Jaehyung Seo
Jaewook Lee
Chanjun Park
Heuiseok Lim
MU
KELM
47
0
0
20 Feb 2025
LUME: LLM Unlearning with Multitask Evaluations
Anil Ramakrishna
Yixin Wan
Xiaomeng Jin
Kai-Wei Chang
Zhiqi Bu
Bhanukiran Vinzamuri
V. Cevher
Mingyi Hong
Rahul Gupta
CLL
MU
103
7
0
20 Feb 2025
ReLearn: Unlearning via Learning for Large Language Models
Haoming Xu
Ningyuan Zhao
Liming Yang
Sendong Zhao
Shumin Deng
Mengru Wang
Bryan Hooi
Nay Oo
H. Chen
N. Zhang
KELM
CLL
MU
156
0
0
16 Feb 2025
LUNAR: LLM Unlearning via Neural Activation Redirection
William F. Shen
Xinchi Qiu
Meghdad Kurmanji
Alex Iacob
Lorenzo Sani
Yihong Chen
Nicola Cancedda
Nicholas D. Lane
MU
56
1
0
11 Feb 2025
Direct Unlearning Optimization for Robust and Safe Text-to-Image Models
Yong-Hyun Park
Sangdoo Yun
Jin-Hwa Kim
Junho Kim
Geonhui Jang
Yonghyun Jeong
Junghyo Jo
Gayoung Lee
76
13
0
17 Jan 2025
OpenGU: A Comprehensive Benchmark for Graph Unlearning
Bowen Fan
Yuming Ai
Xunkai Li
Zhilin Guo
Rong-Hua Li
Guoren Wang
MU
43
0
0
06 Jan 2025
Privacy in Fine-tuning Large Language Models: Attacks, Defenses, and Future Directions
Hao Du
Shang Liu
Lele Zheng
Yang Cao
Atsuyoshi Nakamura
Lei Chen
AAML
114
3
0
21 Dec 2024
Large Language Model Federated Learning with Blockchain and Unlearning for Cross-Organizational Collaboration
Xuhan Zuo
Minghao Wang
Tianqing Zhu
Shui Yu
Wanlei Zhou
MU
72
2
0
18 Dec 2024
Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy, Research, and Practice
A. Feder Cooper
Christopher A. Choquette-Choo
Miranda Bogen
Matthew Jagielski
Katja Filippova
...
Abigail Z. Jacobs
Andreas Terzis
Hanna M. Wallach
Nicolas Papernot
Katherine Lee
AILaw
MU
93
10
0
09 Dec 2024
Copyright-Protected Language Generation via Adaptive Model Fusion
Javier Abad
Konstantin Donhauser
Francesco Pinto
Fanny Yang
74
1
0
09 Dec 2024
Unified Parameter-Efficient Unlearning for LLMs
Chenlu Ding
Jiancan Wu
Yancheng Yuan
Jinda Lu
Kai Zhang
Alex Su
Xiang Wang
Xiangnan He
MU
KELM
100
6
0
30 Nov 2024
Towards Robust Evaluation of Unlearning in LLMs via Data Transformations
Abhinav Joshi
Shaswati Saha
Divyaksh Shukla
Sriram Vema
Harsh Jhamtani
Manas Gaur
Ashutosh Modi
MU
83
3
0
23 Nov 2024
Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset
Yingzi Ma
Jiongxiao Wang
Fei-Yue Wang
Siyuan Ma
Jiazhao Li
...
B. Li
Yejin Choi
M. Chen
Chaowei Xiao
Chaowei Xiao
MU
52
6
0
05 Nov 2024
RESTOR: Knowledge Recovery through Machine Unlearning
Keivan Rezaei
Khyathi Raghavi Chandu
S. Feizi
Yejin Choi
Faeze Brahman
Abhilasha Ravichander
KELM
CLL
MU
58
0
0
31 Oct 2024
CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP
Tianyu Yang
Lisen Dai
Zheyuan Liu
Xiangqi Wang
Meng-Long Jiang
Yapeng Tian
Xiangliang Zhang
VLM
MU
29
4
0
30 Oct 2024
Unlearning as multi-task optimization: A normalized gradient difference approach with an adaptive learning rate
Zhiqi Bu
Xiaomeng Jin
Bhanukiran Vinzamuri
Anil Ramakrishna
Kai-Wei Chang
V. Cevher
Mingyi Hong
MU
85
6
0
29 Oct 2024
Cross-model Control: Improving Multiple Large Language Models in One-time Training
Jiayi Wu
Hao-Lun Sun
Hengyi Cai
Lixin Su
S. Wang
Dawei Yin
Xiang Li
Ming Gao
MU
34
0
0
23 Oct 2024
CLEAR: Character Unlearning in Textual and Visual Modalities
Alexey Dontsov
Dmitrii Korzh
Alexey Zhavoronkin
Boris Mikheev
Denis Bobkov
Aibek Alanov
Oleg Y. Rogov
Ivan V. Oseledets
Elena Tutubalina
AILaw
VLM
MU
66
5
0
23 Oct 2024
Catastrophic Failure of LLM Unlearning via Quantization
Zhiwei Zhang
Fali Wang
Xiaomin Li
Zongyu Wu
Xianfeng Tang
Hui Liu
Qi He
Wenpeng Yin
Suhang Wang
MU
34
5
0
21 Oct 2024
Do LLMs "know" internally when they follow instructions?
Juyeon Heo
Christina Heinze-Deml
Oussama Elachqar
Shirley Ren
Udhay Nallasamy
Andy Miller
Kwan Ho Ryan Chan
Jaya Narain
51
3
0
18 Oct 2024
Mechanistic Unlearning: Robust Knowledge Unlearning and Editing via Mechanistic Localization
Phillip Guo
Aaquib Syed
Abhay Sheshadri
Aidan Ewart
Gintare Karolina Dziugaite
KELM
MU
31
5
0
16 Oct 2024
LLM Unlearning via Loss Adjustment with Only Forget Data
Yaxuan Wang
Jiaheng Wei
Chris Liu
Jinlong Pang
Q. Liu
A. Shah
Yujia Bao
Yang Liu
Wei Wei
KELM
MU
37
7
0
14 Oct 2024
On Calibration of LLM-based Guard Models for Reliable Content Moderation
Hongfu Liu
Hengguan Huang
Hao Wang
Xiangming Gu
Ye Wang
55
2
0
14 Oct 2024
Dissecting Fine-Tuning Unlearning in Large Language Models
Yihuai Hong
Yuelin Zou
Lijie Hu
Ziqian Zeng
Di Wang
Haiqin Yang
AAML
MU
39
2
0
09 Oct 2024
A Probabilistic Perspective on Unlearning and Alignment for Large Language Models
Yan Scholten
Stephan Günnemann
Leo Schwinn
MU
55
6
0
04 Oct 2024
Undesirable Memorization in Large Language Models: A Survey
Ali Satvaty
Suzan Verberne
Fatih Turkmen
ELM
PILM
71
7
0
03 Oct 2024
Efficient Backdoor Defense in Multimodal Contrastive Learning: A Token-Level Unlearning Method for Mitigating Threats
Kuanrong Liu
Siyuan Liang
Jiawei Liang
Pengwen Dai
Xiaochun Cao
MU
AAML
31
1
0
29 Sep 2024
An Adversarial Perspective on Machine Unlearning for AI Safety
Jakub Łucki
Boyi Wei
Yangsibo Huang
Peter Henderson
F. Tramèr
Javier Rando
MU
AAML
71
31
0
26 Sep 2024
Creative Writers' Attitudes on Writing as Training Data for Large Language Models
Katy Ilonka Gero
Meera Desai
Carly Schnitzler
Nayun Eom
Jack Cushman
Elena L. Glassman
23
1
0
22 Sep 2024
Alternate Preference Optimization for Unlearning Factual Knowledge in Large Language Models
Anmol Mekala
Vineeth Dorna
Shreya Dubey
Abhishek Lalwani
David Koleczek
Mukund Rungta
Sadid Hasan
Elita Lobo
KELM
MU
36
2
0
20 Sep 2024
LLM Surgery: Efficient Knowledge Unlearning and Editing in Large Language Models
Akshaj Kumar Veldanda
Shi-Xiong Zhang
Anirban Das
Supriyo Chakraborty
Stephen Rawls
Sambit Sahu
Milind Naphade
KELM
MU
25
0
0
19 Sep 2024
Recent Advances in Attack and Defense Approaches of Large Language Models
Jing Cui
Yishi Xu
Zhewei Huang
Shuchang Zhou
Jianbin Jiao
Junge Zhang
PILM
AAML
54
1
0
05 Sep 2024
CURE4Rec: A Benchmark for Recommendation Unlearning with Deeper Influence
Chaochao Chen
Jiaming Zhang
Yizhao Zhang
Li Zhang
Lingjuan Lyu
Yuyuan Li
Biao Gong
Chenggang Yan
CML
ELM
MU
27
3
0
26 Aug 2024
Towards Robust Knowledge Unlearning: An Adversarial Framework for Assessing and Improving Unlearning Robustness in Large Language Models
Hongbang Yuan
Zhuoran Jin
Pengfei Cao
Yubo Chen
Kang Liu
Jun Zhao
AAML
ELM
MU
44
1
0
20 Aug 2024
A Population-to-individual Tuning Framework for Adapting Pretrained LM to On-device User Intent Prediction
Jiahui Gong
Jingtao Ding
Fanjin Meng
Guilong Chen
Hong Chen
Shen Zhao
Haisheng Lu
Yong Li
MU
37
6
0
19 Aug 2024
WPN: An Unlearning Method Based on N-pair Contrastive Learning in Language Models
Guitao Chen
Yunshen Wang
Hongye Sun
Guang Chen
MU
26
1
0
18 Aug 2024
UNLEARN Efficient Removal of Knowledge in Large Language Models
Tyler Lizzo
Larry Heck
KELM
MoMe
MU
38
1
0
08 Aug 2024
1
2
3
Next