Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2501.18100
Cited By
Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation
30 January 2025
Yishuo Wang
Tiansheng Huang
Li Shen
H. Yao
Haotian Luo
Rui Liu
Naiqiang Tan
Jiaxing Huang
Dacheng Tao
AAML
MoMe
CLL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation"
1 / 1 papers shown
Title
Benign Samples Matter! Fine-tuning On Outlier Benign Samples Severely Breaks Safety
Zihan Guan
Mengxuan Hu
Ronghang Zhu
Sheng Li
Anil Vullikanti
AAML
31
0
0
11 May 2025
1