ResearchTrend.AI
BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B
v1v2v3 (latest)

BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B

Papers citing "BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B"

25 / 25 papers shown
Title
Locking Down the Finetuned LLMs Safety
Locking Down the Finetuned LLMs Safety
Minjun Zhu
Linyi Yang
Yifan Wei
Ningyu Zhang
Yue Zhang
108
14
0
14 Oct 2024

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.