
Don't Listen To Me: Understanding and Exploring Jailbreak Prompts of Large Language Models
Papers citing "Don't Listen To Me: Understanding and Exploring Jailbreak Prompts of Large Language Models"
19 / 19 papers shown
Title |
---|
![]() SelfDefend: LLMs Can Defend Themselves against Jailbreaking in a Practical Manner Xunguang Wang Daoyuan Wu Zhenlan Ji Zongjie Li Pingchuan Ma Shuai Wang Yingjiu Li Yang Liu Ning Liu Juergen Rahmel |