
v1v2 (latest)
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs
Papers citing "AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs"
48 / 48 papers shown
Title |
---|
![]() Recent advancements in LLM Red-Teaming: Techniques, Defenses, and
Ethical Considerations Tarun Raheja Nilay Pochhi |
![]() SelfDefend: LLMs Can Defend Themselves against Jailbreaking in a Practical Manner Xunguang Wang Daoyuan Wu Zhenlan Ji Zongjie Li Pingchuan Ma Shuai Wang Yingjiu Li Yang Liu Ning Liu Juergen Rahmel |