Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.12192
Cited By
No Free Lunch for Defending Against Prefilling Attack by In-Context Learning
13 December 2024
Zhiyu Xue
Guangliang Liu
Bocheng Chen
K. Johnson
Ramtin Pedarsani
AAML
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"No Free Lunch for Defending Against Prefilling Attack by In-Context Learning"
1 / 1 papers shown
Title
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal
Tinghao Xie
Xiangyu Qi
Yi Zeng
Yangsibo Huang
Udari Madhushani Sehwag
...
Bo Li
Kai Li
Danqi Chen
Peter Henderson
Prateek Mittal
ALM
ELM
191
79
0
20 Jun 2024
1