Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.22770
Cited By
InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models
30 October 2024
Yiming Li
Xiaogeng Liu
SILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models"
4 / 4 papers shown
Title
CAPTURE: Context-Aware Prompt Injection Testing and Robustness Enhancement
Gauri Kholkar
Ratinder Ahuja
SILM
4
0
0
18 May 2025
JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model
Yi Nian
Shenzhe Zhu
Yuehan Qin
Li Li
Ziyi Wang
Chaowei Xiao
Yue Zhao
30
0
0
03 Apr 2025
Riddle Me This! Stealthy Membership Inference for Retrieval-Augmented Generation
A. Naseh
Yuefeng Peng
Anshuman Suri
Harsh Chaudhari
Alina Oprea
Amir Houmansadr
SILM
AAML
RALM
56
0
0
01 Feb 2025
SoK: Unifying Cybersecurity and Cybersafety of Multimodal Foundation Models with an Information Theory Approach
Ruoxi Sun
Jiamin Chang
Hammond Pearce
Chaowei Xiao
B. Li
Qi Wu
Surya Nepal
Minhui Xue
44
0
0
17 Nov 2024
1