Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.13581
Cited By
RAR: Setting Knowledge Tripwires for Retrieval Augmented Rejection
19 May 2025
T. M. Buonocore
Enea Parimbelli
AAML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RAR: Setting Knowledge Tripwires for Retrieval Augmented Rejection"
4 / 4 papers shown
Title
RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models
Bang An
Shiyue Zhang
Mark Dredze
84
4
0
25 Apr 2025
Machine Against the RAG: Jamming Retrieval-Augmented Generation with Blocker Documents
Avital Shafran
R. Schuster
Vitaly Shmatikov
79
31
0
09 Jun 2024
OR-Bench: An Over-Refusal Benchmark for Large Language Models
Justin Cui
Wei-Lin Chiang
Ion Stoica
Cho-Jui Hsieh
ALM
75
45
0
31 May 2024
Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations
Hakan Inan
Kartikeya Upasani
Jianfeng Chi
Rashi Rungta
Krithika Iyer
...
Michael Tontchev
Qing Hu
Brian Fuller
Davide Testuggine
Madian Khabsa
AI4MH
68
423
0
07 Dec 2023
1