
Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations
Kartikeya Upasani
Rashi Rungta
Madian Khabsa
Papers citing "Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations"
50 / 336 papers shown
Title |
---|
![]() Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in
Red Teaming GenAI Ambrish Rawat Stefan Schoepf Giulio Zizzo Giandomenico Cornacchia Muhammad Zaid Hameed ...Elizabeth M. Daly Mark Purcell P. Sattigeri Pin-Yu Chen Kush R. Varshney |