
ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time
Papers citing "ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time"
Title |
---|
Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations. Hakan Inan, Kartikeya Upasani, Jianfeng Chi, Rashi Rungta, Krithika Iyer, ..., Michael Tontchev, Qing Hu, Brian Fuller, Davide Testuggine, Madian Khabsa |