Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.05344
Cited By
MemeGuard: An LLM and VLM-based Framework for Advancing Content Moderation via Meme Intervention
8 June 2024
Prince Jha
Raghav Jain
Konika Mandal
Aman Chadha
Sriparna Saha
P. Bhattacharyya
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MemeGuard: An LLM and VLM-based Framework for Advancing Content Moderation via Meme Intervention"
6 / 6 papers shown
Title
LLM-C3MOD: A Human-LLM Collaborative System for Cross-Cultural Hate Speech Moderation
Junyeong Park
Seogyeong Jeong
Shri Kiran Srinivasan
Yohan Lee
Alice H. Oh
57
0
0
10 Mar 2025
Bridging the Safety Gap: A Guardrail Pipeline for Trustworthy LLM Inferences
Shanshan Han
Salman Avestimehr
Chaoyang He
76
1
0
12 Feb 2025
SLM-Mod: Small Language Models Surpass LLMs at Content Moderation
Xianyang Zhan
Agam Goyal
Yilun Chen
Eshwar Chandrasekharan
Koustuv Saha
AI4MH
159
0
0
17 Oct 2024
Bridging Today and the Future of Humanity: AI Safety in 2024 and Beyond
Shanshan Han
87
1
0
09 Oct 2024
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
328
4,077
0
24 May 2022
Latent Hatred: A Benchmark for Understanding Implicit Hate Speech
Mai Elsherief
Caleb Ziems
D. Muchlinski
Vaishnavi Anupindi
Jordyn Seybolt
M. D. Choudhury
Diyi Yang
106
237
0
11 Sep 2021
1