ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2409.18708
  4. Cited By
Evading Toxicity Detection with ASCII-art: A Benchmark of Spatial Attacks on Moderation Systems
v1v2v3v4v5 (latest)

Evading Toxicity Detection with ASCII-art: A Benchmark of Spatial Attacks on Moderation Systems

27 September 2024
Sergey Berezin
R. Farahbakhsh
Noel Crespi
ArXiv (abs)PDFHTMLGithub

Papers citing "Evading Toxicity Detection with ASCII-art: A Benchmark of Spatial Attacks on Moderation Systems"

4 / 4 papers shown
Harmful Prompt Laundering: Jailbreaking LLMs with Abductive Styles and Symbolic Encoding
Harmful Prompt Laundering: Jailbreaking LLMs with Abductive Styles and Symbolic Encoding
Seongho Joo
Hyukhun Koh
Kyomin Jung
225
4
0
13 Sep 2025
ToxiFrench: Benchmarking and Enhancing Language Models via CoT Fine-Tuning for French Toxicity Detection
ToxiFrench: Benchmarking and Enhancing Language Models via CoT Fine-Tuning for French Toxicity Detection
Axel Delaval
Shujian Yang
Huaimin Wang
Han Qiu
Jialiang Lu
200
0
0
15 Aug 2025
Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models
Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models
Zhaochen Wang
Yujun Cai
Zi Huang
Bryan Hooi
Yiwei Wang
Ming Yang
CoGeVLM
427
5
0
02 Apr 2025
From Intrinsic Toxicity to Reception-Based Toxicity: A Contextual Framework for Prediction and Evaluation
From Intrinsic Toxicity to Reception-Based Toxicity: A Contextual Framework for Prediction and Evaluation
Sergey Berezin
R. Farahbakhsh
Noel Crespi
369
1
0
20 Mar 2025
1
Page 1 of 1