Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2409.18708
Cited By

Evading Toxicity Detection with ASCII-art: A Benchmark of Spatial Attacks on Moderation Systems

v1v2v3v4v5 (latest)

Evading Toxicity Detection with ASCII-art: A Benchmark of Spatial Attacks on Moderation Systems

27 September 2024

ArXiv (abs)PDF HTML Github

Papers citing "Evading Toxicity Detection with ASCII-art: A Benchmark of Spatial Attacks on Moderation Systems"

4 / 4 papers shown

Harmful Prompt Laundering: Jailbreaking LLMs with Abductive Styles and Symbolic Encoding

Harmful Prompt Laundering: Jailbreaking LLMs with Abductive Styles and Symbolic Encoding

225

4

0

13 Sep 2025

ToxiFrench: Benchmarking and Enhancing Language Models via CoT Fine-Tuning for French Toxicity Detection

ToxiFrench: Benchmarking and Enhancing Language Models via CoT Fine-Tuning for French Toxicity Detection

200

0

0

15 Aug 2025

Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models

Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models

427

5

0

02 Apr 2025

From Intrinsic Toxicity to Reception-Based Toxicity: A Contextual Framework for Prediction and Evaluation

From Intrinsic Toxicity to Reception-Based Toxicity: A Contextual Framework for Prediction and Evaluation

369

1

0

20 Mar 2025

Page 1 of 1