arXiv:2309.14122
SurrogatePrompt: Bypassing the Safety Filter of Text-To-Image Models via Substitution
25 September 2023
Zhongjie Ba
Jieming Zhong
Jiachen Lei
Pengyu Cheng
Qinglong Wang
Zhan Qin
Peng Kuang
Kui Ren
Papers citing "SurrogatePrompt: Bypassing the Safety Filter of Text-To-Image Models via Substitution"
12 / 12 papers shown
Title
TokenProber: Jailbreaking Text-to-image Models via Fine-grained Word Impact Analysis
Longtian Wang
Xiaofei Xie
Tianlin Li
Yuhan Zhi
Chao Shen
21
0
0
11 May 2025
T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation
Lijun Li
Zhelun Shi
Xuhao Hu
Bowen Dong
Yiran Qin
Xihui Liu
Lu Sheng
Jing Shao
116
1
0
21 Feb 2025
Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey
Xuannan Liu
Xing Cui
Peipei Li
Zekun Li
Huaibo Huang
Shuhan Xia
Miaoxuan Zhang
Yueying Zou
Ran He
AAML
67
8
0
14 Nov 2024
Perception-guided Jailbreak against Text-to-Image Models
Yihao Huang
Le Liang
Tianlin Li
Xiaojun Jia
Run Wang
Weikai Miao
G. Pu
Yang Liu
46
7
0
20 Aug 2024
Jailbreaking Text-to-Image Models with LLM-Based Agents
Yingkai Dong
Zheng Li
Xiangtao Meng
Ning Yu
Shanqing Guo
LLMAG
45
13
0
01 Aug 2024
Adversarial Attacks and Defenses on Text-to-Image Diffusion Models: A Survey
Chenyu Zhang
Mingwang Hu
Wenhui Li
Lanjun Wang
41
15
0
10 Jul 2024
Espresso: Robust Concept Filtering in Text-to-Image Models
Anudeep Das
Vasisht Duddu
Rui Zhang
Nadarajah Asokan
EGVM
38
6
0
30 Apr 2024
LLMs for Cyber Security: New Opportunities
D. Divakaran
Sai Teja Peddinti
28
11
0
17 Apr 2024
GuardT2I: Defending Text-to-Image Models from Adversarial Prompts
Yijun Yang
Ruiyuan Gao
Xiao Yang
Qiang Xu
32
15
0
03 Mar 2024
Red-Teaming the Stable Diffusion Safety Filter
Javier Rando
Daniel Paleka
David Lindner
Lennard Heim
Florian Tramèr
DiffM
132
184
0
03 Oct 2022
Discovering the Hidden Vocabulary of DALLE-2
Giannis Daras
A. Dimakis
132
64
0
01 Jun 2022
Generating Natural Language Adversarial Examples
M. Alzantot
Yash Sharma
Ahmed Elgohary
Bo-Jhang Ho
Mani B. Srivastava
Kai-Wei Chang
AAML
258
916
0
21 Apr 2018