Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.14348
Cited By
v1
v2
v3 (latest)
Manipulating Multimodal Agents via Cross-Modal Prompt Injection
19 April 2025
Le Wang
Zonghao Ying
Tianyuan Zhang
Siyuan Liang
Shengshan Hu
Mingchuan Zhang
A. Liu
Xianglong Liu
AAML
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Manipulating Multimodal Agents via Cross-Modal Prompt Injection"
19 / 69 papers shown
Title
Sigmoid Loss for Language Image Pre-Training
Xiaohua Zhai
Basil Mustafa
Alexander Kolesnikov
Lucas Beyer
CLIP
VLM
257
1,200
0
27 Mar 2023
Rethinking Model Ensemble in Transfer-based Adversarial Attacks
Huanran Chen
Yichi Zhang
Yinpeng Dong
Xiao Yang
Hang Su
Junyi Zhu
AAML
89
69
0
16 Mar 2023
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
1.5K
14,761
0
15 Mar 2023
ViperGPT: Visual Inference via Python Execution for Reasoning
Dídac Surís
Sachit Menon
Carl Vondrick
MLLM
LRM
ReLM
120
466
0
14 Mar 2023
Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
Chenfei Wu
Sheng-Kai Yin
Weizhen Qi
Xiaodong Wang
Zecheng Tang
Nan Duan
MLLM
LRM
136
645
0
08 Mar 2023
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
1.5K
13,472
0
27 Feb 2023
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
Kai Greshake
Sahar Abdelnabi
Shailesh Mishra
C. Endres
Thorsten Holz
Mario Fritz
SILM
143
498
0
23 Feb 2023
Visual Programming: Compositional visual reasoning without training
Tanmay Gupta
Aniruddha Kembhavi
ReLM
VLM
LRM
145
439
0
18 Nov 2022
A Large-scale Multiple-objective Method for Black-box Attack against Object Detection
Siyuan Liang
Longkang Li
Yanbo Fan
Xiaojun Jia
Jingzhi Li
Baoyuan Wu
Xiaochun Cao
AAML
77
35
0
16 Sep 2022
Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning
Weixin Liang
Yuhui Zhang
Yongchan Kwon
Serena Yeung
James Zou
VLM
137
429
0
03 Mar 2022
Parallel Rectangle Flip Attack: A Query-based Black-box Attack against Object Detection
Siyuan Liang
Baoyuan Wu
Yanbo Fan
Xingxing Wei
Xiaochun Cao
AAML
82
72
0
22 Jan 2022
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
1.0K
29,926
0
26 Feb 2021
Towards Defending Multiple
ℓ
p
\ell_p
ℓ
p
-norm Bounded Adversarial Perturbations via Gated Batch Normalization
Aishan Liu
Shiyu Tang
Xinyun Chen
Lei Huang
Zhuozhuo Tu
Xianglong Liu
Dacheng Tao
AAML
98
34
0
03 Dec 2020
Efficient Adversarial Attacks for Visual Object Tracking
Siyuan Liang
Xingxing Wei
Siyuan Yao
Xiaochun Cao
AAML
65
75
0
01 Aug 2020
CodeSearchNet Challenge: Evaluating the State of Semantic Code Search
Hamel Husain
Hongqiu Wu
Tiferet Gazit
Miltiadis Allamanis
Marc Brockschmidt
ELM
130
1,086
0
20 Sep 2019
Training Robust Deep Neural Networks via Adversarial Noise Propagation
Aishan Liu
Xianglong Liu
Chongzhi Zhang
Hang Yu
Qiang Liu
Dacheng Tao
AAML
62
116
0
19 Sep 2019
On Evaluating Adversarial Robustness
Nicholas Carlini
Anish Athalye
Nicolas Papernot
Wieland Brendel
Jonas Rauber
Dimitris Tsipras
Ian Goodfellow
Aleksander Madry
Alexey Kurakin
ELM
AAML
112
905
0
18 Feb 2019
Transferable Adversarial Attacks for Image and Video Object Detection
Xingxing Wei
Siyuan Liang
Ning Chen
Xiaochun Cao
AAML
112
224
0
30 Nov 2018
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Noam M. Shazeer
Azalia Mirhoseini
Krzysztof Maziarz
Andy Davis
Quoc V. Le
Geoffrey E. Hinton
J. Dean
MoE
253
2,692
0
23 Jan 2017
Previous
1
2