2109.11797
Cited By
CPT: Colorful Prompt Tuning for Pre-trained Vision-Language Models
24 September 2021
Yuan Yao
Ao Zhang
Zhengyan Zhang
Zhiyuan Liu
Tat-Seng Chua
Maosong Sun
MLLM
VPVLM
VLM
Papers citing
"CPT: Colorful Prompt Tuning for Pre-trained Vision-Language Models"
50 / 160 papers shown
Title
Convolutional Visual Prompt for Robust Visual Perception
Yun-Yun Tsai
Chengzhi Mao
Junfeng Yang
VLM
VPVLM
31
13
0
01 Mar 2023
Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey
Tianlin Li
Guangyao Chen
Guangwu Qian
Pengcheng Gao
Xiaoyong Wei
Yaowei Wang
Yonghong Tian
Wen Gao
AI4CE
VLM
31
202
0
20 Feb 2023
Prompting for Multimodal Hateful Meme Classification
Rui Cao
Roy Ka-Wei Lee
Wen-Haw Chong
Jing Jiang
VLM
19
75
0
08 Feb 2023
Debiased Fine-Tuning for Vision-language Models by Prompt Regularization
Beier Zhu
Yulei Niu
Saeil Lee
Minhoe Hur
Hanwang Zhang
VLM
VPVLM
24
22
0
29 Jan 2023
Position-guided Text Prompt for Vision-Language Pre-training
Alex Jinpeng Wang
Pan Zhou
Mike Zheng Shou
Shuicheng Yan
VLM
24
37
0
19 Dec 2022
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation
Ziqi Zhou
Bowen Zhang
Yinjie Lei
Lingqiao Liu
Yifan Liu
VLM
29
167
0
07 Dec 2022
Transferability Estimation Based On Principal Gradient Expectation
Huiyan Qi
Lechao Cheng
Jingjing Chen
Yue Yu
Xue Song
Zunlei Feng
Yueping Jiang
21
2
0
29 Nov 2022
Texts as Images in Prompt Tuning for Multi-Label Image Recognition
Zixian Guo
Bowen Dong
Zhilong Ji
Jinfeng Bai
Yiwen Guo
W. Zuo
VLM
VPVLM
28
57
0
23 Nov 2022
Visually Grounded Commonsense Knowledge Acquisition
Yuan Yao
Tianyu Yu
Ao Zhang
Mengdi Li
Ruobing Xie
...
Zhiyuan Liu
Haitao Zheng
S. Wermter
Tat-Seng Chua
Maosong Sun
SSL
19
7
0
22 Nov 2022
CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge
Linli Yao
Wei Chen
Qin Jin
VLM
22
10
0
17 Nov 2022
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models
Cheng Ma
Yang Liu
Jiankang Deng
Lingxi Xie
Weiming Dong
Changsheng Xu
VLM
VPVLM
26
43
0
04 Nov 2022
Could Giant Pretrained Image Models Extract Universal Representations?
Yutong Lin
Ze Liu
Zheng-Wei Zhang
Han Hu
Nanning Zheng
Stephen Lin
Yue Cao
VLM
46
9
0
03 Nov 2022
Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection
Yanxin Long
Jianhua Han
Runhu Huang
Xu Hang
Yi Zhu
Chunjing Xu
Xiaodan Liang
VLM
ObjD
29
18
0
02 Nov 2022
Modal-specific Pseudo Query Generation for Video Corpus Moment Retrieval
Minjoon Jung
Seongho Choi
Joo-Kyung Kim
Jin-Hwa Kim
Byoung-Tak Zhang
36
7
0
23 Oct 2022
Unified Vision and Language Prompt Learning
Yuhang Zang
Wei Li
Kaiyang Zhou
Chen Huang
Chen Change Loy
VLM
VPVLM
14
147
0
13 Oct 2022
MaPLe: Multi-modal Prompt Learning
Muhammad Uzair Khattak
H. Rasheed
Muhammad Maaz
Salman Khan
F. Khan
VPVLM
VLM
194
531
0
06 Oct 2022
Visual Prompt Tuning for Generative Transfer Learning
Kihyuk Sohn
Yuan Hao
José Lezama
Luisa F. Polanía
Huiwen Chang
Han Zhang
Irfan Essa
Lu Jiang
VPVLM
VLM
56
81
0
03 Oct 2022
Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving
Xiwen Liang
Yangxin Wu
Jianhua Han
Hang Xu
Chunjing Xu
Xiaodan Liang
24
31
0
19 Sep 2022
PromptAttack: Prompt-based Attack for Language Models via Gradient Search
Yundi Shi
Piji Li
Changchun Yin
Zhaoyang Han
Lu Zhou
Zhe Liu
AAML
SILM
24
18
0
05 Sep 2022
LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data
Jihye Park
Sunwoo Kim
Soohyun Kim
Seokju Cho
Jaejun Yoo
Youngjung Uh
Seung Wook Kim
VLM
33
9
0
31 Aug 2022
Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model
Yinghui Xing
Qirui Wu
De-Chun Cheng
Shizhou Zhang
Guoqiang Liang
Peng Wang
Yanning Zhang
VLM
VPVLM
56
51
0
17 Aug 2022
Prompt Tuning for Generative Multimodal Pretrained Models
Han Yang
Junyang Lin
An Yang
Peng Wang
Chang Zhou
Hongxia Yang
VLM
LRM
VPVLM
37
30
0
04 Aug 2022
Prompting for Multi-Modal Tracking
Jinyu Yang
Zhe Li
Fengcai Zheng
A. Leonardis
Jingkuan Song
21
86
0
29 Jul 2022
Fine-grained Retrieval Prompt Tuning
Shijie Wang
Jianlong Chang
Zhihui Wang
Haojie Li
Wanli Ouyang
Qi Tian
VLM
VPVLM
11
15
0
29 Jul 2022
Contrastive Adapters for Foundation Model Group Robustness
Michael Zhang
Christopher Ré
VLM
18
61
0
14 Jul 2022
DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations
Ximeng Sun
Ping Hu
Kate Saenko
VLM
33
119
0
20 Jun 2022
What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs
Tal Shaharabany
Yoad Tewel
Lior Wolf
ObjD
38
15
0
19 Jun 2022
Multimodal Learning with Transformers: A Survey
P. Xu
Xiatian Zhu
David A. Clifton
ViT
54
527
0
13 Jun 2022
ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts
Bingqian Lin
Yi Zhu
Zicong Chen
Xiwen Liang
Jian-zhuo Liu
Xiaodan Liang
LM&Ro
25
51
0
31 May 2022
Prompt-aligned Gradient for Prompt Tuning
Beier Zhu
Yulei Niu
Yucheng Han
Yuehua Wu
Hanwang Zhang
VLM
183
271
0
30 May 2022
Prompt-based Learning for Unpaired Image Captioning
Peipei Zhu
Tianlin Li
Lin Zhu
Zhenglong Sun
Weishi Zheng
Yaowei Wang
C. L. P. Chen
VLM
23
31
0
26 May 2022
PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models
Yuan Yao
Qi-An Chen
Ao Zhang
Wei Ji
Zhiyuan Liu
Tat-Seng Chua
Maosong Sun
VLM
MLLM
26
38
0
23 May 2022
Prompt Tuning for Discriminative Pre-trained Language Models
Yuan Yao
Bowen Dong
Ao Zhang
Zhengyan Zhang
Ruobing Xie
Zhiyuan Liu
Leyu Lin
Maosong Sun
Jianyong Wang
VLM
16
34
0
23 May 2022
A Comprehensive Survey of Few-shot Learning: Evolution, Applications, Challenges, and Opportunities
Yisheng Song
Ting-Yuan Wang
S. Mondal
J. P. Sahoo
SLR
50
343
0
13 May 2022
Declaration-based Prompt Tuning for Visual Question Answering
Yuhang Liu
Wei Wei
Daowan Peng
Feida Zhu
MLLM
VLM
19
19
0
05 May 2022
Visual Commonsense in Pretrained Unimodal and Multimodal Models
Chenyu Zhang
Benjamin Van Durme
Zhuowan Li
Elias Stengel-Eskin
VLM
SSL
23
39
0
04 May 2022
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models
Chunyuan Li
Haotian Liu
Liunian Harold Li
Pengchuan Zhang
J. Aneja
...
Ping Jin
Houdong Hu
Zicheng Liu
Yong Jae Lee
Jianfeng Gao
29
144
0
19 Apr 2022
Multi-Modal Few-Shot Object Detection with Meta-Learning-Based Cross-Modal Prompting
G. Han
Long Chen
Jiawei Ma
Shiyuan Huang
Ramalingam Chellappa
Shih-Fu Chang
VLM
29
20
0
16 Apr 2022
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension
Sanjay Subramanian
William Merrill
Trevor Darrell
Matt Gardner
Sameer Singh
Anna Rohrbach
ObjD
24
125
0
12 Apr 2022
Exploring Visual Prompts for Adapting Large-Scale Models
Hyojin Bahng
Ali Jahanian
S. Sankaranarayanan
Phillip Isola
VLM
VPVLM
LRM
25
255
0
31 Mar 2022
Do Vision-Language Pretrained Models Learn Composable Primitive Concepts?
Tian Yun
Usha Bhalla
Ellie Pavlick
Chen Sun
ReLM
CoGe
VLM
LRM
31
23
0
31 Mar 2022
Visual Prompt Tuning
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge J. Belongie
Bharath Hariharan
Ser-Nam Lim
VLM
VPVLM
40
1,522
0
23 Mar 2022
Fine-Grained Scene Graph Generation with Data Transfer
Ao Zhang
Yuan Yao
Qián Chen
Wei Ji
Zhiyuan Liu
Maosong Sun
Tat-Seng Chua
21
89
0
22 Mar 2022
Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
Haojun Jiang
Yuanze Lin
Dongchen Han
Shiji Song
Gao Huang
ObjD
37
50
0
16 Mar 2022
Modular and Parameter-Efficient Multimodal Fusion with Prompting
Sheng Liang
Mengjie Zhao
Hinrich Schütze
33
42
0
15 Mar 2022
Towards Visual-Prompt Temporal Answering Grounding in Medical Instructional Video
Bin Li
Yixuan Weng
Bin Sun
Shutao Li
32
24
0
13 Mar 2022
Conditional Prompt Learning for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VLM
CLIP
VPVLM
32
1,286
0
10 Mar 2022
Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking
Boyu Chen
Peixia Li
Lei Bai
Leixian Qiao
Qiuhong Shen
Bo-wen Li
Weihao Gan
Wei Wu
Wanli Ouyang
ViT
VOT
22
182
0
10 Mar 2022
The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning
Jack Hessel
Jena D. Hwang
J. Park
Rowan Zellers
Chandra Bhagavatula
Anna Rohrbach
Kate Saenko
Yejin Choi
ReLM
151
48
0
10 Feb 2022
SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples
Hao Wang
Yangguang Li
Zhen Huang
Yong Dou
Lingpeng Kong
Jing Shao
SSL
11
54
0
16 Jan 2022