ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.12119
  4. Cited By
Visual Prompt Tuning
v1v2 (latest)

Visual Prompt Tuning

23 March 2022
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
    VLMVPVLM
ArXiv (abs)PDFHTML

Papers citing "Visual Prompt Tuning"

50 / 1,088 papers shown
Title
On the Efficacy of Differentially Private Few-shot Image Classification
On the Efficacy of Differentially Private Few-shot Image Classification
Marlon Tobaben
Aliaksandra Shysheya
J. Bronskill
Andrew Paverd
Shruti Tople
Santiago Zanella Béguelin
Richard Turner
Antti Honkela
96
12
0
02 Feb 2023
Boosting Low-Data Instance Segmentation by Unsupervised Pre-training
  with Saliency Prompt
Boosting Low-Data Instance Segmentation by Unsupervised Pre-training with Saliency Prompt
Hao Li
Dingwen Zhang
Nian Liu
Lechao Cheng
Yalun Dai
Chaoxi Zhang
Xinggang Wang
Junwei Han
76
17
0
02 Feb 2023
A Survey on Efficient Training of Transformers
A Survey on Efficient Training of Transformers
Bohan Zhuang
Jing Liu
Zizheng Pan
Haoyu He
Yuetian Weng
Chunhua Shen
128
49
0
02 Feb 2023
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
Ming Tao
Bingkun Bao
Hao Tang
Changsheng Xu
DiffMVLM
114
109
0
30 Jan 2023
ZegOT: Zero-shot Segmentation Through Optimal Transport of Text Prompts
ZegOT: Zero-shot Segmentation Through Optimal Transport of Text Prompts
Kwanyoung Kim
Y. Oh
Jong Chul Ye
VLMOTCLIP
75
20
0
28 Jan 2023
Open-World Multi-Task Control Through Goal-Aware Representation Learning
  and Adaptive Horizon Prediction
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction
Shaofei Cai
Zihao Wang
Xiaojian Ma
Hoang Trung-Dung
Yitao Liang
126
42
0
21 Jan 2023
Vision Learners Meet Web Image-Text Pairs
Vision Learners Meet Web Image-Text Pairs
Bingchen Zhao
Quan Cui
Hao Wu
Osamu Yoshie
Cheng Yang
Oisin Mac Aodha
VLM
84
5
0
17 Jan 2023
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with
  Multimodal Models
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models
Zhiqiu Lin
Samuel Yu
Zhiyi Kuang
Deepak Pathak
Deva Ramana
VLM
152
116
0
16 Jan 2023
See, Think, Confirm: Interactive Prompting Between Vision and Language
  Models for Knowledge-based Visual Reasoning
See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge-based Visual Reasoning
Zhenfang Chen
Qinhong Zhou
Songlin Yang
Yining Hong
Hao Zhang
Chuang Gan
LRMVLM
107
41
0
12 Jan 2023
Exploring Efficient Few-shot Adaptation for Vision Transformers
Exploring Efficient Few-shot Adaptation for Vision Transformers
C. Xu
Siqian Yang
Yabiao Wang
Zhanxiong Wang
Yanwei Fu
Xiangyang Xue
97
17
0
06 Jan 2023
Unleashing the Power of Visual Prompting At the Pixel Level
Unleashing the Power of Visual Prompting At the Pixel Level
Junyang Wu
Xianhang Li
Chen Wei
Huiyu Wang
Alan Yuille
Yuyin Zhou
Cihang Xie
VPVLMVLM
97
32
0
20 Dec 2022
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image
  Transformers Help 3D Representation Learning?
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
Runpei Dong
Zekun Qi
Linfeng Zhang
Junbo Zhang
Jian‐Yuan Sun
Zheng Ge
Li Yi
Kaisheng Ma
ViT3DPC
108
91
0
16 Dec 2022
Understanding Zero-Shot Adversarial Robustness for Large-Scale Models
Understanding Zero-Shot Adversarial Robustness for Large-Scale Models
Chengzhi Mao
Scott Geng
Junfeng Yang
Xin Eric Wang
Carl Vondrick
VLM
98
71
0
14 Dec 2022
Doubly Right Object Recognition: A Why Prompt for Visual Rationales
Doubly Right Object Recognition: A Why Prompt for Visual Rationales
Chengzhi Mao
Revant Teotia
Amrutha Sundar
Sachit Menon
Junfeng Yang
Xin Eric Wang
Carl Vondrick
64
31
0
12 Dec 2022
PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for
  Generalized Novel Category Discovery
PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery
Shengxiang Zhang
Salman Khan
Zhiqiang Shen
Muzammal Naseer
Guangyi Chen
Fahad Shahbaz Khan
CLLVLM
71
63
0
11 Dec 2022
PromptonomyViT: Multi-Task Prompt Learning Improves Video Transformers
  using Synthetic Scene Data
PromptonomyViT: Multi-Task Prompt Learning Improves Video Transformers using Synthetic Scene Data
Roei Herzig
Ofir Abramovich
Elad Ben-Avraham
Assaf Arbelle
Leonid Karlinsky
Ariel Shamir
Trevor Darrell
Amir Globerson
138
18
0
08 Dec 2022
Vision and Structured-Language Pretraining for Cross-Modal Food
  Retrieval
Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval
Mustafa Shukor
Nicolas Thome
Matthieu Cord
CLIPCoGe
81
9
0
08 Dec 2022
Learning Domain Invariant Prompt for Vision-Language Models
Learning Domain Invariant Prompt for Vision-Language Models
Cairong Zhao
Yubin Wang
Xinyang Jiang
Yifei Shen
Kaitao Song
Dongsheng Li
Duoqian Miao
VLMVPVLM
88
27
0
08 Dec 2022
Decorate the Newcomers: Visual Domain Prompt for Continual Test Time
  Adaptation
Decorate the Newcomers: Visual Domain Prompt for Continual Test Time Adaptation
Yulu Gan
Yan Bai
Yihang Lou
Xianzheng Ma
Renrui Zhang
Nian Shi
Lin Luo
OODVLM
70
100
0
08 Dec 2022
iQuery: Instruments as Queries for Audio-Visual Sound Separation
iQuery: Instruments as Queries for Audio-Visual Sound Separation
Jiaben Chen
Renrui Zhang
Dongze Lian
Jiaqi Yang
Ziyao Zeng
Jianbo Shi
123
29
0
07 Dec 2022
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation
Ziqi Zhou
Bowen Zhang
Yinjie Lei
Lingqiao Liu
Yifan Liu
VLM
97
176
0
07 Dec 2022
Visual Query Tuning: Towards Effective Usage of Intermediate
  Representations for Parameter and Memory Efficient Transfer Learning
Visual Query Tuning: Towards Effective Usage of Intermediate Representations for Parameter and Memory Efficient Transfer Learning
Cheng-Hao Tu
Zheda Mai
Wei-Lun Chao
53
48
0
06 Dec 2022
FacT: Factor-Tuning for Lightweight Adaptation on Vision Transformer
FacT: Factor-Tuning for Lightweight Adaptation on Vision Transformer
Shibo Jie
Zhi-Hong Deng
80
137
0
06 Dec 2022
Day2Dark: Pseudo-Supervised Activity Recognition beyond Silent Daylight
Day2Dark: Pseudo-Supervised Activity Recognition beyond Silent Daylight
Yunhua Zhang
Hazel Doughty
Cees G. M. Snoek
VLM
113
0
0
05 Dec 2022
Cloud-Device Collaborative Adaptation to Continual Changing Environments
  in the Real-world
Cloud-Device Collaborative Adaptation to Continual Changing Environments in the Real-world
Yulu Gan
Mingjie Pan
Rongyu Zhang
Zijian Ling
Lingran Zhao
Jiaming Liu
Shanghang Zhang
VLM
73
15
0
02 Dec 2022
Isolation and Impartial Aggregation: A Paradigm of Incremental Learning
  without Interference
Isolation and Impartial Aggregation: A Paradigm of Incremental Learning without Interference
Yabin Wang
Zhiheng Ma
Zhiwu Huang
Yaowei Wang
Zhou Su
Xiaopeng Hong
90
43
0
29 Nov 2022
VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval
VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval
Siteng Huang
Biao Gong
Yulin Pan
Jianwen Jiang
Yiliang Lv
Yuyuan Li
Donglin Wang
VLMVPVLM
92
42
0
23 Nov 2022
Texts as Images in Prompt Tuning for Multi-Label Image Recognition
Texts as Images in Prompt Tuning for Multi-Label Image Recognition
Zixian Guo
Bowen Dong
Zhilong Ji
Jinfeng Bai
Yiwen Guo
W. Zuo
VLMVPVLM
119
60
0
23 Nov 2022
Multitask Vision-Language Prompt Tuning
Multitask Vision-Language Prompt Tuning
Sheng Shen
Shijia Yang
Tianjun Zhang
Bohan Zhai
Joseph E. Gonzalez
Kurt Keutzer
Trevor Darrell
VLMVPVLM
104
52
0
21 Nov 2022
PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning
PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning
Xiangyang Zhu
Renrui Zhang
Bowei He
Ziyu Guo
Ziyao Zeng
Zipeng Qin
Shanghang Zhang
Peng Gao
VLM
111
148
0
21 Nov 2022
Understanding and Improving Visual Prompting: A Label-Mapping
  Perspective
Understanding and Improving Visual Prompting: A Label-Mapping Perspective
Aochuan Chen
Yuguang Yao
Pin-Yu Chen
Yihua Zhang
Sijia Liu
VPVLMVLM
147
82
0
21 Nov 2022
ProSFDA: Prompt Learning based Source-free Domain Adaptation for Medical
  Image Segmentation
ProSFDA: Prompt Learning based Source-free Domain Adaptation for Medical Image Segmentation
Shishuai Hu
Zehui Liao
Yong-quan Xia
OODMedIm
95
22
0
21 Nov 2022
Cross-Modal Adapter for Text-Video Retrieval
Cross-Modal Adapter for Text-Video Retrieval
Haojun Jiang
Jianke Zhang
Rui Huang
Chunjiang Ge
Zanlin Ni
Jiwen Lu
Jie Zhou
S. Song
Gao Huang
134
38
0
17 Nov 2022
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Marc Fischer
Alexander Bartler
Bin Yang
SSeg
61
21
0
16 Nov 2022
FedTune: A Deep Dive into Efficient Federated Fine-Tuning with
  Pre-trained Transformers
FedTune: A Deep Dive into Efficient Federated Fine-Tuning with Pre-trained Transformers
Jinyu Chen
Wenchao Xu
Song Guo
Junxiao Wang
Jie Zhang
Yining Qi
FedML
83
36
0
15 Nov 2022
Demystify Self-Attention in Vision Transformers from a Semantic
  Perspective: Analysis and Application
Demystify Self-Attention in Vision Transformers from a Semantic Perspective: Analysis and Application
Leijie Wu
Song Guo
Yaohong Ding
Junxiao Wang
Wenchao Xu
Richard Yi Da Xu
Jiewei Zhang
49
2
0
13 Nov 2022
One-Time Model Adaptation to Heterogeneous Clients: An Intra-Client and
  Inter-Image Attention Design
One-Time Model Adaptation to Heterogeneous Clients: An Intra-Client and Inter-Image Attention Design
Yikai Yan
Chaoyue Niu
Fan Wu
Qinya Li
Shaojie Tang
Chengfei Lyu
Guihai Chen
77
0
0
11 Nov 2022
Integrated Parameter-Efficient Tuning for General-Purpose Audio Models
Integrated Parameter-Efficient Tuning for General-Purpose Audio Models
Ju-ho Kim
Ju-Sung Heo
Hyun-Seo Shin
Chanmann Lim
Ha-Jin Yu
28
5
0
04 Nov 2022
Understanding and Mitigating Overfitting in Prompt Tuning for
  Vision-Language Models
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models
Cheng Ma
Yang Liu
Jiankang Deng
Lingxi Xie
Weiming Dong
Changsheng Xu
VLMVPVLM
106
47
0
04 Nov 2022
Rethinking Hierarchies in Pre-trained Plain Vision Transformer
Rethinking Hierarchies in Pre-trained Plain Vision Transformer
Yufei Xu
Jing Zhang
Qiming Zhang
Dacheng Tao
83
1
0
03 Nov 2022
Learning a Condensed Frame for Memory-Efficient Video Class-Incremental
  Learning
Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning
Yixuan Pei
Zhiwu Qing
Jun Cen
Xiang Wang
Shiwei Zhang
Yaxiong Wang
Mingqian Tang
Nong Sang
Xueming Qian
56
13
0
02 Nov 2022
Towards Sustainable Self-supervised Learning
Towards Sustainable Self-supervised Learning
Shanghua Gao
Pan Zhou
Mingg-Ming Cheng
Shuicheng Yan
CLL
122
7
0
20 Oct 2022
Scaling & Shifting Your Features: A New Baseline for Efficient Model
  Tuning
Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning
Dongze Lian
Daquan Zhou
Jiashi Feng
Xinchao Wang
109
264
0
17 Oct 2022
Unified Vision and Language Prompt Learning
Unified Vision and Language Prompt Learning
Yuhang Zang
Wei Li
Kaiyang Zhou
Chen Huang
Chen Change Loy
VLMVPVLM
80
151
0
13 Oct 2022
Feature-Proxy Transformer for Few-Shot Segmentation
Feature-Proxy Transformer for Few-Shot Segmentation
Jianwei Zhang
Yifan Sun
Yi Yang
Wei Chen
ViT
75
63
0
13 Oct 2022
Continual Learning with Evolving Class Ontologies
Continual Learning with Evolving Class Ontologies
Zhiqiu Lin
Deepak Pathak
Yu-Xiong Wang
Deva Ramanan
Shu Kong
CLL
81
9
0
10 Oct 2022
FS-DETR: Few-Shot DEtection TRansformer with prompting and without
  re-training
FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training
Adrian Bulat
Ricardo Guerrero
Brais Martínez
Georgios Tzimiropoulos
96
31
0
10 Oct 2022
Visual Prompt Tuning for Test-time Domain Adaptation
Visual Prompt Tuning for Test-time Domain Adaptation
Yunhe Gao
Xingjian Shi
Yi Zhu
Hongya Wang
Zhiqiang Tang
Xiong Zhou
Mu Li
Dimitris N. Metaxas
VPVLMVLM
176
89
0
10 Oct 2022
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP
Feng Liang
Bichen Wu
Xiaoliang Dai
Kunpeng Li
Yinan Zhao
Hang Zhang
Peizhao Zhang
Peter Vajda
Diana Marculescu
CLIPVLM
130
459
0
09 Oct 2022
Polyhistor: Parameter-Efficient Multi-Task Adaptation for Dense Vision
  Tasks
Polyhistor: Parameter-Efficient Multi-Task Adaptation for Dense Vision Tasks
Yen-Cheng Liu
Chih-Yao Ma
Junjiao Tian
Zijian He
Z. Kira
160
52
0
07 Oct 2022
Previous
123...202122
Next