ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.12119
  4. Cited By
Visual Prompt Tuning
v1v2 (latest)

Visual Prompt Tuning

23 March 2022
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
    VLMVPVLM
ArXiv (abs)PDFHTML

Papers citing "Visual Prompt Tuning"

50 / 1,088 papers shown
Title
Parameter-Efficient Fine-Tuning via Circular Convolution
Parameter-Efficient Fine-Tuning via Circular Convolution
Aochuan Chen
Jiashun Cheng
Zijing Liu
Ziqi Gao
Fugee Tsung
Yu-Feng Li
Jia Li
146
3
0
27 Jul 2024
Enhancing Model Performance: Another Approach to Vision-Language
  Instruction Tuning
Enhancing Model Performance: Another Approach to Vision-Language Instruction Tuning
Vedanshu
M. M. Tripathi
Bhavnesh Jaint
MLLMVLM
45
0
0
25 Jul 2024
SFPrompt: Communication-Efficient Split Federated Fine-Tuning for Large
  Pre-Trained Models over Resource-Limited Devices
SFPrompt: Communication-Efficient Split Federated Fine-Tuning for Large Pre-Trained Models over Resource-Limited Devices
Linxiao Cao
Yifei Zhu
Wei Gong
FedML
60
4
0
24 Jul 2024
Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective
Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective
Jingren Liu
Zhong Ji
YunLong Yu
Jiale Cao
Yanwei Pang
Jungong Han
Xuelong Li
CLL
142
5
0
24 Jul 2024
Rethinking Out-of-Distribution Detection on Imbalanced Data Distribution
Rethinking Out-of-Distribution Detection on Imbalanced Data Distribution
Kai-Chun Liu
Zhihang Fu
Sheng Jin
Chao Chen
Ze Chen
Rongxin Jiang
Fan Zhou
Yao-Shen Chen
Jieping Ye
OODD
84
0
0
23 Jul 2024
Tackling Feature-Classifier Mismatch in Federated Learning via
  Prompt-Driven Feature Transformation
Tackling Feature-Classifier Mismatch in Federated Learning via Prompt-Driven Feature Transformation
Xinghao Wu
Jianwei Niu
Xuefeng Liu
Mingjia Shi
Guogang Zhu
Shaojie Tang
146
2
0
23 Jul 2024
AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description
AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description
Junyu Xie
Tengda Han
Max Bain
Arsha Nagrani
Gül Varol
Weidi Xie
Andrew Zisserman
VGen
74
10
0
22 Jul 2024
Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot
  Generalization of Vision-Language Models
Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Vision-Language Models
Raza Imam
Hanan Gani
Muhammad Huzaifa
Karthik Nandakumar
VLM
76
4
0
22 Jul 2024
AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot
  Anomaly Detection
AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection
Yunkang Cao
Jiangning Zhang
Luca Frittoli
Yuqi Cheng
Nong Sang
Giacomo Boracchi
VLM
113
42
0
22 Jul 2024
CLIP with Generative Latent Replay: a Strong Baseline for Incremental
  Learning
CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learning
Emanuele Frascaroli
Aniello Panariello
Pietro Buzzega
Lorenzo Bonicelli
Angelo Porrello
Simone Calderara
VLMCLL
78
6
0
22 Jul 2024
DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving
DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving
Jiahang Tu
Wei Ji
Han Zhao
Chao Zhang
Roger Zimmermann
Hui Qian
72
8
0
22 Jul 2024
Craft: Cross-modal Aligned Features Improve Robustness of Prompt Tuning
Craft: Cross-modal Aligned Features Improve Robustness of Prompt Tuning
Jingchen Sun
Rohan Sharma
Vishnu Suresh Lokhande
Changyou Chen
96
0
0
22 Jul 2024
Rethinking Domain Adaptation and Generalization in the Era of CLIP
Rethinking Domain Adaptation and Generalization in the Era of CLIP
Ruoyu Feng
Tao Yu
Xin Jin
Xiaoyuan Yu
Lei Xiao
Zhibo Chen
VLM
104
2
0
21 Jul 2024
Navigation Instruction Generation with BEV Perception and Large Language
  Models
Navigation Instruction Generation with BEV Perception and Large Language Models
Sheng Fan
Rui Liu
Wenguan Wang
Yi Yang
94
9
0
21 Jul 2024
Learn to Preserve and Diversify: Parameter-Efficient Group with
  Orthogonal Regularization for Domain Generalization
Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization
Jiajun Hu
Jian Zhang
Lei Qi
Yinghuan Shi
Yang Gao
OOD
69
6
0
21 Jul 2024
Straightforward Layer-wise Pruning for More Efficient Visual Adaptation
Straightforward Layer-wise Pruning for More Efficient Visual Adaptation
Ruizi Han
Jinglei Tang
101
1
0
19 Jul 2024
Dyn-Adapter: Towards Disentangled Representation for Efficient Visual
  Recognition
Dyn-Adapter: Towards Disentangled Representation for Efficient Visual Recognition
Yurong Zhang
Honghao Chen
Xinyu Zhang
Xiangxiang Chu
Li Song
100
1
0
19 Jul 2024
CoAPT: Context Attribute words for Prompt Tuning
CoAPT: Context Attribute words for Prompt Tuning
Gun Lee
Subin An
Sungyong Baik
Soochahn Lee
VPVLMVLM
62
1
0
18 Jul 2024
Adapt PointFormer: 3D Point Cloud Analysis via Adapting 2D Visual
  Transformers
Adapt PointFormer: 3D Point Cloud Analysis via Adapting 2D Visual Transformers
Mengke Li
Da Li
Guoqing Yang
Yiu-ming Cheung
Hui Huang
3DPC
106
2
0
18 Jul 2024
UCIP: A Universal Framework for Compressed Image Super-Resolution using
  Dynamic Prompt
UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt
Xin Li
Bingchen Li
Yeying Jin
Cuiling Lan
Hanxin Zhu
Yulin Ren
Zhibo Chen
98
8
0
18 Jul 2024
Missing Modality Prediction for Unpaired Multimodal Learning via Joint
  Embedding of Unimodal Models
Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models
Donggeun Kim
Taesup Kim
76
4
0
17 Jul 2024
VCP-CLIP: A visual context prompting model for zero-shot anomaly
  segmentation
VCP-CLIP: A visual context prompting model for zero-shot anomaly segmentation
Zhen Qu
Xian Tao
Mukesh Prasad
Fei Shen
Zhengtao Zhang
Xinyi Gong
Guiguang Ding
VLM
100
16
0
17 Jul 2024
Encapsulating Knowledge in One Prompt
Encapsulating Knowledge in One Prompt
Qi Li
Runpeng Yu
Xinchao Wang
VLMKELM
76
3
0
16 Jul 2024
Rate-Distortion-Cognition Controllable Versatile Neural Image
  Compression
Rate-Distortion-Cognition Controllable Versatile Neural Image Compression
Jinming Liu
Ruoyu Feng
Yunpeng Qi
Qiuyu Chen
Zhibo Chen
Wenjun Zeng
Xin Jin
84
2
0
16 Jul 2024
MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models
MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models
Hongrong Cheng
Miao Zhang
J. Q. Shi
105
3
0
16 Jul 2024
Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of
  Vision Transformers for Medical Image Classification
Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification
Naif Alkhunaizi
Faris Almalik
Rouqaiah Al-Refai
Muzammal Naseer
Karthik Nandakumar
MedIm
104
2
0
16 Jul 2024
SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language
  Pre-trained Models
SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models
Yang Zhou
Yongjian Wu
Jiya Saiyin
Bingzheng Wei
Maode Lai
Eric Chang
Yan Xu
VLM
87
1
0
16 Jul 2024
DataDream: Few-shot Guided Dataset Generation
DataDream: Few-shot Guided Dataset Generation
Jae Myung Kim
Jessica Bader
Stephan Alaniz
Cordelia Schmid
Zeynep Akata
89
7
0
15 Jul 2024
MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed
  Image Restoration
MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration
Yulin Ren
Xin Li
Bingchen Li
Xingrui Wang
Mengxi Guo
Shijie Zhao
Li Zhang
Zhibo Chen
DiffM
126
7
0
15 Jul 2024
Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot
  Whole Slide Image Classification
Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification
Linhao Qu
Dingkang Yang
Dan Huang
Qinhao Guo
Rongkui Luo
Shaoting Zhang
Xiaosong Wang
VLM
120
8
0
15 Jul 2024
Quantized Prompt for Efficient Generalization of Vision-Language Models
Quantized Prompt for Efficient Generalization of Vision-Language Models
Tianxiang Hao
Xiaohan Ding
Juexiao Feng
Yuhong Yang
Hui Chen
Guiguang Ding
VLMMQ
94
5
0
15 Jul 2024
Beyond Prompt Learning: Continual Adapter for Efficient Rehearsal-Free
  Continual Learning
Beyond Prompt Learning: Continual Adapter for Efficient Rehearsal-Free Continual Learning
Xinyuan Gao
Songlin Dong
Yuhang He
Qiang Wang
Yihong Gong
CLL
116
19
0
14 Jul 2024
Image Compression for Machine and Human Vision with Spatial-Frequency
  Adaptation
Image Compression for Machine and Human Vision with Spatial-Frequency Adaptation
Han Li
Shaohui Li
Shuangrui Ding
Wenrui Dai
Maida Cao
Chenglin Li
Junni Zou
Hongkai Xiong
VLM
106
8
0
13 Jul 2024
Enhancing Robustness of Vision-Language Models through Orthogonality
  Learning and Cross-Regularization
Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization
Jinlong Li
Zequn Jie
Elisa Ricci
Lin Ma
N. Sebe
VLM
101
0
0
11 Jul 2024
SHERL: Synthesizing High Accuracy and Efficient Memory for
  Resource-Limited Transfer Learning
SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning
Haiwen Diao
Bo Wan
Xu Jia
Yunzhi Zhuge
Ying Zhang
Huchuan Lu
Long Chen
VLM
93
4
0
10 Jul 2024
Parameter Efficient Fine Tuning for Multi-scanner PET to PET
  Reconstruction
Parameter Efficient Fine Tuning for Multi-scanner PET to PET Reconstruction
Yumin Kim
Gayoon Choi
Seong Jae Hwang
68
0
0
10 Jul 2024
Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer:
  A Disentangled Approach
Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach
Taolin Zhang
Jiawang Bai
Zhihe Lu
Dongze Lian
Genping Wang
Xinchao Wang
Shu-Tao Xia
95
5
0
09 Jul 2024
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
Rui Qian
Shuangrui Ding
Dahua Lin
OCL
96
1
0
09 Jul 2024
Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for
  Text-to-Video Generation Task
Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task
Yiran Yang
Jinchao Zhang
Ying Deng
Jie Zhou
DiffM
55
0
0
09 Jul 2024
Reprogramming Distillation for Medical Foundation Models
Reprogramming Distillation for Medical Foundation Models
Yuhang Zhou
Siyuan Du
Haolin Li
Jiangchao Yao
Ya Zhang
Yanfeng Wang
74
2
0
09 Jul 2024
Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot
  Classification
Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot Classification
Jiaying Shi
Xuetong Xue
Shenghui Xu
VLM
138
0
0
08 Jul 2024
FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot
  Performance
FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
Jiedong Zhuang
Jiaqi Hu
Lianrui Mu
Rui Hu
Xiaoyu Liang
Jiangnan Ye
Haoji Hu
CLIPVLM
95
4
0
08 Jul 2024
Mind the Interference: Retaining Pre-trained Knowledge in Parameter
  Efficient Continual Learning of Vision-Language Models
Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models
Longxiang Tang
Zhuotao Tian
Kai Li
Chunming He
Hantao Zhou
Hengshuang Zhao
Xiu Li
Jiaya Jia
CLLVLM
103
24
0
07 Jul 2024
AMD: Automatic Multi-step Distillation of Large-scale Vision Models
AMD: Automatic Multi-step Distillation of Large-scale Vision Models
Cheng Han
Qifan Wang
S. Dianat
Majid Rabbani
Raghuveer M. Rao
Yi Fang
Qiang Guan
Lifu Huang
Dongfang Liu
VLM
80
5
0
05 Jul 2024
Fully Fine-tuned CLIP Models are Efficient Few-Shot Learners
Fully Fine-tuned CLIP Models are Efficient Few-Shot Learners
Mushui Liu
Bozheng Li
Yunlong Yu
VLMCLIP
43
3
0
04 Jul 2024
Do Generalised Classifiers really work on Human Drawn Sketches?
Do Generalised Classifiers really work on Human Drawn Sketches?
Hmrishav Bandyopadhyay
Pinaki Nath Chowdhury
Aneeshan Sain
Subhadeep Koley
Tao Xiang
A. Bhunia
Yi-Zhe Song
VLM
81
2
0
04 Jul 2024
PECTP: Parameter-Efficient Cross-Task Prompts for Incremental Vision
  Transformer
PECTP: Parameter-Efficient Cross-Task Prompts for Incremental Vision Transformer
Qian Feng
Hanbin Zhao
Chao Zhang
Jiahua Dong
Henghui Ding
Yu-Gang Jiang
Hui Qian
VLM
71
5
0
04 Jul 2024
ASteISR: Adapting Single Image Super-resolution Pre-trained Model for
  Efficient Stereo Image Super-resolution
ASteISR: Adapting Single Image Super-resolution Pre-trained Model for Efficient Stereo Image Super-resolution
Yuanbo Zhou
Yuyang Xue
Wei Deng
Xinlin Zhang
Qinquan Gao
Tong Tong
91
0
0
04 Jul 2024
Modality-Specialized Synergizers for Interleaved Vision-Language Generalists
Modality-Specialized Synergizers for Interleaved Vision-Language Generalists
Zhiyang Xu
Minqian Liu
Ying Shen
Joy Rimchala
Jiaxin Zhang
Qifan Wang
Yu Cheng
Lifu Huang
VLM
90
6
0
04 Jul 2024
Robust Adaptation of Foundation Models with Black-Box Visual Prompting
Robust Adaptation of Foundation Models with Black-Box Visual Prompting
Changdae Oh
Gyeongdeok Seo
Geunyoung Jung
Zhi-Qi Cheng
Hosik Choi
Jiyoung Jung
Kyungwoo Song
VLM
125
1
0
04 Jul 2024
Previous
123...789...202122
Next