Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.12119
Cited By
v1
v2 (latest)
Visual Prompt Tuning
23 March 2022
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
VLM
VPVLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Visual Prompt Tuning"
50 / 1,088 papers shown
Title
Frequency-Guided Spatial Adaptation for Camouflaged Object Detection
Shizhou Zhang
Dexuan Kong
Yinghui Xing
Yue Lu
Lingyan Ran
Guoqiang Liang
Hexu Wang
Yanning Zhang
96
9
0
19 Sep 2024
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting
Yongqi Wang
Xinxiao Wu
Shuo Yang
Jiebo Luo
458
1
0
19 Sep 2024
LPT++: Efficient Training on Mixture of Long-tailed Experts
Bowen Dong
Pan Zhou
W. Zuo
VLM
75
0
0
17 Sep 2024
Down-Sampling Inter-Layer Adapter for Parameter and Computation Efficient Ultra-Fine-Grained Image Recognition
Edwin Arkel Rios
Femiloye Oyerinde
Min-Chun Hu
Bo-Cheng Lai
64
0
0
17 Sep 2024
Prompt-and-Transfer: Dynamic Class-aware Enhancement for Few-shot Segmentation
Hanbo Bi
Yingchao Feng
Wenhui Diao
Peijin Wang
Yongqiang Mao
Kun Fu
Hongqi Wang
Xian Sun
VLM
80
5
0
16 Sep 2024
Automatic Scene Generation: State-of-the-Art Techniques, Models, Datasets, Challenges, and Future Prospects
Awal Ahmed Fime
Saifuddin Mahmud
Arpita Das
Md. Sunzidul Islam
Hong-Hoon Kim
VGen
3DV
34
1
0
14 Sep 2024
SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality
Chenyang Lei
Liyi Chen
Jun Cen
Xiao Chen
Zhen Lei
Felix Heide
Ziwei Liu
Qifeng Chen
Zhaoxiang Zhang
92
0
0
12 Sep 2024
Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region
Muhammad Akhtar Munir
Fahad Shahbaz Khan
Salman Khan
36
1
0
11 Sep 2024
Self-Masking Networks for Unsupervised Adaptation
Alfonso Taboada Warmerdam
Mathilde Caron
Yuki M. Asano
81
2
0
11 Sep 2024
Insight Any Instance: Promptable Instance Segmentation for Remote Sensing Images
Xuexue Li
VLM
ISeg
85
0
0
11 Sep 2024
ExIQA: Explainable Image Quality Assessment Using Distortion Attributes
Sepehr Kazemi Ranjbar
Emad Fatemizadeh
82
1
0
10 Sep 2024
Revisiting Prompt Pretraining of Vision-Language Models
Zhenyuan Chen
Lingfeng Yang
Shuo Chen
Zhaowei Chen
Jiajun Liang
Xiang Li
MLLM
VPVLM
VLM
119
2
0
10 Sep 2024
Efficient Training of Large Vision Models via Advanced Automated Progressive Learning
Changlin Li
Jiawei Zhang
Sihao Lin
Zongxin Yang
Junwei Liang
Xiaodan Liang
Xiaojun Chang
VLM
65
0
0
06 Sep 2024
Text-Guided Mixup Towards Long-Tailed Image Categorization
Richard Franklin
Jiawei Yao
Deyang Zhong
Qi Qian
Juhua Hu
VLM
70
0
0
05 Sep 2024
MobileUNETR: A Lightweight End-To-End Hybrid Vision Transformer For Efficient Medical Image Segmentation
Shehan Perera
Yunus Erzurumlu
Deepak Gulati
Alper Yilmaz
ViT
MedIm
61
0
0
04 Sep 2024
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation
Hayeon Jo
Hyesong Choi
Minhee Cho
Dongbo Min
124
2
0
04 Sep 2024
When Does Visual Prompting Outperform Linear Probing for Vision-Language Models? A Likelihood Perspective
Hsi-Ai Tsao
Lei Hsiung
Pin-Yu Chen
Tsung-Yi Ho
VPVLM
LRM
VLM
67
0
0
03 Sep 2024
Towards General Industrial Intelligence: A Survey on IIoT-Enhanced Continual Large Models
Jiao Chen
Jiayi He
Fangfang Chen
Zuohong Lv
Jianhua Tang
Weihua Li
Zuozhu Liu
Howard H. Yang
Guangjie Han
AI4CE
79
1
0
02 Sep 2024
A Novel Hybrid Parameter-Efficient Fine-Tuning Approach for Hippocampus Segmentation and Alzheimer's Disease Diagnosis
Wangang Cheng
Guanghua He
Keli Hu
Mingyu Fang
Liang Dong
Zhong Li
Hancan Zhu
81
0
0
02 Sep 2024
Affordance-based Robot Manipulation with Flow Matching
Fan Zhang
Michael Gienger
167
14
0
02 Sep 2024
TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval
Leqi Shen
Tianxiang Hao
Tao He
Sicheng Zhao
Pengzhang Liu
Yongjun Bao
Guiguang Ding
Guiguang Ding
264
15
0
02 Sep 2024
DLM-VMTL:A Double Layer Mapper for heterogeneous data video Multi-task prompt learning
Zeyi Bo
Wuxi Sun
Ye Jin
VLM
104
0
0
29 Aug 2024
Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image Restoration
Xu Zhang
Jiaqi Ma
Guoli Wang
Qian Zhang
Huan Zhang
Lefei Zhang
VLM
183
10
0
28 Aug 2024
CVPT: Cross-Attention help Visual Prompt Tuning adapt visual task
Lingyun Huang
Jianxu Mao
Yaonan Wang
Junfei Yi
Ziming Tao
VLM
VPVLM
88
2
0
27 Aug 2024
HPT++: Hierarchically Prompting Vision-Language Models with Multi-Granularity Knowledge Generation and Improved Structure Modeling
Yubin Wang
Xinyang Jiang
De Cheng
Wenli Sun
Dongsheng Li
Cairong Zhao
VLM
100
0
0
27 Aug 2024
RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW Images
Ziteng Cui
Tatsuya Harada
84
8
0
27 Aug 2024
Dual-Path Adversarial Lifting for Domain Shift Correction in Online Test-time Adaptation
Yushun Tang
Shuoshuo Chen
Zhihe Lu
Xinchao Wang
Zhihai He
108
1
0
26 Aug 2024
Nemesis: Normalizing the Soft-prompt Vectors of Vision-Language Models
Shuai Fu
Xiequn Wang
Qiushi Huang
Yu Zhang
VLM
62
2
0
26 Aug 2024
AnoPLe: Few-Shot Anomaly Detection via Bi-directional Prompt Learning with Only Normal Samples
Yujin Lee
Seoyoon Jang
Hyunsoo Yoon
58
0
0
24 Aug 2024
SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks
Kai-Wei Chang
Haibin Wu
Yu-Kai Wang
Yuan-Kuei Wu
Hua Shen
Wei-Cheng Tseng
Iu-thing Kang
Shang-Wen Li
Hung-yi Lee
93
3
0
23 Aug 2024
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models
Wentao Wu
Fanghua Hong
Xiao Wang
Chenglong Li
Jin Tang
VLM
91
1
0
23 Aug 2024
Open-Set Deepfake Detection: A Parameter-Efficient Adaptation Method with Forgery Style Mixture
Chenqi Kong
Anwei Luo
Peijun Bao
Haoliang Li
Renjie Wan
Zengwei Zheng
Anderson de Rezende Rocha
Alex C. Kot
AAML
56
4
0
23 Aug 2024
Cross-Domain Foundation Model Adaptation: Pioneering Computer Vision Models for Geophysical Data Analysis
Zhixiang Guo
Xinming Wu
Luming Liang
Hanlin Sheng
Nuo Chen
Zhengfa Bi
AI4CE
96
4
0
22 Aug 2024
Segment Anything with Multiple Modalities
Aoran Xiao
Weihao Xuan
Heli Qi
Yun Xing
Naoto Yokoya
Shijian Lu
VLM
93
7
0
17 Aug 2024
SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training
Gengwei Zhang
Liyuan Wang
Guoliang Kang
Ling Chen
Yunchao Wei
VLM
CLL
68
7
0
15 Aug 2024
Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach
Shizhou Zhang
Wenlong Luo
De Cheng
Qingchun Yang
Lingyan Ran
Yinghui Xing
Yanning Zhang
VOS
89
6
0
14 Aug 2024
Token Compensator: Altering Inference Cost of Vision Transformer without Re-Tuning
Shibo Jie
Yehui Tang
Jianyuan Guo
Zhi-Hong Deng
Kai Han
Yunhe Wang
VLM
60
4
0
13 Aug 2024
Efficient and Versatile Robust Fine-Tuning of Zero-shot Models
Sungyeon Kim
Boseung Jeong
Donghyun Kim
Suha Kwak
VLM
88
3
0
11 Aug 2024
FastEdit: Fast Text-Guided Single-Image Editing via Semantic-Aware Diffusion Fine-Tuning
Zhi Chen
Zecheng Zhao
Yadan Luo
Zi Huang
DiffM
65
4
0
06 Aug 2024
Fairness and Bias Mitigation in Computer Vision: A Survey
Sepehr Dehdashtian
Ruozhen He
Yi Li
Guha Balakrishnan
Nuno Vasconcelos
Vicente Ordonez
Vishnu Boddeti
137
5
0
05 Aug 2024
TS-SAM: Fine-Tuning Segment-Anything Model for Downstream Tasks
Yang Yu
Cheng-Zhong Xu
Kai Wang
VLM
80
3
0
03 Aug 2024
Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers
Weijie Zheng
Xingjun Ma
Hanxun Huang
Zuxuan Wu
Yu-Gang Jiang
AAML
102
0
0
03 Aug 2024
Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration
Donwon Park
Leixian Shen
Se Young Chun
89
2
0
02 Aug 2024
Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for Continual Learning
Lu Yu
Hesong Li
Ying Fu
Joost van de Weijer
Bartłomiej Twardowski
Joost van de Weijer
Changsheng Xu
CLL
95
1
0
02 Aug 2024
A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation
Mothilal Asokan
Difei Gao
Joya Chen
Mike Zheng Shou
FedML
MedIm
90
1
0
31 Jul 2024
ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models
Ming-Kuan Wu
Xinyue Cai
Jiayi Ji
Jiale Li
Oucheng Huang
Gen Luo
Hao Fei
Xiaoshuai Sun
Rongrong Ji
MLLM
158
13
0
31 Jul 2024
Knowledge-Guided Prompt Learning for Lifespan Brain MR Image Segmentation
Shiyuan Chen
Zihao Zhao
Jiawei Huang
Yingfeng Cai
Runqi Meng
Wei Sui
Dinggang Shen
VLM
52
3
0
31 Jul 2024
Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey
Atsuyuki Miyai
Jingkang Yang
Jingyang Zhang
Yifei Ming
Sisir Dhakal
...
Yixuan Li
Hai "Helen" Li
Ziwei Liu
Toshihiko Yamasaki
Kiyoharu Aizawa
128
13
0
31 Jul 2024
Forecast-PEFT: Parameter-Efficient Fine-Tuning for Pre-trained Motion Forecasting Models
Jifeng Wang
Kaouther Messaoud
Yuejiang Liu
Juergen Gall
Alexandre Alahi
69
1
0
28 Jul 2024
Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation
Tz-Ying Wu
Kyle Min
Subarna Tripathi
Nuno Vasconcelos
EgoV
144
0
0
28 Jul 2024
Previous
1
2
3
...
6
7
8
...
20
21
22
Next