Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.01134
Cited By
Learning to Prompt for Vision-Language Models
2 September 2021
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning to Prompt for Vision-Language Models"
50 / 391 papers shown
Title
Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts
Peng Wu
Xuerong Zhou
Guansong Pang
Zhiwei Yang
Qingsen Yan
Peng Wang
Yanning Zhang
28
9
0
12 Aug 2024
Robust Domain Generalization for Multi-modal Object Recognition
Yuxin Qiao
Keqin Li
Junhong Lin
Rong Wei
Chufeng Jiang
Yang Luo
Haoyu Yang
VLM
39
26
0
11 Aug 2024
Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for Continual Learning
Lu Yu
Hesong Li
Ying Fu
J. Weijer
Changsheng Xu
CLL
55
1
0
02 Aug 2024
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
Anurag Das
Xinting Hu
Li Jiang
Bernt Schiele
VLM
46
3
0
31 Jul 2024
Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks
Hunmin Yang
Jongoh Jeong
Kuk-Jin Yoon
AAML
VLM
60
4
0
30 Jul 2024
Advancing Prompt Learning through an External Layer
Fangming Cui
Xun Yang
Chao Wu
Liang Xiao
Xinmei Tian
VLM
38
1
0
29 Jul 2024
Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation
Tz-Ying Wu
Kyle Min
Subarna Tripathi
Nuno Vasconcelos
EgoV
55
0
0
28 Jul 2024
BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation
Peng Hao
Xiaobing Wang
Yingying Jiang
Hanchao Jia
Xiaoshuai Hao
Shaowei Cui
Junhang Wei
Xiaoshuai Hao
57
3
0
26 Jul 2024
LoRA-Pro: Are Low-Rank Adapters Properly Optimized?
Zhengbo Wang
Jian Liang
Ran He
Zilei Wang
Tieniu Tan
55
15
0
25 Jul 2024
Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective
Jingren Liu
Zhong Ji
Yunlong Yu
Jiale Cao
Yanwei Pang
Jungong Han
X. Li
CLL
39
3
0
24 Jul 2024
AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection
Yunkang Cao
Jiangning Zhang
Luca Frittoli
Yuqi Cheng
Nong Sang
Giacomo Boracchi
VLM
48
29
0
22 Jul 2024
CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learning
Emanuele Frascaroli
Aniello Panariello
Pietro Buzzega
Lorenzo Bonicelli
Angelo Porrello
Simone Calderara
VLM
CLL
35
3
0
22 Jul 2024
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation
Tong Shao
Zhuotao Tian
Hang Zhao
Jingyong Su
VLM
36
15
0
11 Jul 2024
The Solution for Language-Enhanced Image New Category Discovery
Haonan Xu
Dian Chao
Xiangyu Wu
Zhonghua Wan
Yang Yang
VLM
35
0
0
06 Jul 2024
CLIPVQA:Video Quality Assessment via CLIP
Fengchuang Xing
Mingjie Li
Yuan-Gen Wang
Guopu Zhu
Xiaochun Cao
CLIP
ViT
40
4
0
06 Jul 2024
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Yuhan Zhu
Yuyang Ji
Zhiyu Zhao
Gangshan Wu
Limin Wang
VLM
39
7
0
05 Jul 2024
Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning
Mainak Singha
Ankit Jha
Divyam Gupta
Pranav Singla
Biplab Banerjee
VLM
32
0
0
05 Jul 2024
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
Lukas Mauch
Marzieh Edraki
Aaron Courville
OODD
CLL
VLM
54
3
0
03 Jul 2024
GalLoP: Learning Global and Local Prompts for Vision-Language Models
Marc Lafon
Elias Ramzi
Clément Rambour
Nicolas Audebert
Nicolas Thome
VLM
41
8
0
01 Jul 2024
CPT: Consistent Proxy Tuning for Black-box Optimization
Yuanyang He
Zitong Huang
Xinxing Xu
Rick Siow Mong Goh
Salman Khan
W. Zuo
Yong Liu
Chun-Mei Feng
42
0
0
01 Jul 2024
Embedded Visual Prompt Tuning
Wenqiang Zu
Shenghao Xie
Qing Zhao
Guoqi Li
Lei Ma
VLM
MedIm
49
9
0
01 Jul 2024
Learning Visual Conditioning Tokens to Correct Domain Shift for Fully Test-time Adaptation
Yushun Tang
Shuoshuo Chen
Zhehan Kan
Yi Zhang
Qinghai Guo
Zhihai He
51
2
0
27 Jun 2024
Personalized Federated Continual Learning via Multi-granularity Prompt
Hao Yu
Xin Yang
Xin Gao
Yan Kang
Hao Wang
Junbo Zhang
Tianrui Li
CLL
51
6
0
27 Jun 2024
Open-Vocabulary Temporal Action Localization using Multimodal Guidance
Akshita Gupta
Aditya Arora
Sanath Narayan
Salman Khan
F. Khan
Graham W. Taylor
38
3
0
21 Jun 2024
DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection
Jia Syuen Lim
Zhuoxiao Chen
Mahsa Baktashmotlagh
Zhi Chen
Xin Yu
Zi Huang
Yadan Luo
VLM
ObjD
82
1
0
21 Jun 2024
MAC: A Benchmark for Multiple Attributes Compositional Zero-Shot Learning
Shuo Xu
Sai Wang
Xinyue Hu
Yutian Lin
Bo Du
Yu Wu
CoGe
56
0
0
18 Jun 2024
Enhancing Domain Adaptation through Prompt Gradient Alignment
Hoang Phan
Lam C. Tran
Quyen Tran
Trung Le
52
0
0
13 Jun 2024
Language-guided Detection and Mitigation of Unknown Dataset Bias
Zaiying Zhao
Soichiro Kumano
Toshihiko Yamasaki
38
2
0
05 Jun 2024
Proxy Denoising for Source-Free Domain Adaptation
Song Tang
Wenxin Su
Mao Ye
Jianwei Zhang
Xiatian Zhu
Xiatian Zhu
67
1
0
03 Jun 2024
Auto-selected Knowledge Adapters for Lifelong Person Re-identification
Xuelin Qian
Ruiqi Wu
Gong Cheng
Junwei Han
CLL
52
2
0
29 May 2024
LLaMA-Reg: Using LLaMA 2 for Unsupervised Medical Image Registration
Mingrui Ma
Yu Yang
LM&MA
34
2
0
29 May 2024
Synergy and Diversity in CLIP: Enhancing Performance Through Adaptive Backbone Ensembling
Cristian Rodriguez-Opazo
Ehsan Abbasnejad
Damien Teney
Edison Marrese-Taylor
Hamed Damirchi
Anton Van Den Hengel
VLM
40
1
0
27 May 2024
Disease-informed Adaptation of Vision-Language Models
Jiajin Zhang
Ge Wang
M. Kalra
P. Yan
VLM
46
2
0
24 May 2024
Learning Invariant Causal Mechanism from Vision-Language Models
Changwen Zheng
Siyu Zhao
Xingyu Zhang
Jiangmeng Li
Changwen Zheng
Jingyao Wang
CML
BDL
VLM
42
0
0
24 May 2024
Like Humans to Few-Shot Learning through Knowledge Permeation of Vision and Text
Yuyu Jia
Qing Zhou
Wei Huang
Junyu Gao
Qi. Wang
VLM
37
1
0
21 May 2024
Promoting AI Equity in Science: Generalized Domain Prompt Learning for Accessible VLM Research
Qinglong Cao
Yuntian Chen
Lu Lu
Hao Sun
Zhenzhong Zeng
Xiaokang Yang
Dong-juan Zhang
VLM
31
1
0
14 May 2024
SpeechVerse: A Large-scale Generalizable Audio Language Model
Nilaksh Das
Saket Dingliwal
S. Ronanki
Rohit Paturi
David Huang
...
Monica Sunkara
S. Srinivasan
Kyu J. Han
Katrin Kirchhoff
Katrin Kirchhoff
41
37
0
14 May 2024
CLIP-Powered TASS: Target-Aware Single-Stream Network for Audio-Visual Question Answering
Yuanyuan Jiang
Jianqin Yin
45
1
0
13 May 2024
Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification
Yaoqin Ye
Junjie Zhang
Hongwei Shi
MedIm
VLM
49
0
0
10 May 2024
On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning?
Maxime Zanella
Ismail Ben Ayed
VLM
MLLM
50
23
0
03 May 2024
Understanding Retrieval-Augmented Task Adaptation for Vision-Language Models
Yifei Ming
Yixuan Li
VLM
39
7
0
02 May 2024
Open-Set Video-based Facial Expression Recognition with Human Expression-sensitive Prompting
Yuanyuan Liu
Yuxuan Huang
Shuyang Liu
Yibing Zhan
Zijing Chen
Zhe Chen
VLM
50
1
0
26 Apr 2024
Embracing Diversity: Interpretable Zero-shot classification beyond one vector per class
Mazda Moayeri
Michael G. Rabbat
Mark Ibrahim
Diane Bouchacourt
VLM
50
1
0
25 Apr 2024
Improving Multi-label Recognition using Class Co-Occurrence Probabilities
Samyak Rawlekar
Shubhang Bhatnagar
Vishnuvardhan Pogunulu Srinivasulu
Narendra Ahuja
VLM
37
5
0
24 Apr 2024
Boosting Architectural Generation via Prompts: Report
Xin Zhang
Wenwen Liu
AI4CE
29
1
0
24 Apr 2024
SHE-Net: Syntax-Hierarchy-Enhanced Text-Video Retrieval
Xuzheng Yu
Chen Jiang
Xingning Dong
Tian Gan
Ming Yang
Qingpei Guo
45
1
0
22 Apr 2024
FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality Localization
Zhaopeng Gu
Bingke Zhu
Guibo Zhu
Yingying Chen
Hao Li
Ming Tang
Jinqiao Wang
42
15
0
21 Apr 2024
LaPA: Latent Prompt Assist Model For Medical Visual Question Answering
Tiancheng Gu
Kaicheng Yang
Dongnan Liu
Weidong Cai
MedIm
41
2
0
19 Apr 2024
ECOR: Explainable CLIP for Object Recognition
Ali Rasekh
Sepehr Kazemi Ranjbar
Milad Heidari
Wolfgang Nejdl
VLM
46
4
0
19 Apr 2024
ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images
Quan Van Nguyen
Dan Quang Tran
Huy Quang Pham
Thang Kien-Bao Nguyen
Nghia Hieu Nguyen
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
CoGe
39
3
0
16 Apr 2024
Previous
1
2
3
4
5
6
7
8
Next