2210.03117
Cited By
MaPLe: Multi-modal Prompt Learning
6 October 2022
Muhammad Uzair Khattak
H. Rasheed
Muhammad Maaz
Salman Khan
F. Khan
VPVLM
VLM
Papers citing "MaPLe: Multi-modal Prompt Learning"
50 / 384 papers shown
Title
CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learning
Emanuele Frascaroli
Aniello Panariello
Pietro Buzzega
Lorenzo Bonicelli
Angelo Porrello
Simone Calderara
VLM
CLL
35
3
0
22 Jul 2024
Craft: Cross-modal Aligned Features Improve Robustness of Prompt Tuning
Jingchen Sun
Rohan Sharma
Vishnu Suresh Lokhande
Changyou Chen
41
0
0
22 Jul 2024
A Multimodal Knowledge-enhanced Whole-slide Pathology Foundation Model
Yingxue Xu
Yihui Wang
Fengtao Zhou
Jiabo Ma
Shu Yang
...
Anjia Han
Ronald Cheong Kin Chan
Li Liang
Xiuming Zhang
Hao Chen
37
15
0
22 Jul 2024
Robust Calibration of Large Vision-Language Adapters
Balamurali Murugesan
Julio Silva-Rodríguez
Ismail Ben Ayed
Jose Dolz
OODD
VLM
32
6
0
18 Jul 2024
Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models
Donggeun Kim
Taesup Kim
31
4
0
17 Jul 2024
SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models
Yang Zhou
Yongjian Wu
Jiya Saiyin
Bingzheng Wei
Maode Lai
Eric Chang
Yan Xu
VLM
46
0
0
16 Jul 2024
Quantized Prompt for Efficient Generalization of Vision-Language Models
Tianxiang Hao
Xiaohan Ding
Juexiao Feng
Yuhong Yang
Hui Chen
Guiguang Ding
VLM
MQ
32
5
0
15 Jul 2024
Image Compression for Machine and Human Vision with Spatial-Frequency Adaptation
Han Li
Shaohui Li
Shuangrui Ding
Wenrui Dai
Maida Cao
Chenglin Li
Junni Zou
Hongkai Xiong
VLM
43
5
0
13 Jul 2024
Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization
Jinlong Li
Zequn Jie
Elisa Ricci
Lin Ma
N. Sebe
VLM
39
1
0
11 Jul 2024
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation
Tong Shao
Zhuotao Tian
Hang Zhao
Jingyong Su
VLM
39
15
0
11 Jul 2024
AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization
Shixiong Xu
Chenghao Zhang
Lubin Fan
Gaofeng Meng
Shiming Xiang
Jieping Ye
VLM
46
4
0
11 Jul 2024
SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning
Haiwen Diao
Bo Wan
Xu Jia
Yunzhi Zhuge
Ying Zhang
Huchuan Lu
Long Chen
VLM
50
4
0
10 Jul 2024
FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
Jiedong Zhuang
Jiaqi Hu
Lianrui Mu
Rui Hu
Xiaoyu Liang
Jiangnan Ye
Haoji Hu
CLIP
VLM
34
2
0
08 Jul 2024
Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models
Longxiang Tang
Zhuotao Tian
Kai Li
Chunming He
Hantao Zhou
Hengshuang Zhao
Xiu Li
Jiaya Jia
CLL
VLM
36
20
0
07 Jul 2024
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Yuhan Zhu
Yuyang Ji
Zhiyu Zhao
Gangshan Wu
Limin Wang
VLM
41
7
0
05 Jul 2024
Dude: Dual Distribution-Aware Context Prompt Learning For Large Vision-Language Model
D. M. Nguyen
An T. Le
Trung Q. Nguyen
N. T. Diep
Tai Nguyen
D. Duong-Tran
Jan Peters
Li Shen
Mathias Niepert
Daniel Sonntag
VLM
43
3
0
05 Jul 2024
Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning
Mainak Singha
Ankit Jha
Divyam Gupta
Pranav Singla
Biplab Banerjee
VLM
32
0
0
05 Jul 2024
Fully Fine-tuned CLIP Models are Efficient Few-Shot Learners
Mushui Liu
Bozheng Li
Yunlong Yu
VLM
CLIP
31
3
0
04 Jul 2024
Do Generalised Classifiers really work on Human Drawn Sketches?
Hmrishav Bandyopadhyay
Pinaki Nath Chowdhury
Aneeshan Sain
Subhadeep Koley
Tao Xiang
A. Bhunia
Yi-Zhe Song
VLM
33
2
0
04 Jul 2024
Robust Adaptation of Foundation Models with Black-Box Visual Prompting
Changdae Oh
Gyeongdeok Seo
Geunyoung Jung
Zhi-Qi Cheng
Hosik Choi
Jiyoung Jung
Kyungwoo Song
VLM
38
1
0
04 Jul 2024
Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation
Marco Mistretta
Alberto Baldrati
Marco Bertini
Andrew D. Bagdanov
VPVLM
VLM
35
6
0
03 Jul 2024
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
Lukas Mauch
Marzieh Edraki
Aaron Courville
OODD
CLL
VLM
57
3
0
03 Jul 2024
Conceptual Codebook Learning for Vision-Language Models
Yi Zhang
Ke Yu
Siqi Wu
Zhihai He
VLM
50
2
0
02 Jul 2024
GalLoP: Learning Global and Local Prompts for Vision-Language Models
Marc Lafon
Elias Ramzi
Clément Rambour
Nicolas Audebert
Nicolas Thome
VLM
43
8
0
01 Jul 2024
Advancing Cross-domain Discriminability in Continual Learning of Vision-Language Models
Yicheng Xu
Yuxin Chen
Jiahao Nie
Yusong Wang
Huiping Zhuang
Manabu Okumura
VLM
CLL
43
6
0
27 Jun 2024
Vision-Language Consistency Guided Multi-modal Prompt Learning for Blind AI Generated Image Quality Assessment
Jun Fu
Wei Zhou
Qiuping Jiang
Hantao Liu
Guangtao Zhai
VLM
CLIP
42
8
0
24 Jun 2024
DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection
Jia Syuen Lim
Zhuoxiao Chen
Mahsa Baktashmotlagh
Zhi Chen
Xin Yu
Zi Huang
Yadan Luo
VLM
ObjD
82
1
0
21 Jun 2024
MAC: A Benchmark for Multiple Attributes Compositional Zero-Shot Learning
Shuo Xu
Sai Wang
Xinyue Hu
Yutian Lin
Bo Du
Yu Wu
CoGe
59
0
0
18 Jun 2024
Few-Shot Recognition via Stage-Wise Retrieval-Augmented Finetuning
Tian Liu
Huixin Zhang
Shubham Parashar
Shu Kong
29
2
0
17 Jun 2024
ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery
Kam Woh Ng
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
37
2
0
12 Jun 2024
ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs
Irene Huang
Wei Lin
M. Jehanzeb Mirza
Jacob A. Hansen
Sivan Doveh
...
Trevor Darrell
Chuang Gan
Aude Oliva
Rogerio Feris
Leonid Karlinsky
CoGe
LRM
43
7
0
12 Jun 2024
Robust Latent Representation Tuning for Image-text Classification
Hao Sun
Yu Song
VLM
57
0
0
10 Jun 2024
OVMR: Open-Vocabulary Recognition with Multi-Modal References
Zehong Ma
Shiliang Zhang
Longhui Wei
Qi Tian
VLM
41
0
0
07 Jun 2024
Learning Visual Prompts for Guiding the Attention of Vision Transformers
Razieh Rezaei
Masoud Jalili Sabet
Jindong Gu
Daniel Rueckert
Philip H. S. Torr
Ashkan Khakzar
29
5
0
05 Jun 2024
Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
Haodong Hong
Sen Wang
Zi Huang
Qi Wu
Jiajun Liu
38
3
0
04 Jun 2024
ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization
Chen Mao
Jingqi Hu
34
4
0
04 Jun 2024
Boosting Vision-Language Models with Transduction
Maxime Zanella
Benoît Gérin
Ismail Ben Ayed
VLM
42
5
0
03 Jun 2024
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection
Jiaming Li
Jiacheng Zhang
Jichang Li
Ge Li
Si Liu
Liang Lin
Guanbin Li
ObjD
VLM
48
13
0
01 Jun 2024
Effectiveness of Vision Language Models for Open-world Single Image Test Time Adaptation
Manogna Sreenivas
Soma Biswas
VLM
43
0
0
01 Jun 2024
XPrompt: Explaining Large Language Model's Generation via Joint Prompt Attribution
Yurui Chang
Bochuan Cao
Yujia Wang
Jinghui Chen
Lu Lin
LRM
27
0
0
30 May 2024
Low-Rank Few-Shot Adaptation of Vision-Language Models
Maxime Zanella
Ismail Ben Ayed
OffRL
VLM
56
26
0
28 May 2024
Frustratingly Easy Test-Time Adaptation of Vision-Language Models
Matteo Farina
Gianni Franchi
Giovanni Iacca
Massimiliano Mancini
Elisa Ricci
VLM
45
5
0
28 May 2024
ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection
Ziying Song
Feiyang Jia
Hongyu Pan
Yadan Luo
Caiyan Jia
Guoxin Zhang
Lin Liu
Yang Ji
Lei Yang
Li-e Wang
39
9
0
27 May 2024
Enhancing Near OOD Detection in Prompt Learning: Maximum Gains, Minimal Costs
M. Jung
He Zhao
Joanna Dipnall
Belinda Gabbe
Lan Du
VLM
OODD
34
1
0
25 May 2024
Disease-informed Adaptation of Vision-Language Models
Jiajin Zhang
Ge Wang
M. Kalra
P. Yan
VLM
46
2
0
24 May 2024
Learning from True-False Labels via Multi-modal Prompt Retrieving
Zhongnian Li
Jinghao Xu
Peng Ying
Meng Wei
Tongfeng Sun
Xinzheng Xu
35
0
0
24 May 2024
Are You Copying My Prompt? Protecting the Copyright of Vision Prompt for VPaaS via Watermark
Huali Ren
Anli Yan
Chong-zhi Gao
Hongyang Yan
Zhenxin Zhang
Jin Li
VLM
AAML
32
4
0
24 May 2024
CLIP model is an Efficient Online Lifelong Learner
Leyuan Wang
Liuyu Xiang
Yujie Wei
Yunlong Wang
Zhaofeng He
VLM
CLL
32
3
0
24 May 2024
Position-Guided Prompt Learning for Anomaly Detection in Chest X-Rays
Zhichao Sun
Yuliang Gu
Yepeng Liu
Zerui Zhang
Zhou Zhao
Yongchao Xu
MedIm
89
2
0
20 May 2024
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
Mingxuan Liu
Tyler L. Hayes
Elisa Ricci
G. Csurka
Riccardo Volpi
ObjD
61
1
0
16 May 2024