ResearchTrend.AI
MaPLe: Multi-modal Prompt Learning

6 October 2022
Muhammad Uzair Khattak
H. Rasheed
Muhammad Maaz
Salman Khan
F. Khan
    VPVLM
    VLM

Papers citing "MaPLe: Multi-modal Prompt Learning"

50 / 384 papers shown
Title
Promoting AI Equity in Science: Generalized Domain Prompt Learning for Accessible VLM Research
Qinglong Cao
Yuntian Chen
Lu Lu
Hao Sun
Zhenzhong Zeng
Xiaokang Yang
Dong-juan Zhang
VLM
31
1
0
14 May 2024
Open-Vocabulary Object Detection via Neighboring Region Attention Alignment
Sunyuan Qiang
Xianfei Li
Yanyan Liang
Wenlong Liao
Tao He
Pai Peng
ObjD
40
0
0
14 May 2024
Can Better Text Semantics in Prompt Tuning Improve VLM Generalization?
Hari Chandana Kuchibhotla
Sai Srinivas Kancheti
Abbavaram Gowtham Reddy
Vineeth N. Balasubramanian
VLM
42
0
0
13 May 2024
Exploring Text-Guided Single Image Editing for Remote Sensing Images
Fangzhou Han
Hui Xiong
Hongwei Dong
Lamei Zhang
Hao Chen
Bo Du
DiffM
39
1
0
09 May 2024
Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification
Siqi Yin
Lifan Jiang
24
0
0
03 May 2024
Revisiting the Adversarial Robustness of Vision Language Models: a Multimodal Perspective
Wanqi Zhou
Shuanghao Bai
Qibin Zhao
Badong Chen
VLM
AAML
44
5
0
30 Apr 2024
Soft Prompt Generation for Domain Generalization
Shuanghao Bai
Yuedi Zhang
Wanqi Zhou
Zhirong Luan
Badong Chen
VLM
44
3
0
30 Apr 2024
AAPL: Adding Attributes to Prompt Learning for Vision-Language Models
Gahyeon Kim
Sohee Kim
Seokju Lee
VLM
33
5
0
25 Apr 2024
AdaIR: Exploiting Underlying Similarities of Image Restoration Tasks with Adapters
Hao-Wei Chen
Yu-Syuan Xu
Kelvin C. K. Chan
Hsien-Kai Kuo
Chun-Yi Lee
Ming-Hsuan Yang
29
1
0
17 Apr 2024
Exploring the Transferability of Visual Prompting for Multimodal Large Language Models
Yichi Zhang
Yinpeng Dong
Siyuan Zhang
Tianzan Min
Hang Su
Jun Zhu
LRM
VLM
52
5
0
17 Apr 2024
Conditional Prototype Rectification Prompt Learning
Haoxing Chen
Yaohui Li
Zizheng Huang
Yan Hong
Zhuoer Xu
Zhangxuan Gu
Jun Lan
Huijia Zhu
Weiqiang Wang
VLM
50
3
0
15 Apr 2024
The Devil is in the Few Shots: Iterative Visual Knowledge Completion for Few-shot Learning
Yaohui Li
Qifeng Zhou
Haoxing Chen
Jianbing Zhang
Xinyu Dai
Hao Zhou
VLM
50
0
0
15 Apr 2024
Leveraging Temporal Contextualization for Video Action Recognition
Minji Kim
Dongyoon Han
Taekyung Kim
Bohyung Han
51
2
0
15 Apr 2024
MMA-DFER: MultiModal Adaptation of unimodal models for Dynamic Facial Expression Recognition in-the-wild
K. Chumachenko
Alexandros Iosifidis
Moncef Gabbouj
21
6
0
13 Apr 2024
AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning
Yuwei Tang
Zhenyi Lin
Qilong Wang
Pengfei Zhu
Qinghua Hu
30
11
0
13 Apr 2024
PM2: A New Prompting Multi-modal Model Paradigm for Few-shot Medical Image Classification
Zhenwei Wang
Qiule Sun
Bingbing Zhang
Pengfei Wang
Jianxin Zhang
Qiang Zhang
VLM
38
1
0
13 Apr 2024
PromptSync: Bridging Domain Gaps in Vision-Language Models through Class-Aware Prototype Alignment and Discrimination
Anant Khandelwal
VLM
23
1
0
11 Apr 2024
Anchor-based Robust Finetuning of Vision-Language Models
Jinwei Han
Zhiwen Lin
Zhongyi Sun
Yingguo Gao
Ke Yan
Shouhong Ding
Yuan Gao
Gui-Song Xia
VLM
71
6
0
09 Apr 2024
PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection
Xiaofan Li
Zhizhong Zhang
Xin Tan
Chengwei Chen
Yanyun Qu
Yuan Xie
Lizhuang Ma
VLM
58
36
0
08 Apr 2024
Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation
Ji-Jia Wu
Andy Chia-Hao Chang
Chieh-Yu Chuang
Chun-Pei Chen
Yu-Lun Liu
Min-Hung Chen
Hou-Ning Hu
Yung-Yu Chuang
Yen-Yu Lin
VLM
43
9
0
05 Apr 2024
Learning Transferable Negative Prompts for Out-of-Distribution Detection
Tianqi Li
Guansong Pang
Xiaolong Bai
Wenjun Miao
Jingyi Zheng
VLM
60
12
0
04 Apr 2024
Prompt Learning via Meta-Regularization
Jinyoung Park
Juyeon Ko
Hyunwoo J. Kim
VLM
VPVLM
47
14
0
01 Apr 2024
Unknown Prompt, the only Lacuna: Unveiling CLIP's Potential for Open Domain Generalization
Mainak Singha
Ankit Jha
Shirsha Bose
Ashwin Nair
Moloud Abdar
Biplab Banerjee
VLM
57
10
0
31 Mar 2024
Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration
Shihao Zhou
Jinshan Pan
Jinglei Shi
Duosheng Chen
Lishen Qu
Jufeng Yang
VLM
23
3
0
30 Mar 2024
X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization
Anna Kukleva
Fadime Sener
Edoardo Remelli
Bugra Tekin
Eric Sauser
Bernt Schiele
Shugao Ma
VLM
EgoV
42
1
0
28 Mar 2024
CLAP4CLIP: Continual Learning with Probabilistic Finetuning for Vision-Language Models
Saurav Jha
Dong Gong
Lina Yao
CLIP
VLM
33
8
0
28 Mar 2024
Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
Yabin Zhang
Wen-Qing Zhu
Hui Tang
Zhiyuan Ma
Kaiyang Zhou
Lei Zhang
VLM
31
21
0
26 Mar 2024
Few-Shot Adversarial Prompt Learning on Vision-Language Models
Yiwei Zhou
Xiaobo Xia
Zhiwei Lin
Bo Han
Tongliang Liu
VLM
39
10
0
21 Mar 2024
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey
Zeyu Han
Chao Gao
Jinyang Liu
Jeff Zhang
Sai Qian Zhang
150
310
0
21 Mar 2024
Towards Multimodal In-Context Learning for Vision & Language Models
Sivan Doveh
Shaked Perek
M. Jehanzeb Mirza
Wei Lin
Amit Alfassy
Assaf Arbelle
S. Ullman
Leonid Karlinsky
VLM
114
14
0
19 Mar 2024
Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs
M. Jehanzeb Mirza
Leonid Karlinsky
Wei Lin
Sivan Doveh
Jakub Micorek
Mateusz Koziński
Hilde Kuehne
Horst Possegger
VLM
MLLM
41
13
0
18 Mar 2024
CPA-Enhancer: Chain-of-Thought Prompted Adaptive Enhancer for Object Detection under Unknown Degradations
Yuwei Zhang
Yan Wu
Yanming Liu
Xinyue Peng
49
5
0
17 Mar 2024
Model Reprogramming Outperforms Fine-tuning on Out-of-distribution Data in Text-Image Encoders
Andrew Geng
Pin-Yu Chen
OODD
19
0
0
16 Mar 2024
An Empirical Study of Parameter Efficient Fine-tuning on Vision-Language Pre-train Model
Yuxin Tian
Mouxing Yang
Yunfan Li
Dayiheng Liu
Xingzhang Ren
Xiaocui Peng
Jiancheng Lv
VLM
37
0
0
13 Mar 2024
Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation
Zicheng Zhang
Tong Zhang
Yi Zhu
Jian-zhuo Liu
Xiaodan Liang
QiXiang Ye
Wei Ke
VLM
46
2
0
13 Mar 2024
It's All About Your Sketch: Democratising Sketch Control in Diffusion Models
Subhadeep Koley
A. Bhunia
Deeptanshu Sekhri
Aneeshan Sain
Pinaki Nath Chowdhury
Tao Xiang
Yi-Zhe Song
DiffM
42
16
0
12 Mar 2024
You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval
Subhadeep Koley
A. Bhunia
Aneeshan Sain
Pinaki Nath Chowdhury
Tao Xiang
Yi-Zhe Song
3DV
51
11
0
12 Mar 2024
Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation
Xinyao Li
Yuke Li
Zhekai Du
Fengling Li
Ke Lu
Jingjing Li
VLM
54
4
0
11 Mar 2024
RESTORE: Towards Feature Shift for Vision-Language Prompt Learning
Yuncheng Yang
Chuyan Zhang
Zuopeng Yang
Yuting Gao
Yulei Qin
Ke Li
Xing Sun
Jie-jin Yang
Yun Gu
VLM
VPVLM
49
0
0
10 Mar 2024
In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model
Junhui Yin
Xinyu Zhang
Lin Wu
Xianghua Xie
Xiaojie Wang
VPVLM
VLM
MLLM
30
2
0
10 Mar 2024
Multimodal Infusion Tuning for Large Models
Hao Sun
Yu Song
Xinyao Yu
Jiaqing Liu
Yen-Wei Chen
Lanfen Lin
VLM
32
0
0
08 Mar 2024
ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes
H. Malik
Muhammad Huzaifa
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
DiffM
40
2
0
07 Mar 2024
Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation
Zhekai Du
Xinyao Li
Fengling Li
Ke Lu
Lei Zhu
Jingjing Li
40
15
0
05 Mar 2024
PromptKD: Unsupervised Prompt Distillation for Vision-Language Models
Zheng Li
Xiang Li
Xinyi Fu
Xing Zhang
Weiqiang Wang
Shuo Chen
Jian Yang
VLM
39
35
0
05 Mar 2024
Few-shot Learner Parameterization by Diffusion Time-steps
Zhongqi Yue
Pan Zhou
Richang Hong
Hanwang Zhang
Qianru Sun
36
11
0
05 Mar 2024
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
David Wan
Jaemin Cho
Elias Stengel-Eskin
Mohit Bansal
VLM
ObjD
51
29
0
04 Mar 2024
Multi-modal Attribute Prompting for Vision-Language Models
Xin Liu
Jiamin Wu
Wenfei Yang
Xu Zhou
Tianzhu Zhang
VLM
23
10
0
01 Mar 2024
Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction
Hao Li
Ying Chen
Yifei Chen
Wenxian Yang
Bowen Ding
Yuchen Han
Liansheng Wang
Rongshan Yu
33
15
0
29 Feb 2024
Global and Local Prompts Cooperation via Optimal Transport for Federated Learning
Hongxia Li
Wei Huang
Jingya Wang
Ye-ling Shi
FedML
VLM
35
19
0
29 Feb 2024
Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization
Deng Li
Aming Wu
Yaowei Wang
Yahong Han
OOD
VLM
26
9
0
28 Feb 2024