ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.05557
  4. Cited By
Conditional Prompt Learning for Vision-Language Models

Conditional Prompt Learning for Vision-Language Models

10 March 2022
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
    VLM
    CLIP
    VPVLM
ArXivPDFHTML

Papers citing "Conditional Prompt Learning for Vision-Language Models"

50 / 256 papers shown
Title
DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object
  Detection
DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection
Yiming Li
Rui Zhang
Hantao Yao
X. Zhang
Yifan Hao
Xinkai Song
Xiaqing Li
Yongwei Zhao
Ling Li
Yunji Chen
ObjD
VLM
37
4
0
11 Oct 2024
Conjugated Semantic Pool Improves OOD Detection with Pre-trained
  Vision-Language Models
Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models
Mengyuan Chen
Junyu Gao
Changsheng Xu
VLM
OODD
30
1
0
11 Oct 2024
Deep Correlated Prompting for Visual Recognition with Missing Modalities
Deep Correlated Prompting for Visual Recognition with Missing Modalities
Lianyu Hu
Tongkai Shi
Wei Feng
Fanhua Shang
Liang Wan
VLM
44
1
0
09 Oct 2024
An Eye for an Ear: Zero-shot Audio Description Leveraging an Image
  Captioner using Audiovisual Distribution Alignment
An Eye for an Ear: Zero-shot Audio Description Leveraging an Image Captioner using Audiovisual Distribution Alignment
Hugo Malard
Michel Olvera
Stéphane Lathuilière
S. Essid
VLM
39
0
0
08 Oct 2024
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models
Muhammad Jehanzeb Mirza
Mengjie Zhao
Zhuoyuan Mao
Sivan Doveh
Wei Lin
...
Yuki Mitsufuji
Horst Possegger
Rogerio Feris
Leonid Karlinsky
James Glass
VLM
84
1
0
08 Oct 2024
CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection
CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection
Mingyi Guo
Yuyang Liu
Zongying Lin
Peixi Peng
Yonghong Tian
Yonghong Tian
VLM
35
0
0
08 Oct 2024
Exploring Information-Theoretic Metrics Associated with Neural Collapse in Supervised Training
Exploring Information-Theoretic Metrics Associated with Neural Collapse in Supervised Training
Kun Song
Zhiquan Tan
Bochao Zou
Jiansheng Chen
Huimin Ma
Weiran Huang
42
0
0
25 Sep 2024
TSCLIP: Robust CLIP Fine-Tuning for Worldwide Cross-Regional Traffic Sign Recognition
TSCLIP: Robust CLIP Fine-Tuning for Worldwide Cross-Regional Traffic Sign Recognition
Guoyang Zhao
Fulong Ma
Weiqing Qi
Chenguang Zhang
Yuxuan Liu
Ming Liu
Jun Ma
VLM
CLIP
170
3
0
23 Sep 2024
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting
Yongqi Wang
Xinxiao Wu
Shuo Yang
Jiebo Luo
182
1
0
19 Sep 2024
Recent Advances in OOD Detection: Problems and Approaches
Recent Advances in OOD Detection: Problems and Approaches
Shuo Lu
YingSheng Wang
Lijun Sheng
Aihua Zheng
Lingxiao He
Jian Liang
OODD
68
3
0
18 Sep 2024
Revisiting Prompt Pretraining of Vision-Language Models
Revisiting Prompt Pretraining of Vision-Language Models
Zhenyuan Chen
Lingfeng Yang
Shuo Chen
Zhaowei Chen
Jiajun Liang
Xiang Li
MLLM
VPVLM
VLM
43
1
0
10 Sep 2024
A Novel Dataset for Video-Based Autism Classification Leveraging
  Extra-Stimulatory Behavior
A Novel Dataset for Video-Based Autism Classification Leveraging Extra-Stimulatory Behavior
Manuel Serna-Aguilera
Xuan-Bac Nguyen
Han-Seok Seo
Khoa Luu
49
1
0
06 Sep 2024
Multi-Modal Adapter for Vision-Language Models
Multi-Modal Adapter for Vision-Language Models
Dominykas Seputis
Serghei Mihailov
Soham Chatterjee
Zehao Xiao
VLM
36
1
0
03 Sep 2024
TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval
TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval
Leqi Shen
Tianxiang Hao
Tao He
Sicheng Zhao
Pengzhang Liu
Yongjun Bao
Guiguang Ding
Guiguang Ding
138
7
0
02 Sep 2024
Robust Domain Generalization for Multi-modal Object Recognition
Robust Domain Generalization for Multi-modal Object Recognition
Yuxin Qiao
Keqin Li
Junhong Lin
Rong Wei
Chufeng Jiang
Yang Luo
Haoyu Yang
VLM
47
26
0
11 Aug 2024
Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for
  Continual Learning
Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for Continual Learning
Lu Yu
Hesong Li
Ying Fu
Joost van de Weijer
Changsheng Xu
CLL
55
1
0
02 Aug 2024
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
Anurag Das
Xinting Hu
Li Jiang
Bernt Schiele
VLM
49
3
0
31 Jul 2024
Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks
Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks
Hunmin Yang
Jongoh Jeong
Kuk-Jin Yoon
AAML
VLM
60
4
0
30 Jul 2024
Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation
Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation
Tz-Ying Wu
Kyle Min
Subarna Tripathi
Nuno Vasconcelos
EgoV
55
0
0
28 Jul 2024
AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot
  Anomaly Detection
AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection
Yunkang Cao
Jiangning Zhang
Luca Frittoli
Yuqi Cheng
Weiming Shen
Giacomo Boracchi
VLM
56
29
0
22 Jul 2024
CLIP with Generative Latent Replay: a Strong Baseline for Incremental
  Learning
CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learning
Emanuele Frascaroli
Aniello Panariello
Pietro Buzzega
Lorenzo Bonicelli
Angelo Porrello
Simone Calderara
VLM
CLL
43
3
0
22 Jul 2024
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic
  Segmentation
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation
Tong Shao
Zhuotao Tian
Hang Zhao
Jingyong Su
VLM
42
15
0
11 Jul 2024
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
Rui Qian
Shuangrui Ding
Dahua Lin
OCL
52
1
0
09 Jul 2024
The Solution for Language-Enhanced Image New Category Discovery
The Solution for Language-Enhanced Image New Category Discovery
Haonan Xu
Dian Chao
Xiangyu Wu
Zhonghua Wan
Yang Yang
VLM
35
0
0
06 Jul 2024
AWT: Transferring Vision-Language Models via Augmentation, Weighting,
  and Transportation
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Yuhan Zhu
Yuyang Ji
Zhiyu Zhao
Gangshan Wu
Limin Wang
VLM
47
7
0
05 Jul 2024
Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal
  Prompt Learning
Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning
Mainak Singha
Ankit Jha
Divyam Gupta
Pranav Singla
Biplab Banerjee
VLM
32
0
0
05 Jul 2024
SOWA: Adapting Hierarchical Frozen Window Self-Attention to
  Visual-Language Models for Better Anomaly Detection
SOWA: Adapting Hierarchical Frozen Window Self-Attention to Visual-Language Models for Better Anomaly Detection
Zongxiang Hu
Zhaosheng Zhang
VLM
29
1
0
04 Jul 2024
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
Lukas Mauch
Marzieh Edraki
Aaron Courville
OODD
CLL
VLM
57
3
0
03 Jul 2024
GalLoP: Learning Global and Local Prompts for Vision-Language Models
GalLoP: Learning Global and Local Prompts for Vision-Language Models
Marc Lafon
Elias Ramzi
Clément Rambour
Nicolas Audebert
Nicolas Thome
VLM
46
8
0
01 Jul 2024
CPT: Consistent Proxy Tuning for Black-box Optimization
CPT: Consistent Proxy Tuning for Black-box Optimization
Yuanyang He
Zitong Huang
Xinxing Xu
Rick Siow Mong Goh
Salman Khan
W. Zuo
Yong Liu
Chun-Mei Feng
48
0
0
01 Jul 2024
Embedded Visual Prompt Tuning
Embedded Visual Prompt Tuning
Wenqiang Zu
Shenghao Xie
Qing Zhao
Guoqi Li
Lei Ma
VLM
MedIm
56
9
0
01 Jul 2024
GM-DF: Generalized Multi-Scenario Deepfake Detection
GM-DF: Generalized Multi-Scenario Deepfake Detection
Yingxin Lai
Zitong Yu
Jing Yang
Bin Li
Xiangui Kang
Linlin Shen
40
8
0
28 Jun 2024
Learning Visual Conditioning Tokens to Correct Domain Shift for Fully
  Test-time Adaptation
Learning Visual Conditioning Tokens to Correct Domain Shift for Fully Test-time Adaptation
Yushun Tang
Shuoshuo Chen
Zhehan Kan
Yi Zhang
Qinghai Guo
Zhihai He
51
2
0
27 Jun 2024
Open-Vocabulary Temporal Action Localization using Multimodal Guidance
Open-Vocabulary Temporal Action Localization using Multimodal Guidance
Akshita Gupta
Aditya Arora
Sanath Narayan
Salman Khan
Fahad Shahbaz Khan
Graham W. Taylor
41
3
0
21 Jun 2024
DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection
DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection
Jia Syuen Lim
Zhuoxiao Chen
Mahsa Baktashmotlagh
Zhi Chen
Xin Yu
Zi Huang
Yadan Luo
VLM
ObjD
86
1
0
21 Jun 2024
MAC: A Benchmark for Multiple Attributes Compositional Zero-Shot Learning
MAC: A Benchmark for Multiple Attributes Compositional Zero-Shot Learning
Shuo Xu
Sai Wang
Xinyue Hu
Yutian Lin
Bo Du
Yu Wu
CoGe
59
0
0
18 Jun 2024
Enhancing Domain Adaptation through Prompt Gradient Alignment
Enhancing Domain Adaptation through Prompt Gradient Alignment
Hoang Phan
Lam C. Tran
Quyen Tran
Trung Le
52
0
0
13 Jun 2024
OVMR: Open-Vocabulary Recognition with Multi-Modal References
OVMR: Open-Vocabulary Recognition with Multi-Modal References
Zehong Ma
Shiliang Zhang
Longhui Wei
Qi Tian
VLM
44
0
0
07 Jun 2024
BeFA: A General Behavior-driven Feature Adapter for Multimedia Recommendation
BeFA: A General Behavior-driven Feature Adapter for Multimedia Recommendation
Qile Fan
Penghang Yu
Zhiyi Tan
Bing-Kun Bao
Guanming Lu
37
1
0
01 Jun 2024
CRoFT: Robust Fine-Tuning with Concurrent Optimization for OOD
  Generalization and Open-Set OOD Detection
CRoFT: Robust Fine-Tuning with Concurrent Optimization for OOD Generalization and Open-Set OOD Detection
Lin Zhu
Yifeng Yang
Qinying Gu
Xinbing Wang
Cheng Zhou
Nanyang Ye
VLM
34
2
0
26 May 2024
Disease-informed Adaptation of Vision-Language Models
Disease-informed Adaptation of Vision-Language Models
Jiajin Zhang
Ge Wang
M. Kalra
P. Yan
VLM
46
2
0
24 May 2024
Learning Invariant Causal Mechanism from Vision-Language Models
Learning Invariant Causal Mechanism from Vision-Language Models
Zeen Song
Siyu Zhao
Xingyu Zhang
Jiangmeng Li
Changwen Zheng
Wenwen Qiang
CML
BDL
VLM
45
0
0
24 May 2024
Like Humans to Few-Shot Learning through Knowledge Permeation of Vision
  and Text
Like Humans to Few-Shot Learning through Knowledge Permeation of Vision and Text
Yuyu Jia
Qing Zhou
Wei Huang
Junyu Gao
Qi. Wang
VLM
39
1
0
21 May 2024
Promoting AI Equity in Science: Generalized Domain Prompt Learning for
  Accessible VLM Research
Promoting AI Equity in Science: Generalized Domain Prompt Learning for Accessible VLM Research
Qinglong Cao
Yuntian Chen
Lu Lu
Hao Sun
Zhenzhong Zeng
Xiaokang Yang
Dong-juan Zhang
VLM
37
1
0
14 May 2024
Pseudo-Prompt Generating in Pre-trained Vision-Language Models for
  Multi-Label Medical Image Classification
Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification
Yaoqin Ye
Junjie Zhang
Hongwei Shi
MedIm
VLM
49
0
0
10 May 2024
Temporal and Heterogeneous Graph Neural Network for Remaining Useful
  Life Prediction
Temporal and Heterogeneous Graph Neural Network for Remaining Useful Life Prediction
Zhihao Wen
Yuan Fang
Pengcheng Wei
Fayao Liu
Zhenghua Chen
Min-man Wu
AI4CE
30
2
0
07 May 2024
On the test-time zero-shot generalization of vision-language models: Do
  we really need prompt learning?
On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning?
Maxime Zanella
Ismail Ben Ayed
VLM
MLLM
56
23
0
03 May 2024
MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition
MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition
Hongyu Qu
Rui Yan
Xiangbo Shu
Haoliang Gao
Peng Huang
Guo-Sen Xie
61
4
0
03 May 2024
Understanding Retrieval-Augmented Task Adaptation for Vision-Language
  Models
Understanding Retrieval-Augmented Task Adaptation for Vision-Language Models
Yifei Ming
Yixuan Li
VLM
41
7
0
02 May 2024
Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Samuel Lavoie
Polina Kirichenko
Mark Ibrahim
Mahmoud Assran
Andrew Gordon Wilson
Aaron Courville
Nicolas Ballas
CLIP
VLM
69
20
0
30 Apr 2024
Previous
123456
Next