2110.04544
CLIP-Adapter: Better Vision-Language Models with Feature Adapters
9 October 2021
Peng Gao
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Yongfeng Zhang
Hongsheng Li
Yu Qiao
VLM
CLIP
Papers citing
"CLIP-Adapter: Better Vision-Language Models with Feature Adapters"
50 / 642 papers shown
Title
OV-PARTS: Towards Open-Vocabulary Part Segmentation
Meng Wei
Xiaoyu Yue
Wenwei Zhang
Shu Kong
Xihui Liu
Jiangmiao Pang
VLM
26
24
0
08 Oct 2023
PrototypeFormer: Learning to Explore Prototype Relationships for Few-shot Image Classification
Feihong He
Gang Li
Hui Xiong
VLM
ViT
54
1
0
05 Oct 2023
Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models
Yiwen Tang
Ivan Tang
Ray Gu
Dong Wang
Eric Zhang
Bin Zhao
Xuelong Li
3DPC
29
20
0
04 Oct 2023
Domain-Controlled Prompt Learning
Qinglong Cao
Zhengqin Xu
Yuantian Chen
Chao Ma
Xiaokang Yang
VLM
31
16
0
30 Sep 2023
Noise-Tolerant Unsupervised Adapter for Vision-Language Models
Eman Ali
Dayan Guan
Muhammad Haris Khan
Abdulmotaleb Elsaddik
VLM
24
0
0
26 Sep 2023
GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph
Xin Li
Dongze Lian
Zhihe Lu
Jiawang Bai
Zhibo Chen
Xinchao Wang
VLM
43
60
0
24 Sep 2023
TextCLIP: Text-Guided Face Image Generation And Manipulation Without Adversarial Training
Xiaozhou You
Jian Zhang
CLIP
VLM
15
0
0
21 Sep 2023
Guide Your Agent with Adaptive Multimodal Rewards
Changyeon Kim
Younggyo Seo
Hao Liu
Lisa Lee
Jinwoo Shin
Honglak Lee
Kimin Lee
23
9
0
19 Sep 2023
CLIP-based Synergistic Knowledge Transfer for Text-based Person Retrieval
Yating Liu
Yaowei Li
Zimo Liu
Wenming Yang
Yaowei Wang
Qingmin Liao
VLM
26
11
0
18 Sep 2023
Efficient Pyramid Channel Attention Network for Pathological Myopia Recognition
Xiaoqing Zhang
Jilu Zhao
Yan Li
Hao Wu
Xiangtian Zhou
Jiang Liu
22
1
0
17 Sep 2023
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
24
18
0
14 Sep 2023
DePT: Decoupled Prompt Tuning
Ji Zhang
Shihan Wu
Lianli Gao
Hengtao Shen
Jingkuan Song
VLM
32
27
0
14 Sep 2023
TAP: Targeted Prompting for Task Adaptive Generation of Textual Training Instances for Visual Classification
M. Jehanzeb Mirza
Leonid Karlinsky
Wei Lin
Horst Possegger
Rogerio Feris
Horst Bischof
VLM
37
6
0
13 Sep 2023
Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory
Ting Lei
Fabian Caba
Qingchao Chen
Hailin Jin
Yuxin Peng
Yang Liu
VLM
39
17
0
07 Sep 2023
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models
Qiong Wu
Wei Yu
Yiyi Zhou
Shubin Huang
Xiaoshuai Sun
Rongrong Ji
VLM
26
7
0
04 Sep 2023
BDC-Adapter: Brownian Distance Covariance for Better Vision-Language Reasoning
Yi Zhang
Ce Zhang
Zihan Liao
Yushun Tang
Zhihai He
BDL
VLM
26
10
0
03 Sep 2023
LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for Vision-Language Models
Cheng Shi
Sibei Yang
VLM
19
21
0
03 Sep 2023
Big-model Driven Few-shot Continual Learning
Ziqi Gu
Chunyan Xu
Zihan Lu
Xin Liu
Anbo Dai
Zhen Cui
CLL
35
1
0
02 Sep 2023
Bootstrap Fine-Grained Vision-Language Alignment for Unified Zero-Shot Anomaly Localization
Hanqiu Deng
Zhaoxiang Zhang
Jinan Bao
Xingyu Li
VLM
30
4
0
30 Aug 2023
Read-only Prompt Optimization for Vision-Language Few-shot Learning
Dongjun Lee
Seokwon Song
Jihee G. Suh
Joonmyeong Choi
S. Lee
Hyunwoo J. Kim
VLM
41
42
0
29 Aug 2023
Referring Image Segmentation Using Text Supervision
Fang Liu
Yuhao Liu
Yuqiu Kong
Ke Xu
L. Zhang
Baocai Yin
Gerhard Hancke
Rynson W. H. Lau
32
26
0
28 Aug 2023
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory
Haiwen Diao
Bo Wan
Yuhang Zhang
Xuecong Jia
Huchuan Lu
Long Chen
VLM
36
18
0
28 Aug 2023
Fine-tuning can cripple your foundation model; preserving features may be the solution
Jishnu Mukhoti
Y. Gal
Philip H. S. Torr
P. Dokania
CLL
40
31
0
25 Aug 2023
Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
Shengxiang Zhang
Muzammal Naseer
Guangyi Chen
Zhiqiang Shen
Salman Khan
Anton van den Hengel
F. Khan
VLM
60
5
0
24 Aug 2023
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval
Yuan Yuan
Yangfan Zhan
Zhitong Xiong
VLM
30
39
0
24 Aug 2023
CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No
Hualiang Wang
Yi Li
Huifeng Yao
Xiaomeng Li
VLM
OODD
40
94
0
23 Aug 2023
GOPro: Generate and Optimize Prompts in CLIP using Self-Supervised Learning
Mainak Singha
Ankit Jha
Biplab Banerjee
VLM
36
4
0
22 Aug 2023
Unsupervised Prototype Adapter for Vision-Language Models
Yi Zhang
Ce Zhang
Xue-mei Hu
Z. He
VLM
29
4
0
22 Aug 2023
ViLLA: Fine-Grained Vision-Language Representation Learning from Real-World Data
M. Varma
Jean-Benoit Delbrouck
Sarah Hooper
Akshay S. Chaudhari
C. Langlotz
VLM
CoGe
40
5
0
22 Aug 2023
An Examination of the Compositionality of Large Generative Vision-Language Models
Teli Ma
Rong Li
Junwei Liang
CoGe
34
2
0
21 Aug 2023
COCA: Classifier-Oriented Calibration via Textual Prototype for Source-Free Universal Domain Adaptation
Xinghong Liu
Yi Zhou
Tao Zhou
Chun-Mei Feng
Ling Shao
VLM
29
2
0
21 Aug 2023
An Empirical Study of CLIP for Text-based Person Search
Min Cao
Yang Bai
Ziyin Zeng
Mang Ye
Min Zhang
VLM
49
36
0
19 Aug 2023
Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition
Xuanyu Yi
Jiajun Deng
Qianru Sun
Xiansheng Hua
J. Lim
Hanwang Zhang
3DPC
18
14
0
18 Aug 2023
The Unreasonable Effectiveness of Large Language-Vision Models for Source-free Video Domain Adaptation
Giacomo Zara
Alessandro Conti
Subhankar Roy
Stéphane Lathuilière
Paolo Rota
Elisa Ricci
27
11
0
17 Aug 2023
Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
Guangyi Chen
Xiao Liu
Guangrun Wang
Anton van den Hengel
Philip H. S. Torr
Xiaoping Zhang
Yansong Tang
21
18
0
16 Aug 2023
A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision
Julio Silva-Rodríguez
H. Chakor
Riadh Kobbi
Jose Dolz
Ismail Ben Ayed
VLM
MedIm
72
33
0
15 Aug 2023
ICPC: Instance-Conditioned Prompting with Contrastive Learning for Semantic Segmentation
Chaohui Yu
Qiang-feng Zhou
Zhibin Wang
Fan Wang
VLM
27
1
0
14 Aug 2023
Orthogonal Temporal Interpolation for Zero-Shot Video Recognition
Yan Zhu
Junbao Zhuo
B. Ma
Jiajia Geng
Xiaoming Wei
Xiaolin K. Wei
Shuhui Wang
VLM
25
5
0
14 Aug 2023
Foundation Model is Efficient Multimodal Multitask Model Selector
Fanqing Meng
Wenqi Shao
Zhanglin Peng
Chong Jiang
Kaipeng Zhang
Yu Qiao
Ping Luo
30
13
0
11 Aug 2023
Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning
Chun-Mei Feng
Kai Yu
Yong Liu
Salman Khan
W. Zuo
VLM
22
77
0
11 Aug 2023
Exploring Part-Informed Visual-Language Learning for Person Re-Identification
Y. Lin
Cong Liu
Yehansen Chen
Jinshui Hu
Bing Yin
Baocai Yin
Zengfu Wang
64
7
0
04 Aug 2023
DualCoOp++: Fast and Effective Adaptation to Multi-Label Recognition with Limited Annotations
Ping Hu
Ximeng Sun
Stan Sclaroff
Kate Saenko
VLM
34
21
0
03 Aug 2023
Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
Ka Leong Cheng
Wenpo Song
Zheng Ma
Wenhao Zhu
Zi-Yue Zhu
Jianbing Zhang
CLIP
VLM
27
10
0
02 Aug 2023
DriveAdapter: Breaking the Coupling Barrier of Perception and Planning in End-to-End Autonomous Driving
Xiaosong Jia
Yulu Gao
Li Chen
Junchi Yan
Patrick Langechuan Liu
Hongyang Li
11
64
0
01 Aug 2023
Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks
Kousik Rajesh
Mrigank Raman
M. A. Karim
Pranit Chawla
VLM
25
2
0
31 Jul 2023
Cross-Modal Concept Learning and Inference for Vision-Language Models
Yi Zhang
Ce Zhang
Yushun Tang
Z. He
VLM
MLLM
CLIP
36
15
0
28 Jul 2023
Improving Social Media Popularity Prediction with Multiple Post Dependencies
Zhizhen Zhang
Xiao-Zhu Xie
Meng Yang
Ye Tian
Yong-jia Jiang
Yong Cui
26
5
0
28 Jul 2023
PromptStyler: Prompt-driven Style Generation for Source-free Domain Generalization
Junhyeong Cho
Gilhyun Nam
Sungyeon Kim
Hunmin Yang
Suha Kwak
VLM
OOD
TTA
27
48
0
27 Jul 2023
Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models
Kecheng Zheng
Wei Wu
Ruili Feng
Kai Zhu
Jiawei Liu
Deli Zhao
Zhengjun Zha
Wei Chen
Yujun Shen
VLM
26
8
0
27 Jul 2023
Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
Kolby Nottingham
Yasaman Razeghi
Kyungmin Kim
JB Lanier
Pierre Baldi
Roy Fox
Sameer Singh
30
9
0
21 Jul 2023