CLIP-Adapter: Better Vision-Language Models with Feature Adapters


9 October 2021
Peng Gao
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Yongfeng Zhang
Hongsheng Li
Yu Qiao
    VLM
    CLIP

Papers citing "CLIP-Adapter: Better Vision-Language Models with Feature Adapters"

50 / 642 papers shown
Title
OV-PARTS: Towards Open-Vocabulary Part Segmentation
Meng Wei
Xiaoyu Yue
Wenwei Zhang
Shu Kong
Xihui Liu
Jiangmiao Pang
VLM
26
24
0
08 Oct 2023
PrototypeFormer: Learning to Explore Prototype Relationships for Few-shot Image Classification
Feihong He
Gang Li
Hui Xiong
VLM
ViT
54
1
0
05 Oct 2023
Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models
Yiwen Tang
Ivan Tang
Ray Gu
Dong Wang
Eric Zhang
Bin Zhao
Xuelong Li
3DPC
29
20
0
04 Oct 2023
Domain-Controlled Prompt Learning
Qinglong Cao
Zhengqin Xu
Yuantian Chen
Chao Ma
Xiaokang Yang
VLM
31
16
0
30 Sep 2023
Noise-Tolerant Unsupervised Adapter for Vision-Language Models
Eman Ali
Dayan Guan
Muhammad Haris Khan
Abdulmotaleb Elsaddik
VLM
24
0
0
26 Sep 2023
GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph
Xin Li
Dongze Lian
Zhihe Lu
Jiawang Bai
Zhibo Chen
Xinchao Wang
VLM
43
60
0
24 Sep 2023
TextCLIP: Text-Guided Face Image Generation And Manipulation Without Adversarial Training
Xiaozhou You
Jian Zhang
CLIP
VLM
15
0
0
21 Sep 2023
Guide Your Agent with Adaptive Multimodal Rewards
Changyeon Kim
Younggyo Seo
Hao Liu
Lisa Lee
Jinwoo Shin
Honglak Lee
Kimin Lee
23
9
0
19 Sep 2023
CLIP-based Synergistic Knowledge Transfer for Text-based Person Retrieval
Yating Liu
Yaowei Li
Zimo Liu
Wenming Yang
Yaowei Wang
Qingmin Liao
VLM
26
11
0
18 Sep 2023
Efficient Pyramid Channel Attention Network for Pathological Myopia Recognition
Xiaoqing Zhang
Jilu Zhao
Yan Li
Hao Wu
Xiangtian Zhou
Jiang Liu
22
1
0
17 Sep 2023
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
24
18
0
14 Sep 2023
DePT: Decoupled Prompt Tuning
Ji Zhang
Shihan Wu
Lianli Gao
Hengtao Shen
Jingkuan Song
VLM
32
27
0
14 Sep 2023
TAP: Targeted Prompting for Task Adaptive Generation of Textual Training Instances for Visual Classification
M. Jehanzeb Mirza
Leonid Karlinsky
Wei Lin
Horst Possegger
Rogerio Feris
Horst Bischof
VLM
37
6
0
13 Sep 2023
Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory
Ting Lei
Fabian Caba
Qingchao Chen
Hailin Jin
Yuxin Peng
Yang Liu
VLM
39
17
0
07 Sep 2023
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models
Qiong Wu
Wei Yu
Yiyi Zhou
Shubin Huang
Xiaoshuai Sun
Rongrong Ji
VLM
26
7
0
04 Sep 2023
BDC-Adapter: Brownian Distance Covariance for Better Vision-Language Reasoning
Yi Zhang
Ce Zhang
Zihan Liao
Yushun Tang
Zhihai He
BDL
VLM
26
10
0
03 Sep 2023
LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for Vision-Language Models
Cheng Shi
Sibei Yang
VLM
19
21
0
03 Sep 2023
Big-model Driven Few-shot Continual Learning
Ziqi Gu
Chunyan Xu
Zihan Lu
Xin Liu
Anbo Dai
Zhen Cui
CLL
35
1
0
02 Sep 2023
Bootstrap Fine-Grained Vision-Language Alignment for Unified Zero-Shot Anomaly Localization
Hanqiu Deng
Zhaoxiang Zhang
Jinan Bao
Xingyu Li
VLM
30
4
0
30 Aug 2023
Read-only Prompt Optimization for Vision-Language Few-shot Learning
Dongjun Lee
Seokwon Song
Jihee G. Suh
Joonmyeong Choi
S. Lee
Hyunwoo J. Kim
VLM
41
42
0
29 Aug 2023
Referring Image Segmentation Using Text Supervision
Fang Liu
Yuhao Liu
Yuqiu Kong
Ke Xu
L. Zhang
Baocai Yin
Gerhard Hancke
Rynson W. H. Lau
32
26
0
28 Aug 2023
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory
Haiwen Diao
Bo Wan
Yuhang Zhang
Xuecong Jia
Huchuan Lu
Long Chen
VLM
36
18
0
28 Aug 2023
Fine-tuning can cripple your foundation model; preserving features may be the solution
Jishnu Mukhoti
Y. Gal
Philip H. S. Torr
P. Dokania
CLL
40
31
0
25 Aug 2023
Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
Shengxiang Zhang
Muzammal Naseer
Guangyi Chen
Zhiqiang Shen
Salman Khan
Anton van den Hengel
F. Khan
VLM
60
5
0
24 Aug 2023
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval
Yuan Yuan
Yangfan Zhan
Zhitong Xiong
VLM
30
39
0
24 Aug 2023
CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No
Hualiang Wang
Yi Li
Huifeng Yao
Xiaomeng Li
VLM
OODD
40
94
0
23 Aug 2023
GOPro: Generate and Optimize Prompts in CLIP using Self-Supervised Learning
Mainak Singha
Ankit Jha
Biplab Banerjee
VLM
36
4
0
22 Aug 2023
Unsupervised Prototype Adapter for Vision-Language Models
Yi Zhang
Ce Zhang
Xue-mei Hu
Z. He
VLM
29
4
0
22 Aug 2023
ViLLA: Fine-Grained Vision-Language Representation Learning from Real-World Data
M. Varma
Jean-Benoit Delbrouck
Sarah Hooper
Akshay S. Chaudhari
C. Langlotz
VLM
CoGe
40
5
0
22 Aug 2023
An Examination of the Compositionality of Large Generative Vision-Language Models
Teli Ma
Rong Li
Junwei Liang
CoGe
34
2
0
21 Aug 2023
COCA: Classifier-Oriented Calibration via Textual Prototype for Source-Free Universal Domain Adaptation
Xinghong Liu
Yi Zhou
Tao Zhou
Chun-Mei Feng
Ling Shao
VLM
29
2
0
21 Aug 2023
An Empirical Study of CLIP for Text-based Person Search
Min Cao
Yang Bai
Ziyin Zeng
Mang Ye
Min Zhang
VLM
49
36
0
19 Aug 2023
Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition
Xuanyu Yi
Jiajun Deng
Qianru Sun
Xiansheng Hua
J. Lim
Hanwang Zhang
3DPC
18
14
0
18 Aug 2023
The Unreasonable Effectiveness of Large Language-Vision Models for Source-free Video Domain Adaptation
Giacomo Zara
Alessandro Conti
Subhankar Roy
Stéphane Lathuilière
Paolo Rota
Elisa Ricci
27
11
0
17 Aug 2023
Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
Guangyi Chen
Xiao Liu
Guangrun Wang
Anton van den Hengel
Philip H. S. Torr
Xiaoping Zhang
Yansong Tang
21
18
0
16 Aug 2023
A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision
Julio Silva-Rodríguez
H. Chakor
Riadh Kobbi
Jose Dolz
Ismail Ben Ayed
VLM
MedIm
72
33
0
15 Aug 2023
ICPC: Instance-Conditioned Prompting with Contrastive Learning for Semantic Segmentation
Chaohui Yu
Qiang-feng Zhou
Zhibin Wang
Fan Wang
VLM
27
1
0
14 Aug 2023
Orthogonal Temporal Interpolation for Zero-Shot Video Recognition
Yan Zhu
Junbao Zhuo
B. Ma
Jiajia Geng
Xiaoming Wei
Xiaolin K. Wei
Shuhui Wang
VLM
25
5
0
14 Aug 2023
Foundation Model is Efficient Multimodal Multitask Model Selector
Fanqing Meng
Wenqi Shao
Zhanglin Peng
Chong Jiang
Kaipeng Zhang
Yu Qiao
Ping Luo
30
13
0
11 Aug 2023
Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning
Chun-Mei Feng
Kai Yu
Yong Liu
Salman Khan
W. Zuo
VLM
22
77
0
11 Aug 2023
Exploring Part-Informed Visual-Language Learning for Person Re-Identification
Y. Lin
Cong Liu
Yehansen Chen
Jinshui Hu
Bing Yin
Baocai Yin
Zengfu Wang
64
7
0
04 Aug 2023
DualCoOp++: Fast and Effective Adaptation to Multi-Label Recognition with Limited Annotations
Ping Hu
Ximeng Sun
Stan Sclaroff
Kate Saenko
VLM
34
21
0
03 Aug 2023
Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
Ka Leong Cheng
Wenpo Song
Zheng Ma
Wenhao Zhu
Zi-Yue Zhu
Jianbing Zhang
CLIP
VLM
27
10
0
02 Aug 2023
DriveAdapter: Breaking the Coupling Barrier of Perception and Planning in End-to-End Autonomous Driving
Xiaosong Jia
Yulu Gao
Li Chen
Junchi Yan
Patrick Langechuan Liu
Hongyang Li
11
64
0
01 Aug 2023
Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks
Kousik Rajesh
Mrigank Raman
M. A. Karim
Pranit Chawla
VLM
25
2
0
31 Jul 2023
Cross-Modal Concept Learning and Inference for Vision-Language Models
Yi Zhang
Ce Zhang
Yushun Tang
Z. He
VLM
MLLM
CLIP
36
15
0
28 Jul 2023
Improving Social Media Popularity Prediction with Multiple Post Dependencies
Zhizhen Zhang
Xiao-Zhu Xie
Meng Yang
Ye Tian
Yong-jia Jiang
Yong Cui
26
5
0
28 Jul 2023
PromptStyler: Prompt-driven Style Generation for Source-free Domain Generalization
Junhyeong Cho
Gilhyun Nam
Sungyeon Kim
Hunmin Yang
Suha Kwak
VLM
OOD
TTA
27
48
0
27 Jul 2023
Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models
Kecheng Zheng
Wei Wu
Ruili Feng
Kai Zhu
Jiawei Liu
Deli Zhao
Zhengjun Zha
Wei Chen
Yujun Shen
VLM
26
8
0
27 Jul 2023
Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
Kolby Nottingham
Yasaman Razeghi
Kyungmin Kim
JB Lanier
Pierre Baldi
Roy Fox
Sameer Singh
30
9
0
21 Jul 2023