Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.00511
Cited By
Predicting Deep Zero-Shot Convolutional Neural Networks using Textual Descriptions
1 June 2015
Jimmy Ba
Kevin Swersky
Sanja Fidler
Ruslan Salakhutdinov
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Predicting Deep Zero-Shot Convolutional Neural Networks using Textual Descriptions"
50 / 197 papers shown
Title
Visual and Semantic Prompt Collaboration for Generalized Zero-Shot Learning
Huajie Jiang
ZeLin Li
Xiaohan Yu
Yongli Hu
Baocai Yin
Jian Yang
Yuankai Qi
VLM
49
0
0
29 Mar 2025
MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification
Xiangyan Qu
Jing Yu
Jiamin Zhuang
Gaopeng Gou
Gang Xiong
Qi Wu
VLM
51
0
0
10 Mar 2025
Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance
Yaoyun Zhang
Xuenan Xu
Mengyue Wu
VGen
36
0
0
24 Dec 2024
An Individual Identity-Driven Framework for Animal Re-Identification
Yihao Wu
Di Zhao
Jingfeng Zhang
Yun Sing Koh
36
0
0
30 Oct 2024
Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning
Xiangyang Qu
Jing Yu
Keke Gai
Jiamin Zhuang
Yuanmin Tang
Gang Xiong
Gaopeng Gou
Qi Wu
49
2
0
22 Jul 2024
NODE-Adapter: Neural Ordinary Differential Equations for Better Vision-Language Reasoning
Yi Zhang
Chun-Wun Cheng
Ke Yu
Zhihai He
Carola-Bibiane Schonlieb
Angelica I Aviles-Rivero
VLM
55
2
0
11 Jul 2024
Conceptual Codebook Learning for Vision-Language Models
Yi Zhang
Ke Yu
Siqi Wu
Zhihai He
VLM
53
2
0
02 Jul 2024
A separability-based approach to quantifying generalization: which layer is best?
Luciano Dyballa
Evan Gerritz
Steven W. Zucker
OOD
39
3
0
02 May 2024
Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection
Pengfei Zhou
Weiqing Min
Jiajun Song
Yang Zhang
Shuqiang Jiang
35
10
0
14 Feb 2024
A Closer Look at AUROC and AUPRC under Class Imbalance
Matthew B. A. McDermott
Lasse Hyldig Hansen
Haoran Zhang
Giovanni Angelotti
Jack Gallifant
39
30
0
11 Jan 2024
CLIP in Medical Imaging: A Comprehensive Survey
Zihao Zhao
Yuxiao Liu
Han Wu
Yonghao Li
Sheng Wang
L. Teng
Disheng Liu
Zhiming Cui
Qian Wang
Dinggang Shen
CLIP
MedIm
LM&MA
VLM
31
3
0
12 Dec 2023
Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models
Zhihe Lu
Jiawang Bai
Xin Li
Zeyu Xiao
Xinchao Wang
VLM
52
11
0
28 Nov 2023
Learning to Adapt CLIP for Few-Shot Monocular Depth Estimation
Xue-mei Hu
Ce Zhang
Yi Zhang
Bowen Hai
Ke Yu
Zhihai He
MDE
VLM
51
17
0
02 Nov 2023
SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection
Pengfei Zhou
Weiqing Min
Yang Zhang
Jiajun Song
Ying Jin
Shuqiang Jiang
DiffM
80
5
0
07 Oct 2023
Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP
Zixiang Chen
Yihe Deng
Yuanzhi Li
Quanquan Gu
VLM
28
11
0
02 Oct 2023
Domain-Controlled Prompt Learning
Qinglong Cao
Zhengqin Xu
Yuantian Chen
Chao Ma
Xiaokang Yang
VLM
31
16
0
30 Sep 2023
Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
Shengxiang Zhang
Muzammal Naseer
Guangyi Chen
Zhiqiang Shen
Salman Khan
Kun Zhang
Fahad Shahbaz Khan
VLM
60
5
0
24 Aug 2023
Seeing in Flowing: Adapting CLIP for Action Recognition with Motion Prompts Learning
Qianqian Wang
Junlong Du
Ke Yan
Shouhong Ding
VLM
38
17
0
09 Aug 2023
Cross-Modal Concept Learning and Inference for Vision-Language Models
Yi Zhang
Ce Zhang
Yushun Tang
Z. He
VLM
MLLM
CLIP
39
15
0
28 Jul 2023
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts
Mayug Maniparambil
Chris Vorster
D. Molloy
N. Murphy
Kevin McGuinness
Noel E. O'Connor
CLIP
VLM
MLLM
32
53
0
21 Jul 2023
CoPL: Contextual Prompt Learning for Vision-Language Understanding
Koustava Goswami
Srikrishna Karanam
Prateksha Udhayanan
J. JosephK.
Balaji Vasan Srinivasan
VLM
26
8
0
03 Jul 2023
Multimodal Zero-Shot Learning for Tactile Texture Recognition
G. Cao
Jiaqi Jiang
Danushka Bollegala
Min Li
Shan Luo
18
12
0
22 Jun 2023
MuDPT: Multi-modal Deep-symphysis Prompt Tuning for Large Pre-trained Vision-Language Models
Yongzhu Miao
Shasha Li
Jintao Tang
Ting Wang
VLM
MLLM
VPVLM
34
3
0
20 Jun 2023
Recent Advances of Local Mechanisms in Computer Vision: A Survey and Outlook of Recent Work
Qiangchang Wang
Yilong Yin
43
0
0
02 Jun 2023
Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot Learning
Man Liu
Feng Li
Chunjie Zhang
Yunchao Wei
H. Bai
Yao-Min Zhao
47
39
0
27 Mar 2023
Multi-modal Machine Learning in Engineering Design: A Review and Future Directions
Binyang Song
Ruilin Zhou
Faez Ahmed
AI4CE
42
40
0
14 Feb 2023
Navigating Alignment for Non-identical Client Class Sets: A Label Name-Anchored Federated Learning Framework
Jiayun Zhang
Xiyuan Zhang
Xinyang Zhang
Dezhi Hong
Rajesh K. Gupta
Jingbo Shang
FedML
62
7
0
01 Jan 2023
Unleashing the Power of Shared Label Structures for Human Activity Recognition
Xiyuan Zhang
Ranak Roy Chowdhury
Jiayun Zhang
Dezhi Hong
Rajesh K. Gupta
Jingbo Shang
VLM
18
6
0
01 Jan 2023
Localized Latent Updates for Fine-Tuning Vision-Language Models
Moritz Ibing
I. Lim
Leif Kobbelt
VLM
26
1
0
13 Dec 2022
EPCL: Frozen CLIP Transformer is An Efficient Point Cloud Encoder
Xiaoshui Huang
Zhou Huang
Shengjia Li
Wentao Qu
Tong He
Yuenan Hou
Yifan Zuo
Wanli Ouyang
13
11
0
08 Dec 2022
Multitask Vision-Language Prompt Tuning
Sheng Shen
Shijia Yang
Tianjun Zhang
Bohan Zhai
Joseph E. Gonzalez
Kurt Keutzer
Trevor Darrell
VLM
VPVLM
19
49
0
21 Nov 2022
Task Residual for Tuning Vision-Language Models
Tao Yu
Zhihe Lu
Xin Jin
Zhibo Chen
Xinchao Wang
VLM
CLIP
24
83
0
18 Nov 2022
Text2Model: Text-based Model Induction for Zero-shot Image Classification
Ohad Amosy
Tomer Volk
Eilam Shapira
Eyal Ben-David
Roi Reichart
Gal Chechik
VLM
32
0
0
27 Oct 2022
Learning by Asking Questions for Knowledge-based Novel Object Recognition
Kohei Uehara
Tatsuya Harada
20
1
0
12 Oct 2022
Learning to embed semantic similarity for joint image-text retrieval
Noam Malali
Y. Keller
32
10
0
07 Oct 2022
I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification
Muhammad Ferjad Naeem
Yongqin Xian
Luc Van Gool
F. Tombari
VLM
23
37
0
21 Sep 2022
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models
Rui Qian
Yeqing Li
Zheng Xu
Ming Yang
Serge Belongie
Huayu Chen
VLM
41
22
0
15 Jul 2022
Tight Lower Bounds on Worst-Case Guarantees for Zero-Shot Learning with Attributes
Alessio Mazzetto
Cristina Menghini
A. Yuan
E. Upfal
Stephen H. Bach
VLM
20
1
0
25 May 2022
Generating Representative Samples for Few-Shot Classification
Jingyi Xu
Hieu M. Le
VLM
25
61
0
05 May 2022
Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention
Yu Yang
Seung Wook Kim
Jungseock Joo
FAtt
13
17
0
10 Apr 2022
Mixed Differential Privacy in Computer Vision
Aditya Golatkar
Alessandro Achille
Yu-Xiang Wang
Aaron Roth
Michael Kearns
Stefano Soatto
PICV
VLM
28
49
0
22 Mar 2022
Conditional Prompt Learning for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VLM
CLIP
VPVLM
47
1,294
0
10 Mar 2022
SemSup: Semantic Supervision for Simple and Scalable Zero-shot Generalization
Austin W. Hanjie
Ameet Deshpande
Karthik R. Narasimhan
VLM
36
2
0
26 Feb 2022
On Guiding Visual Attention with Language Specification
Suzanne Petryk
Lisa Dunlap
Keyan Nasseri
Joseph E. Gonzalez
Trevor Darrell
Anna Rohrbach
VLM
206
31
1
17 Feb 2022
A Survey on Visual Transfer Learning using Knowledge Graphs
Sebastian Monka
Lavdim Halilaj
Achim Rettinger
33
23
0
27 Jan 2022
Towards Zero-shot Sign Language Recognition
Yunus Can Bilge
R. G. Cinbis
Nazli Ikizler-Cinbis
SLR
17
36
0
15 Jan 2022
CLIP-Lite: Information Efficient Visual Representation Learning with Language Supervision
A. Shrivastava
Ramprasaath R. Selvaraju
Nikhil Naik
Vicente Ordonez
VLM
CLIP
30
6
0
14 Dec 2021
Dual Progressive Prototype Network for Generalized Zero-Shot Learning
Chaoqun Wang
Shaobo Min
Xuejin Chen
Xiaoyan Sun
Houqiang Li
25
45
0
03 Nov 2021
Fine-Grained Zero-Shot Learning with DNA as Side Information
Sarkhan Badirli
Zeynep Akata
G. Mohler
Christel Picard
M. M. Dundar
SyDa
BDL
46
35
0
29 Sep 2021
Semantics-Guided Contrastive Network for Zero-Shot Object detection
Caixia Yan
Xiao Chang
Minnan Luo
Huan Liu
Xiaoqin Zhang
Qinghua Zheng
ObjD
VLM
67
77
0
04 Sep 2021
1
2
3
4
Next