ResearchTrend.AI
Predicting Deep Zero-Shot Convolutional Neural Networks using Textual Descriptions

1 June 2015
Jimmy Ba
Kevin Swersky
Sanja Fidler
Ruslan Salakhutdinov
    VLM
arXiv: 1506.00511

Papers citing "Predicting Deep Zero-Shot Convolutional Neural Networks using Textual Descriptions"

50 / 197 papers shown
Visual and Semantic Prompt Collaboration for Generalized Zero-Shot Learning
Huajie Jiang
ZeLin Li
Xiaohan Yu
Yongli Hu
Baocai Yin
Jian Yang
Yuankai Qi
VLM
49
0
0
29 Mar 2025
MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification
Xiangyan Qu
Jing Yu
Jiamin Zhuang
Gaopeng Gou
Gang Xiong
Qi Wu
VLM
51
0
0
10 Mar 2025
Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance
Yaoyun Zhang
Xuenan Xu
Mengyue Wu
VGen
36
0
0
24 Dec 2024
An Individual Identity-Driven Framework for Animal Re-Identification
Yihao Wu
Di Zhao
Jingfeng Zhang
Yun Sing Koh
36
0
0
30 Oct 2024
Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning
Xiangyan Qu
Jing Yu
Keke Gai
Jiamin Zhuang
Yuanmin Tang
Gang Xiong
Gaopeng Gou
Qi Wu
49
2
0
22 Jul 2024
NODE-Adapter: Neural Ordinary Differential Equations for Better Vision-Language Reasoning
Yi Zhang
Chun-Wun Cheng
Ke Yu
Zhihai He
Carola-Bibiane Schonlieb
Angelica I Aviles-Rivero
VLM
55
2
0
11 Jul 2024
Conceptual Codebook Learning for Vision-Language Models
Yi Zhang
Ke Yu
Siqi Wu
Zhihai He
VLM
53
2
0
02 Jul 2024
A separability-based approach to quantifying generalization: which layer is best?
Luciano Dyballa
Evan Gerritz
Steven W. Zucker
OOD
39
3
0
02 May 2024
Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection
Pengfei Zhou
Weiqing Min
Jiajun Song
Yang Zhang
Shuqiang Jiang
35
10
0
14 Feb 2024
A Closer Look at AUROC and AUPRC under Class Imbalance
Matthew B. A. McDermott
Lasse Hyldig Hansen
Haoran Zhang
Giovanni Angelotti
Jack Gallifant
39
30
0
11 Jan 2024
CLIP in Medical Imaging: A Comprehensive Survey
Zihao Zhao
Yuxiao Liu
Han Wu
Yonghao Li
Sheng Wang
L. Teng
Disheng Liu
Zhiming Cui
Qian Wang
Dinggang Shen
CLIP
MedIm
LM&MA
VLM
31
3
0
12 Dec 2023
Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models
Zhihe Lu
Jiawang Bai
Xin Li
Zeyu Xiao
Xinchao Wang
VLM
52
11
0
28 Nov 2023
Learning to Adapt CLIP for Few-Shot Monocular Depth Estimation
Xue-mei Hu
Ce Zhang
Yi Zhang
Bowen Hai
Ke Yu
Zhihai He
MDE
VLM
51
17
0
02 Nov 2023
SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection
Pengfei Zhou
Weiqing Min
Yang Zhang
Jiajun Song
Ying Jin
Shuqiang Jiang
DiffM
80
5
0
07 Oct 2023
Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP
Zixiang Chen
Yihe Deng
Yuanzhi Li
Quanquan Gu
VLM
28
11
0
02 Oct 2023
Domain-Controlled Prompt Learning
Qinglong Cao
Zhengqin Xu
Yuantian Chen
Chao Ma
Xiaokang Yang
VLM
31
16
0
30 Sep 2023
Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
Shengxiang Zhang
Muzammal Naseer
Guangyi Chen
Zhiqiang Shen
Salman Khan
Kun Zhang
Fahad Shahbaz Khan
VLM
60
5
0
24 Aug 2023
Seeing in Flowing: Adapting CLIP for Action Recognition with Motion Prompts Learning
Qianqian Wang
Junlong Du
Ke Yan
Shouhong Ding
VLM
38
17
0
09 Aug 2023
Cross-Modal Concept Learning and Inference for Vision-Language Models
Yi Zhang
Ce Zhang
Yushun Tang
Z. He
VLM
MLLM
CLIP
39
15
0
28 Jul 2023
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts
Mayug Maniparambil
Chris Vorster
D. Molloy
N. Murphy
Kevin McGuinness
Noel E. O'Connor
CLIP
VLM
MLLM
32
53
0
21 Jul 2023
CoPL: Contextual Prompt Learning for Vision-Language Understanding
Koustava Goswami
Srikrishna Karanam
Prateksha Udhayanan
K. J. Joseph
Balaji Vasan Srinivasan
VLM
26
8
0
03 Jul 2023
Multimodal Zero-Shot Learning for Tactile Texture Recognition
G. Cao
Jiaqi Jiang
Danushka Bollegala
Min Li
Shan Luo
18
12
0
22 Jun 2023
MuDPT: Multi-modal Deep-symphysis Prompt Tuning for Large Pre-trained Vision-Language Models
Yongzhu Miao
Shasha Li
Jintao Tang
Ting Wang
VLM
MLLM
VPVLM
34
3
0
20 Jun 2023
Recent Advances of Local Mechanisms in Computer Vision: A Survey and Outlook of Recent Work
Qiangchang Wang
Yilong Yin
43
0
0
02 Jun 2023
Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot Learning
Man Liu
Feng Li
Chunjie Zhang
Yunchao Wei
H. Bai
Yao-Min Zhao
47
39
0
27 Mar 2023
Multi-modal Machine Learning in Engineering Design: A Review and Future Directions
Binyang Song
Ruilin Zhou
Faez Ahmed
AI4CE
42
40
0
14 Feb 2023
Navigating Alignment for Non-identical Client Class Sets: A Label Name-Anchored Federated Learning Framework
Jiayun Zhang
Xiyuan Zhang
Xinyang Zhang
Dezhi Hong
Rajesh K. Gupta
Jingbo Shang
FedML
62
7
0
01 Jan 2023
Unleashing the Power of Shared Label Structures for Human Activity Recognition
Xiyuan Zhang
Ranak Roy Chowdhury
Jiayun Zhang
Dezhi Hong
Rajesh K. Gupta
Jingbo Shang
VLM
18
6
0
01 Jan 2023
Localized Latent Updates for Fine-Tuning Vision-Language Models
Moritz Ibing
I. Lim
Leif Kobbelt
VLM
26
1
0
13 Dec 2022
EPCL: Frozen CLIP Transformer is An Efficient Point Cloud Encoder
Xiaoshui Huang
Zhou Huang
Shengjia Li
Wentao Qu
Tong He
Yuenan Hou
Yifan Zuo
Wanli Ouyang
13
11
0
08 Dec 2022
Multitask Vision-Language Prompt Tuning
Sheng Shen
Shijia Yang
Tianjun Zhang
Bohan Zhai
Joseph E. Gonzalez
Kurt Keutzer
Trevor Darrell
VLM
VPVLM
19
49
0
21 Nov 2022
Task Residual for Tuning Vision-Language Models
Tao Yu
Zhihe Lu
Xin Jin
Zhibo Chen
Xinchao Wang
VLM
CLIP
24
83
0
18 Nov 2022
Text2Model: Text-based Model Induction for Zero-shot Image Classification
Ohad Amosy
Tomer Volk
Eilam Shapira
Eyal Ben-David
Roi Reichart
Gal Chechik
VLM
32
0
0
27 Oct 2022
Learning by Asking Questions for Knowledge-based Novel Object Recognition
Kohei Uehara
Tatsuya Harada
20
1
0
12 Oct 2022
Learning to embed semantic similarity for joint image-text retrieval
Noam Malali
Y. Keller
32
10
0
07 Oct 2022
I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification
Muhammad Ferjad Naeem
Yongqin Xian
Luc Van Gool
F. Tombari
VLM
23
37
0
21 Sep 2022
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models
Rui Qian
Yeqing Li
Zheng Xu
Ming Yang
Serge Belongie
Huayu Chen
VLM
41
22
0
15 Jul 2022
Tight Lower Bounds on Worst-Case Guarantees for Zero-Shot Learning with Attributes
Alessio Mazzetto
Cristina Menghini
A. Yuan
E. Upfal
Stephen H. Bach
VLM
20
1
0
25 May 2022
Generating Representative Samples for Few-Shot Classification
Jingyi Xu
Hieu M. Le
VLM
25
61
0
05 May 2022
Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention
Yu Yang
Seung Wook Kim
Jungseock Joo
FAtt
13
17
0
10 Apr 2022
Mixed Differential Privacy in Computer Vision
Aditya Golatkar
Alessandro Achille
Yu-Xiang Wang
Aaron Roth
Michael Kearns
Stefano Soatto
PICV
VLM
28
49
0
22 Mar 2022
Conditional Prompt Learning for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VLM
CLIP
VPVLM
47
1,294
0
10 Mar 2022
SemSup: Semantic Supervision for Simple and Scalable Zero-shot Generalization
Austin W. Hanjie
Ameet Deshpande
Karthik R. Narasimhan
VLM
36
2
0
26 Feb 2022
On Guiding Visual Attention with Language Specification
Suzanne Petryk
Lisa Dunlap
Keyan Nasseri
Joseph E. Gonzalez
Trevor Darrell
Anna Rohrbach
VLM
206
31
1
17 Feb 2022
A Survey on Visual Transfer Learning using Knowledge Graphs
Sebastian Monka
Lavdim Halilaj
Achim Rettinger
33
23
0
27 Jan 2022
Towards Zero-shot Sign Language Recognition
Yunus Can Bilge
R. G. Cinbis
Nazli Ikizler-Cinbis
SLR
17
36
0
15 Jan 2022
CLIP-Lite: Information Efficient Visual Representation Learning with Language Supervision
A. Shrivastava
Ramprasaath R. Selvaraju
Nikhil Naik
Vicente Ordonez
VLM
CLIP
30
6
0
14 Dec 2021
Dual Progressive Prototype Network for Generalized Zero-Shot Learning
Chaoqun Wang
Shaobo Min
Xuejin Chen
Xiaoyan Sun
Houqiang Li
25
45
0
03 Nov 2021
Fine-Grained Zero-Shot Learning with DNA as Side Information
Sarkhan Badirli
Zeynep Akata
G. Mohler
Christel Picard
M. M. Dundar
SyDa
BDL
46
35
0
29 Sep 2021
Semantics-Guided Contrastive Network for Zero-Shot Object detection
Caixia Yan
Xiao Chang
Minnan Luo
Huan Liu
Xiaoqin Zhang
Qinghua Zheng
ObjD
VLM
67
77
0
04 Sep 2021