ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.07183
  4. Cited By
Visual Classification via Description from Large Language Models

Visual Classification via Description from Large Language Models

13 October 2022
Sachit Menon
Carl Vondrick
    VLM
ArXivPDFHTML

Papers citing "Visual Classification via Description from Large Language Models"

50 / 225 papers shown
Title
Intra-Modal Proxy Learning for Zero-Shot Visual Categorization with CLIP
Intra-Modal Proxy Learning for Zero-Shot Visual Categorization with CLIP
Qi Qian
Yuanhong Xu
Juhua Hu
VLM
CLIP
32
16
0
30 Oct 2023
Open Visual Knowledge Extraction via Relation-Oriented Multimodality
  Model Prompting
Open Visual Knowledge Extraction via Relation-Oriented Multimodality Model Prompting
Hejie Cui
Xinyu Fang
Zihan Zhang
Ran Xu
Xuan Kan
Xin Liu
Yue Yu
Manling Li
Yangqiu Song
Carl Yang
VLM
28
4
0
28 Oct 2023
Image Clustering Conditioned on Text Criteria
Image Clustering Conditioned on Text Criteria
Sehyun Kwon
Jaeseung Park
Minkyu Kim
Jaewoong Cho
Ernest K. Ryu
Kangwook Lee
VLM
39
11
0
27 Oct 2023
EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression
  Recognition
EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition
Niki Maria Foteinopoulou
Ioannis Patras
VLM
19
16
0
25 Oct 2023
On the Powerfulness of Textual Outlier Exposure for Visual OoD Detection
On the Powerfulness of Textual Outlier Exposure for Visual OoD Detection
Sangha Park
J. Mok
Dahuin Jung
Saehyung Lee
Sung-Hoon Yoon
24
10
0
25 Oct 2023
Videoprompter: an ensemble of foundational models for zero-shot video
  understanding
Videoprompter: an ensemble of foundational models for zero-shot video understanding
Adeel Yousaf
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
Mubarak Shah
VLM
38
2
0
23 Oct 2023
Large Language Models can Share Images, Too!
Large Language Models can Share Images, Too!
Young-Jun Lee
Dokyong Lee
Joo Won Sung
Jonghwan Hyeon
Ho-Jin Choi
MLLM
24
2
0
23 Oct 2023
Open-Set Image Tagging with Multi-Grained Text Supervision
Open-Set Image Tagging with Multi-Grained Text Supervision
Xinyu Huang
Yi-Jie Huang
Youcai Zhang
Weiwei Tian
Rui Feng
Yuejie Zhang
Yanchun Xie
Yaqian Li
Lei Zhang
VLM
30
28
0
23 Oct 2023
3D-GPT: Procedural 3D Modeling with Large Language Models
3D-GPT: Procedural 3D Modeling with Large Language Models
Chunyi Sun
Junlin Han
Weijian Deng
Xinlong Wang
Zishan Qin
Stephen Gould
39
39
0
19 Oct 2023
Fake News in Sheep's Clothing: Robust Fake News Detection Against
  LLM-Empowered Style Attacks
Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style Attacks
Jiaying Wu
Bryan Hooi
39
54
0
16 Oct 2023
Automated Natural Language Explanation of Deep Visual Neurons with Large
  Models
Automated Natural Language Explanation of Deep Visual Neurons with Large Models
Chenxu Zhao
Wei Qian
Yucheng Shi
Mengdi Huai
Ninghao Liu
29
2
0
16 Oct 2023
Large Models for Time Series and Spatio-Temporal Data: A Survey and
  Outlook
Large Models for Time Series and Spatio-Temporal Data: A Survey and Outlook
Ming Jin
Qingsong Wen
Keli Zhang
Chaoli Zhang
Siqiao Xue
...
Shirui Pan
Vincent S. Tseng
Yu Zheng
Lei Chen
Hui Xiong
AI4TS
SyDa
35
117
0
16 Oct 2023
Prompting Scientific Names for Zero-Shot Species Recognition
Prompting Scientific Names for Zero-Shot Species Recognition
Shubham Parashar
Zhiqiu Lin
Yanan Li
Shu Kong
VLM
23
12
0
15 Oct 2023
Vision-by-Language for Training-Free Compositional Image Retrieval
Vision-by-Language for Training-Free Compositional Image Retrieval
Shyamgopal Karthik
Karsten Roth
Massimiliano Mancini
Zeynep Akata
CoGe
28
52
0
13 Oct 2023
Visual Data-Type Understanding does not emerge from Scaling
  Vision-Language Models
Visual Data-Type Understanding does not emerge from Scaling Vision-Language Models
Vishaal Udandarao
Max F. Burg
Samuel Albanie
Matthias Bethge
VLM
36
9
0
12 Oct 2023
Leveraging Vision-Language Models for Improving Domain Generalization in
  Image Classification
Leveraging Vision-Language Models for Improving Domain Generalization in Image Classification
Sravanti Addepalli
Ashish Ramayee Asokan
Lakshay Sharma
R. V. Babu
VLM
24
15
0
12 Oct 2023
Exploring Large Language Models for Multi-Modal Out-of-Distribution
  Detection
Exploring Large Language Models for Multi-Modal Out-of-Distribution Detection
Yi Dai
Hao Lang
Kaisheng Zeng
Fei Huang
Yongbin Li
OODD
26
10
0
12 Oct 2023
Investigating the Limitation of CLIP Models: The Worst-Performing
  Categories
Investigating the Limitation of CLIP Models: The Worst-Performing Categories
Jiejing Shao
Jiang-Xin Shi
Xiao-Wen Yang
Lan-Zhe Guo
Yu-Feng Li
VLM
31
10
0
05 Oct 2023
Robust and Interpretable Medical Image Classifiers via Concept
  Bottleneck Models
Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models
An Yan
Yu-Xiang Wang
Yiwu Zhong
Zexue He
Petros Karypis
...
Chengyu Dong
Amilcare Gentili
Chun-Nan Hsu
Jingbo Shang
Julian McAuley
27
30
0
04 Oct 2023
AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models
AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models
Sanghwan Kim
Hao Tang
Fisher Yu
VLM
CLIP
21
4
0
28 Sep 2023
Improving CLIP Robustness with Knowledge Distillation and Self-Training
Improving CLIP Robustness with Knowledge Distillation and Self-Training
Clement Laroudie
Andrei Bursuc
Mai Lan Ha
Gianni Franchi
VLM
26
5
0
19 Sep 2023
Long-Tail Learning with Foundation Model: Heavy Fine-Tuning Hurts
Long-Tail Learning with Foundation Model: Heavy Fine-Tuning Hurts
Jiang-Xin Shi
Tong Wei
Zhi-Hua Zhou
Jiejing Shao
Xin-Yan Han
Yu-Feng Li
34
26
0
18 Sep 2023
Zero-Shot Visual Classification with Guided Cropping
Zero-Shot Visual Classification with Guided Cropping
Piyapat Saranrittichai
Mauricio Muñoz
Volker Fischer
Chaithanya Kumar Mummadi
VLM
32
1
0
12 Sep 2023
Language Models as Black-Box Optimizers for Vision-Language Models
Language Models as Black-Box Optimizers for Vision-Language Models
Shihong Liu
Zhiqiu Lin
Samuel Yu
Ryan Lee
Tiffany Ling
Deepak Pathak
Deva Ramanan
VLM
32
28
0
12 Sep 2023
A Co-design Study for Multi-Stakeholder Job Recommender System
  Explanations
A Co-design Study for Multi-Stakeholder Job Recommender System Explanations
Roan Schellingerhout
Francesco Barile
N. Tintarev
9
5
0
11 Sep 2023
Zero-Shot Robustification of Zero-Shot Models
Zero-Shot Robustification of Zero-Shot Models
Dyah Adila
Changho Shin
Lin Cai
Frederic Sala
40
18
0
08 Sep 2023
Context-Aware Prompt Tuning for Vision-Language Model with
  Dual-Alignment
Context-Aware Prompt Tuning for Vision-Language Model with Dual-Alignment
Hongyu Hu
Tiancheng Lin
Jie Wang
Zhenbang Sun
Yi Xu
MLLM
VLM
VPVLM
16
1
0
08 Sep 2023
TExplain: Explaining Learned Visual Features via Pre-trained (Frozen)
  Language Models
TExplain: Explaining Learned Visual Features via Pre-trained (Frozen) Language Models
Saeid Asgari Taghanaki
Aliasghar Khani
Ali Saheb Pasand
Amir Khasahmadi
Aditya Sanghi
K. Willis
Ali Mahdavi-Amiri
FAtt
VLM
27
0
0
01 Sep 2023
AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute
  Decomposition-Aggregation
AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation
Chaofan Ma
Yu-Hao Yang
Chen Ju
Fei Zhang
Ya Zhang
Yanfeng Wang
VLM
48
17
0
31 Aug 2023
Cross-Modal Retrieval Meets Inference:Improving Zero-Shot Classification
  with Cross-Modal Retrieval
Cross-Modal Retrieval Meets Inference:Improving Zero-Shot Classification with Cross-Modal Retrieval
Seong-Hoon Eom
Namgyu Ho
Jaehoon Oh
Se-Young Yun
CLIP
VLM
35
0
0
29 Aug 2023
Prompting Visual-Language Models for Dynamic Facial Expression
  Recognition
Prompting Visual-Language Models for Dynamic Facial Expression Recognition
Zengqun Zhao
Ioannis Patras
VLM
13
33
0
25 Aug 2023
Variational Information Pursuit with Large Language and Multimodal
  Models for Interpretable Predictions
Variational Information Pursuit with Large Language and Multimodal Models for Interpretable Predictions
Kwan Ho Ryan Chan
Aditya Chattopadhyay
B. Haeffele
René Vidal
40
0
0
24 Aug 2023
Unsupervised Prototype Adapter for Vision-Language Models
Unsupervised Prototype Adapter for Vision-Language Models
Yi Zhang
Ce Zhang
Xue-mei Hu
Z. He
VLM
29
4
0
22 Aug 2023
Uni-NLX: Unifying Textual Explanations for Vision and Vision-Language
  Tasks
Uni-NLX: Unifying Textual Explanations for Vision and Vision-Language Tasks
Fawaz Sammani
Nikos Deligiannis
13
5
0
17 Aug 2023
A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision
A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision
Julio Silva-Rodríguez
H. Chakor
Riadh Kobbi
Jose Dolz
Ismail Ben Ayed
VLM
MedIm
72
33
0
15 Aug 2023
Few-shot medical image classification with simple shape and texture text
  descriptors using vision-language models
Few-shot medical image classification with simple shape and texture text descriptors using vision-language models
Michal Byra
M. F. Rachmadi
Henrik Skibbe
VLM
38
6
0
08 Aug 2023
Learning Concise and Descriptive Attributes for Visual Recognition
Learning Concise and Descriptive Attributes for Visual Recognition
Andy Yan
Yu-Xiang Wang
Yiwu Zhong
Chengyu Dong
Zexue He
Yujie Lu
William Wang
Jingbo Shang
Julian McAuley
VLM
27
60
0
07 Aug 2023
PerceptionCLIP: Visual Classification by Inferring and Conditioning on
  Contexts
PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts
Bang An
Sicheng Zhu
Michael-Andrei Panaitescu-Liess
Chaithanya Kumar Mummadi
Furong Huang
VLM
33
7
0
02 Aug 2023
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts
Mayug Maniparambil
Chris Vorster
D. Molloy
N. Murphy
Kevin McGuinness
Noel E. O'Connor
CLIP
VLM
MLLM
29
53
0
21 Jul 2023
Language-based Action Concept Spaces Improve Video Self-Supervised
  Learning
Language-based Action Concept Spaces Improve Video Self-Supervised Learning
Kanchana Ranasinghe
Michael S. Ryoo
SSL
VLM
40
12
0
20 Jul 2023
PiTL: Cross-modal Retrieval with Weakly-supervised Vision-language
  Pre-training via Prompting
PiTL: Cross-modal Retrieval with Weakly-supervised Vision-language Pre-training via Prompting
Zixin Guo
T. Wang
Selen Pehlivan
Abduljalil Radman
Jorma T. Laaksonen
VLM
27
2
0
14 Jul 2023
Leveraging Vision-Language Foundation Models for Fine-Grained Downstream
  Tasks
Leveraging Vision-Language Foundation Models for Fine-Grained Downstream Tasks
Denis Coquenet
Clément Rambour
Emanuele Dalsasso
Nicolas Thome
MLLM
CLIP
VLM
37
1
0
13 Jul 2023
Text Descriptions are Compressive and Invariant Representations for
  Visual Learning
Text Descriptions are Compressive and Invariant Representations for Visual Learning
Zhili Feng
Anna Bair
J. Zico Kolter
VLM
24
6
0
10 Jul 2023
A ChatGPT Aided Explainable Framework for Zero-Shot Medical Image
  Diagnosis
A ChatGPT Aided Explainable Framework for Zero-Shot Medical Image Diagnosis
Jiaxiang Liu
Tianxiang Hu
Yan Zhang
Xiaotang Gai
Yang Feng
Zuozhu Liu
LM&MA
MedIm
39
32
0
05 Jul 2023
Towards Language Models That Can See: Computer Vision Through the LENS
  of Natural Language
Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language
William Berrios
Gautam Mittal
Tristan Thrush
Douwe Kiela
Amanpreet Singh
MLLM
VLM
15
61
0
28 Jun 2023
DesCo: Learning Object Recognition with Rich Language Descriptions
DesCo: Learning Object Recognition with Rich Language Descriptions
Liunian Harold Li
Zi-Yi Dou
Nanyun Peng
Kai-Wei Chang
ObjD
VLM
28
20
0
24 Jun 2023
Neural Priming for Sample-Efficient Adaptation
Neural Priming for Sample-Efficient Adaptation
Matthew Wallingford
Vivek Ramanujan
Alex Fang
Aditya Kusupati
Roozbeh Mottaghi
Aniruddha Kembhavi
Ludwig Schmidt
Ali Farhadi
VLM
108
13
0
16 Jun 2023
Waffling around for Performance: Visual Classification with Random Words
  and Broad Concepts
Waffling around for Performance: Visual Classification with Random Words and Broad Concepts
Karsten Roth
Jae Myung Kim
A. Sophia Koepke
Oriol Vinyals
Cordelia Schmid
Zeynep Akata
VLM
26
70
0
12 Jun 2023
Multi-Modal Classifiers for Open-Vocabulary Object Detection
Multi-Modal Classifiers for Open-Vocabulary Object Detection
Prannay Kaul
Weidi Xie
Andrew Zisserman
ObjD
VLM
MLLM
14
47
0
08 Jun 2023
HUB: Guiding Learned Optimizers with Continuous Prompt Tuning
Gaole Dai
Wei Yu Wu
Ziyu Wang
Jie Fu
Shanghang Zhang
Tiejun Huang
AIFin
14
0
0
26 May 2023
Previous
12345
Next