Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.07183
Cited By
Visual Classification via Description from Large Language Models
13 October 2022
Sachit Menon
Carl Vondrick
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Visual Classification via Description from Large Language Models"
50 / 225 papers shown
Title
Intra-Modal Proxy Learning for Zero-Shot Visual Categorization with CLIP
Qi Qian
Yuanhong Xu
Juhua Hu
VLM
CLIP
32
16
0
30 Oct 2023
Open Visual Knowledge Extraction via Relation-Oriented Multimodality Model Prompting
Hejie Cui
Xinyu Fang
Zihan Zhang
Ran Xu
Xuan Kan
Xin Liu
Yue Yu
Manling Li
Yangqiu Song
Carl Yang
VLM
28
4
0
28 Oct 2023
Image Clustering Conditioned on Text Criteria
Sehyun Kwon
Jaeseung Park
Minkyu Kim
Jaewoong Cho
Ernest K. Ryu
Kangwook Lee
VLM
39
11
0
27 Oct 2023
EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition
Niki Maria Foteinopoulou
Ioannis Patras
VLM
19
16
0
25 Oct 2023
On the Powerfulness of Textual Outlier Exposure for Visual OoD Detection
Sangha Park
J. Mok
Dahuin Jung
Saehyung Lee
Sung-Hoon Yoon
24
10
0
25 Oct 2023
Videoprompter: an ensemble of foundational models for zero-shot video understanding
Adeel Yousaf
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
Mubarak Shah
VLM
38
2
0
23 Oct 2023
Large Language Models can Share Images, Too!
Young-Jun Lee
Dokyong Lee
Joo Won Sung
Jonghwan Hyeon
Ho-Jin Choi
MLLM
24
2
0
23 Oct 2023
Open-Set Image Tagging with Multi-Grained Text Supervision
Xinyu Huang
Yi-Jie Huang
Youcai Zhang
Weiwei Tian
Rui Feng
Yuejie Zhang
Yanchun Xie
Yaqian Li
Lei Zhang
VLM
30
28
0
23 Oct 2023
3D-GPT: Procedural 3D Modeling with Large Language Models
Chunyi Sun
Junlin Han
Weijian Deng
Xinlong Wang
Zishan Qin
Stephen Gould
39
39
0
19 Oct 2023
Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style Attacks
Jiaying Wu
Bryan Hooi
39
54
0
16 Oct 2023
Automated Natural Language Explanation of Deep Visual Neurons with Large Models
Chenxu Zhao
Wei Qian
Yucheng Shi
Mengdi Huai
Ninghao Liu
29
2
0
16 Oct 2023
Large Models for Time Series and Spatio-Temporal Data: A Survey and Outlook
Ming Jin
Qingsong Wen
Keli Zhang
Chaoli Zhang
Siqiao Xue
...
Shirui Pan
Vincent S. Tseng
Yu Zheng
Lei Chen
Hui Xiong
AI4TS
SyDa
35
117
0
16 Oct 2023
Prompting Scientific Names for Zero-Shot Species Recognition
Shubham Parashar
Zhiqiu Lin
Yanan Li
Shu Kong
VLM
23
12
0
15 Oct 2023
Vision-by-Language for Training-Free Compositional Image Retrieval
Shyamgopal Karthik
Karsten Roth
Massimiliano Mancini
Zeynep Akata
CoGe
28
52
0
13 Oct 2023
Visual Data-Type Understanding does not emerge from Scaling Vision-Language Models
Vishaal Udandarao
Max F. Burg
Samuel Albanie
Matthias Bethge
VLM
36
9
0
12 Oct 2023
Leveraging Vision-Language Models for Improving Domain Generalization in Image Classification
Sravanti Addepalli
Ashish Ramayee Asokan
Lakshay Sharma
R. V. Babu
VLM
24
15
0
12 Oct 2023
Exploring Large Language Models for Multi-Modal Out-of-Distribution Detection
Yi Dai
Hao Lang
Kaisheng Zeng
Fei Huang
Yongbin Li
OODD
26
10
0
12 Oct 2023
Investigating the Limitation of CLIP Models: The Worst-Performing Categories
Jiejing Shao
Jiang-Xin Shi
Xiao-Wen Yang
Lan-Zhe Guo
Yu-Feng Li
VLM
31
10
0
05 Oct 2023
Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models
An Yan
Yu-Xiang Wang
Yiwu Zhong
Zexue He
Petros Karypis
...
Chengyu Dong
Amilcare Gentili
Chun-Nan Hsu
Jingbo Shang
Julian McAuley
27
30
0
04 Oct 2023
AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models
Sanghwan Kim
Hao Tang
Fisher Yu
VLM
CLIP
21
4
0
28 Sep 2023
Improving CLIP Robustness with Knowledge Distillation and Self-Training
Clement Laroudie
Andrei Bursuc
Mai Lan Ha
Gianni Franchi
VLM
26
5
0
19 Sep 2023
Long-Tail Learning with Foundation Model: Heavy Fine-Tuning Hurts
Jiang-Xin Shi
Tong Wei
Zhi-Hua Zhou
Jiejing Shao
Xin-Yan Han
Yu-Feng Li
34
26
0
18 Sep 2023
Zero-Shot Visual Classification with Guided Cropping
Piyapat Saranrittichai
Mauricio Muñoz
Volker Fischer
Chaithanya Kumar Mummadi
VLM
32
1
0
12 Sep 2023
Language Models as Black-Box Optimizers for Vision-Language Models
Shihong Liu
Zhiqiu Lin
Samuel Yu
Ryan Lee
Tiffany Ling
Deepak Pathak
Deva Ramanan
VLM
32
28
0
12 Sep 2023
A Co-design Study for Multi-Stakeholder Job Recommender System Explanations
Roan Schellingerhout
Francesco Barile
N. Tintarev
9
5
0
11 Sep 2023
Zero-Shot Robustification of Zero-Shot Models
Dyah Adila
Changho Shin
Lin Cai
Frederic Sala
40
18
0
08 Sep 2023
Context-Aware Prompt Tuning for Vision-Language Model with Dual-Alignment
Hongyu Hu
Tiancheng Lin
Jie Wang
Zhenbang Sun
Yi Xu
MLLM
VLM
VPVLM
16
1
0
08 Sep 2023
TExplain: Explaining Learned Visual Features via Pre-trained (Frozen) Language Models
Saeid Asgari Taghanaki
Aliasghar Khani
Ali Saheb Pasand
Amir Khasahmadi
Aditya Sanghi
K. Willis
Ali Mahdavi-Amiri
FAtt
VLM
27
0
0
01 Sep 2023
AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation
Chaofan Ma
Yu-Hao Yang
Chen Ju
Fei Zhang
Ya Zhang
Yanfeng Wang
VLM
48
17
0
31 Aug 2023
Cross-Modal Retrieval Meets Inference:Improving Zero-Shot Classification with Cross-Modal Retrieval
Seong-Hoon Eom
Namgyu Ho
Jaehoon Oh
Se-Young Yun
CLIP
VLM
35
0
0
29 Aug 2023
Prompting Visual-Language Models for Dynamic Facial Expression Recognition
Zengqun Zhao
Ioannis Patras
VLM
13
33
0
25 Aug 2023
Variational Information Pursuit with Large Language and Multimodal Models for Interpretable Predictions
Kwan Ho Ryan Chan
Aditya Chattopadhyay
B. Haeffele
René Vidal
40
0
0
24 Aug 2023
Unsupervised Prototype Adapter for Vision-Language Models
Yi Zhang
Ce Zhang
Xue-mei Hu
Z. He
VLM
29
4
0
22 Aug 2023
Uni-NLX: Unifying Textual Explanations for Vision and Vision-Language Tasks
Fawaz Sammani
Nikos Deligiannis
13
5
0
17 Aug 2023
A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision
Julio Silva-Rodríguez
H. Chakor
Riadh Kobbi
Jose Dolz
Ismail Ben Ayed
VLM
MedIm
72
33
0
15 Aug 2023
Few-shot medical image classification with simple shape and texture text descriptors using vision-language models
Michal Byra
M. F. Rachmadi
Henrik Skibbe
VLM
38
6
0
08 Aug 2023
Learning Concise and Descriptive Attributes for Visual Recognition
Andy Yan
Yu-Xiang Wang
Yiwu Zhong
Chengyu Dong
Zexue He
Yujie Lu
William Wang
Jingbo Shang
Julian McAuley
VLM
27
60
0
07 Aug 2023
PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts
Bang An
Sicheng Zhu
Michael-Andrei Panaitescu-Liess
Chaithanya Kumar Mummadi
Furong Huang
VLM
33
7
0
02 Aug 2023
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts
Mayug Maniparambil
Chris Vorster
D. Molloy
N. Murphy
Kevin McGuinness
Noel E. O'Connor
CLIP
VLM
MLLM
29
53
0
21 Jul 2023
Language-based Action Concept Spaces Improve Video Self-Supervised Learning
Kanchana Ranasinghe
Michael S. Ryoo
SSL
VLM
40
12
0
20 Jul 2023
PiTL: Cross-modal Retrieval with Weakly-supervised Vision-language Pre-training via Prompting
Zixin Guo
T. Wang
Selen Pehlivan
Abduljalil Radman
Jorma T. Laaksonen
VLM
27
2
0
14 Jul 2023
Leveraging Vision-Language Foundation Models for Fine-Grained Downstream Tasks
Denis Coquenet
Clément Rambour
Emanuele Dalsasso
Nicolas Thome
MLLM
CLIP
VLM
37
1
0
13 Jul 2023
Text Descriptions are Compressive and Invariant Representations for Visual Learning
Zhili Feng
Anna Bair
J. Zico Kolter
VLM
24
6
0
10 Jul 2023
A ChatGPT Aided Explainable Framework for Zero-Shot Medical Image Diagnosis
Jiaxiang Liu
Tianxiang Hu
Yan Zhang
Xiaotang Gai
Yang Feng
Zuozhu Liu
LM&MA
MedIm
39
32
0
05 Jul 2023
Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language
William Berrios
Gautam Mittal
Tristan Thrush
Douwe Kiela
Amanpreet Singh
MLLM
VLM
15
61
0
28 Jun 2023
DesCo: Learning Object Recognition with Rich Language Descriptions
Liunian Harold Li
Zi-Yi Dou
Nanyun Peng
Kai-Wei Chang
ObjD
VLM
28
20
0
24 Jun 2023
Neural Priming for Sample-Efficient Adaptation
Matthew Wallingford
Vivek Ramanujan
Alex Fang
Aditya Kusupati
Roozbeh Mottaghi
Aniruddha Kembhavi
Ludwig Schmidt
Ali Farhadi
VLM
108
13
0
16 Jun 2023
Waffling around for Performance: Visual Classification with Random Words and Broad Concepts
Karsten Roth
Jae Myung Kim
A. Sophia Koepke
Oriol Vinyals
Cordelia Schmid
Zeynep Akata
VLM
26
70
0
12 Jun 2023
Multi-Modal Classifiers for Open-Vocabulary Object Detection
Prannay Kaul
Weidi Xie
Andrew Zisserman
ObjD
VLM
MLLM
14
47
0
08 Jun 2023
HUB: Guiding Learned Optimizers with Continuous Prompt Tuning
Gaole Dai
Wei Yu Wu
Ziyu Wang
Jie Fu
Shanghang Zhang
Tiejun Huang
AIFin
14
0
0
26 May 2023
Previous
1
2
3
4
5
Next