Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.07183
Cited By
Visual Classification via Description from Large Language Models
13 October 2022
Sachit Menon
Carl Vondrick
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Visual Classification via Description from Large Language Models"
25 / 225 papers shown
Title
Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning
Sanjoy Kundu
Shubham Trehan
Sathyanarayanan N. Aakur
LRM
LM&Ro
27
1
0
26 May 2023
In-Context Impersonation Reveals Large Language Models' Strengths and Biases
Leonard Salewski
Stephan Alaniz
Isabel Rio-Torto
Eric Schulz
Zeynep Akata
44
149
0
24 May 2023
Prompting Language-Informed Distribution for Compositional Zero-Shot Learning
Wentao Bao
Lichang Chen
Heng-Chiao Huang
Yu Kong
CoGe
VLM
29
12
0
23 May 2023
Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models
Lin Li
Jun Xiao
Guikun Chen
Jian Shao
Yueting Zhuang
Long Chen
VLM
32
26
0
21 May 2023
A Survey on Out-of-Distribution Detection in NLP
Hao Lang
Yinhe Zheng
Yixuan Li
Jian Sun
Feiling Huang
Yongbin Li
29
20
0
05 May 2023
RPLKG: Robust Prompt Learning with Knowledge Graph
Yewon Kim
Yongtaek Lim
Dokyung Yoon
Kyungwoo Song
VLM
6
0
0
21 Apr 2023
Exposing and Mitigating Spurious Correlations for Cross-Modal Retrieval
Jae Myung Kim
A. Sophia Koepke
Cordelia Schmid
Zeynep Akata
78
25
0
06 Apr 2023
VicTR: Video-conditioned Text Representations for Activity Recognition
Kumara Kahatapitiya
Anurag Arnab
Arsha Nagrani
Michael S. Ryoo
36
19
0
05 Apr 2023
Vision-Language Models for Vision Tasks: A Survey
Jingyi Zhang
Jiaxing Huang
Sheng Jin
Shijian Lu
VLM
41
483
0
03 Apr 2023
Xplainer: From X-Ray Observations to Explainable Zero-Shot Diagnosis
Chantal Pellegrini
Matthias Keicher
Ege Ozsoy
Petra Jirásková
R. Braren
Nassir Navab
MedIm
15
20
0
23 Mar 2023
Open-Vocabulary Object Detection using Pseudo Caption Labels
Han-Cheol Cho
Won Young Jhoo
Woohyun Kang
Byungseok Roh
VLM
ObjD
32
20
0
23 Mar 2023
Investigating the Role of Attribute Context in Vision-Language Models for Object Recognition and Detection
Kyle Buettner
Adriana Kovashka
22
0
0
17 Mar 2023
ViperGPT: Visual Inference via Python Execution for Reasoning
Dídac Surís
Sachit Menon
Carl Vondrick
MLLM
LRM
ReLM
45
431
0
14 Mar 2023
Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners
Renrui Zhang
Xiangfei Hu
Bohao Li
Siyuan Huang
Hanqiu Deng
Hongsheng Li
Yu Qiao
Peng Gao
VLM
MLLM
40
170
0
03 Mar 2023
Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable Diffusion
Jordan Shipard
Arnold Wiliem
Kien Nguyen Thanh
Wei Xiang
Clinton Fookes
DiffM
14
73
0
07 Feb 2023
Affective Faces for Goal-Driven Dyadic Communication
Scott Geng
Revant Teotia
Purva Tendulkar
Sachit Menon
Carl Vondrick
VGen
26
18
0
26 Jan 2023
Doubly Right Object Recognition: A Why Prompt for Visual Rationales
Chengzhi Mao
Revant Teotia
Amrutha Sundar
Sachit Menon
Junfeng Yang
Xin Eric Wang
Carl Vondrick
18
29
0
12 Dec 2022
SuS-X: Training-Free Name-Only Transfer of Vision-Language Models
Vishaal Udandarao
Ankush Gupta
Samuel Albanie
VLM
MLLM
29
103
0
28 Nov 2022
Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification
Yue Yang
Artemis Panagopoulou
Shenghao Zhou
Daniel Jin
Chris Callison-Burch
Mark Yatskar
40
211
0
21 Nov 2022
What does a platypus look like? Generating customized prompts for zero-shot image classification
Sarah M Pratt
Ian Covert
Rosanne Liu
Ali Farhadi
VLM
131
212
0
07 Sep 2022
Generative Action Description Prompts for Skeleton-based Action Recognition
Wangmeng Xiang
Chong Li
Yuxuan Zhou
Biao Wang
Lei Zhang
35
35
0
10 Aug 2022
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA
Zhengyuan Yang
Zhe Gan
Jianfeng Wang
Xiaowei Hu
Yumao Lu
Zicheng Liu
Lijuan Wang
180
402
0
10 Sep 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
313
3,708
0
11 Feb 2021
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktaschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
417
2,588
0
03 Sep 2019
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
296
39,198
0
01 Sep 2014
Previous
1
2
3
4
5