ResearchTrend.AI
Waffling around for Performance: Visual Classification with Random Words and Broad Concepts

12 June 2023
Karsten Roth
Jae Myung Kim
A. Sophia Koepke
Oriol Vinyals
Cordelia Schmid
Zeynep Akata
    VLM
arXiv: 2306.07282

Papers citing "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts"

50 / 59 papers shown
Title
FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation
Yasser Benigmim
Mohammad Fahes
Tuan-Hung Vu
Andrei Bursuc
Raoul de Charette
VLM
40
0
0
14 Apr 2025
What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning
Chi-Hsi Kung
Frangil Ramirez
Juhyung Ha
Yi-Ting Chen
David J. Crandall
Yi-Hsuan Tsai
45
0
0
27 Mar 2025
Training-Free Personalization via Retrieval and Reasoning on Fingerprints
Deepayan Das
Davide Talon
Yiming Wang
Massimiliano Mancini
Elisa Ricci
VLM
LRM
50
0
0
24 Mar 2025
Compositional Caching for Training-free Open-vocabulary Attribute Detection
Marco Garosi
Alessandro Conti
Gaowen Liu
Elisa Ricci
Massimiliano Mancini
ObjD
VLM
55
0
0
24 Mar 2025
ProAPO: Progressively Automatic Prompt Optimization for Visual Classification
Xiangyan Qu
Gaopeng Gou
Jiamin Zhuang
Jing Yu
Kun Song
Qihao Wang
Yili Li
Gang Xiong
VLM
93
0
0
13 Mar 2025
Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images
M. Rahaman
Ewan K. A. Millar
Erik H. W. Meijering
VLM
64
0
0
13 Mar 2025
Towards Locally Explaining Prediction Behavior via Gradual Interventions and Measuring Property Gradients
Niklas Penzel
Joachim Denzler
FAtt
50
0
0
07 Mar 2025
SPARC: Score Prompting and Adaptive Fusion for Zero-Shot Multi-Label Recognition in Vision-Language Models
Kevin Miller
Samarth Mishra
Aditya Gangrade
Kate Saenko
Venkatesh Saligrama
VLM
47
0
0
24 Feb 2025
Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition
Xinyu Tian
Shu Zou
Zhaoyuan Yang
Mengqi He
Jing Zhang
VLM
48
0
0
19 Feb 2025
VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance
Divyansh Srivastava
Beatriz Cabrero-Daniel
Christian Berger
VLM
67
8
0
17 Jan 2025
BatStyler: Advancing Multi-category Style Generation for Source-free Domain Generalization
Xiusheng Xu
Lei Qi
Jingyang Zhou
Xin Geng
TTA
57
0
0
03 Jan 2025
Real Classification by Description: Extending CLIP's Limits of Part Attributes Recognition
Ethan Baron
Idan Tankel
Peter Tu
Guy Ben-Yosef
VLM
84
0
0
18 Dec 2024
Does VLM Classification Benefit from LLM Description Semantics?
Pingchuan Ma
Lennart Rietdorf
Dmytro Kotovenko
Vincent Tao Hu
Bjorn Ommer
VLM
74
1
0
16 Dec 2024
Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP
Yayuan Li
Jintao Guo
Lei Qi
Wenbin Li
Yinghuan Shi
VLM
CLIP
79
0
0
16 Dec 2024
SenCLIP: Enhancing zero-shot land-use mapping for Sentinel-2 with ground-level prompting
Pallavi Jain
Dino Ienco
R. Interdonato
Tristan Berchoux
Diego Marcos
VLM
80
3
0
11 Dec 2024
How to Merge Your Multimodal Models Over Time?
Sebastian Dziadzio
Vishaal Udandarao
Karsten Roth
Ameya Prabhu
Zeynep Akata
Samuel Albanie
Matthias Bethge
MoMe
100
3
0
09 Dec 2024
CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections
Mohamed Fazli Mohamed Imam
Rufael Fedaku Marew
Jameel Hassan
M. Fiaz
Alham Fikri Aji
Hisham Cholakkal
VLM
193
0
0
28 Nov 2024
DoubleCCA: Improving Foundation Model Group Robustness with Random Sentence Embeddings
Hong Liu
Yitong Lu
78
0
0
25 Nov 2024
Beyond Accuracy: Ensuring Correct Predictions With Correct Rationales
Tang Li
Mengmeng Ma
Xi Peng
45
2
0
31 Oct 2024
Tree of Attributes Prompt Learning for Vision-Language Models
Tong Ding
Wanhua Li
Zhongqi Miao
Hanspeter Pfister
VLM
54
1
0
15 Oct 2024
LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts
Anh-Quan Cao
M. Jaritz
Matthieu Guillaumin
Raoul de Charette
Loris Bazzani
VLM
CLIP
52
2
0
10 Oct 2024
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models
Muhammad Jehanzeb Mirza
Mengjie Zhao
Zhuoyuan Mao
Sivan Doveh
Wei Lin
...
Yuki Mitsufuji
Horst Possegger
Rogerio Feris
Leonid Karlinsky
James Glass
VLM
84
1
0
08 Oct 2024
Visual-O1: Understanding Ambiguous Instructions via Multi-modal Multi-turn Chain-of-thoughts Reasoning
Minheng Ni
Yutao Fan
Lei Zhang
Wangmeng Zuo
LRM
AI4CE
31
6
0
04 Oct 2024
A sound description: Exploring prompt templates and class descriptions to enhance zero-shot audio classification
Michel Olvera
Paraskevas Stamatiadis
S. Essid
VLM
37
1
0
19 Sep 2024
Text-Enhanced Zero-Shot Action Recognition: A training-free approach
Massimo Bosetti
Shibingfeng Zhang
Benedetta Liberatori
Giacomo Zara
Elisa Ricci
Paolo Rota
VLM
49
0
0
29 Aug 2024
Efficient Test-Time Prompt Tuning for Vision-Language Models
Yuhan Zhu
Guozhen Zhang
Chen Xu
Haocheng Shen
Xiaoxin Chen
Gangshan Wu
Limin Wang
VLM
37
2
0
11 Aug 2024
Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning
Xiangyan Qu
Jing Yu
Keke Gai
Jiamin Zhuang
Yuanmin Tang
Gang Xiong
Gaopeng Gou
Qi Wu
49
2
0
22 Jul 2024
Open Vocabulary Multi-Label Video Classification
Rohit Gupta
Mamshad Nayeem Rizve
Jayakrishnan Unnikrishnan
Ashish Tawari
Son Tran
Mubarak Shah
Benjamin Z. Yao
Trishul Chilimbi
VLM
67
1
0
12 Jul 2024
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Yuhan Zhu
Yuyang Ji
Zhiyu Zhao
Gangshan Wu
Limin Wang
VLM
44
7
0
05 Jul 2024
Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
Jinhao Li
Haopeng Li
S. Erfani
Lei Feng
James Bailey
Feng Liu
VLM
34
3
0
05 Jun 2024
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
Mingxuan Liu
Tyler L. Hayes
Elisa Ricci
G. Csurka
Riccardo Volpi
ObjD
61
1
0
16 May 2024
Mind the Gap Between Synthetic and Real: Utilizing Transfer Learning to Probe the Boundaries of Stable Diffusion Generated Data
Leonhard Hennicke
C. Adriano
Holger Giese
Jan Mathias Koehler
Lukas Schott
DiffM
55
2
0
06 May 2024
Embracing Diversity: Interpretable Zero-shot classification beyond one vector per class
Mazda Moayeri
Michael G. Rabbat
Mark Ibrahim
Diane Bouchacourt
VLM
52
1
0
25 Apr 2024
Evolving Interpretable Visual Classifiers with Large Language Models
Mia Chiquier
Utkarsh Mall
Carl Vondrick
VLM
30
10
0
15 Apr 2024
Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection
Ting Lei
Shaofeng Yin
Yang Liu
VLM
47
9
0
09 Apr 2024
Label Propagation for Zero-shot Classification with Vision-Language Models
Vladan Stojnić
Yannis Kalantidis
Giorgos Tolias
VLM
41
8
0
05 Apr 2024
Training-Free Semantic Segmentation via LLM-Supervision
Wenfang Sun
Yingjun Du
Gaowen Liu
Ramana Rao Kompella
Cees G. M. Snoek
VLM
44
2
0
31 Mar 2024
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
Reza Esfandiarpoor
Cristina Menghini
Stephen H. Bach
CoGe
VLM
40
8
0
25 Mar 2024
Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs
M. Jehanzeb Mirza
Leonid Karlinsky
Wei Lin
Sivan Doveh
Jakub Micorek
Mateusz Koziński
Hilde Kuehne
Horst Possegger
VLM
MLLM
47
13
0
18 Mar 2024
PEEB: Part-based Image Classifiers with an Explainable and Editable Language Bottleneck
Thang M. Pham
Peijie Chen
Tin Nguyen
Seunghyun Yoon
Trung Bui
Anh Nguyen
VLM
51
7
0
08 Mar 2024
Any-Shift Prompting for Generalization over Distributions
Zehao Xiao
Jiayi Shen
Mohammad Mahdi Derakhshani
Tianran Ouyang
Cees G. M. Snoek
OOD
VPVLM
VLM
45
8
0
15 Feb 2024
Multimodal Unsupervised Domain Generalization by Retrieving Across the Modality Gap
Christopher Liao
Christian So
Theodoros Tsiligkaridis
Brian Kulis
36
0
0
06 Feb 2024
Learning to Prompt with Text Only Supervision for Vision-Language Models
Muhammad Uzair Khattak
Muhammad Ferjad Naeem
Muzammal Naseer
Luc Van Gool
F. Tombari
VLM
VPVLM
33
19
0
04 Jan 2024
ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models
Xinyu Tian
Shu Zou
Zhaoyuan Yang
Jing Zhang
VLM
34
22
0
27 Nov 2023
Descriptor and Word Soups: Overcoming the Parameter Efficiency Accuracy Tradeoff for Out-of-Distribution Few-shot Learning
Christopher Liao
Theodoros Tsiligkaridis
Brian Kulis
OODD
49
5
0
21 Nov 2023
LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions
Songhao Han
Le Zhuo
Yue Liao
Si Liu
VLM
26
14
0
20 Nov 2023
From Categories to Classifier: Name-Only Continual Learning by Exploring the Web
Ameya Prabhu
Hasan Hammoud
Ser-Nam Lim
Guohao Li
Philip Torr
Adel Bibi
CLL
127
9
0
19 Nov 2023
Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification
Reza Esfandiarpoor
Stephen H. Bach
VLM
32
13
0
10 Nov 2023
Videoprompter: an ensemble of foundational models for zero-shot video understanding
Adeel Yousaf
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
Mubarak Shah
VLM
38
2
0
23 Oct 2023
Vision-by-Language for Training-Free Compositional Image Retrieval
Shyamgopal Karthik
Karsten Roth
Massimiliano Mancini
Zeynep Akata
CoGe
28
52
0
13 Oct 2023