Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.00917
Cited By
v1
v2
v3 (latest)
Vocabulary-free Image Classification
1 June 2023
Alessandro Conti
Enrico Fini
Massimiliano Mancini
Paolo Rota
Yiming Wang
Elisa Ricci
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Vocabulary-free Image Classification"
22 / 22 papers shown
Title
Vocabulary-free few-shot learning for Vision-Language Models
Maxime Zanella
Clément Fuchs
Ismail Ben Ayed
Christophe De Vleeschouwer
VLM
90
0
0
04 Jun 2025
Reasoning Segmentation for Images and Videos: A Survey
Yiqing Shen
Chenjia Li
Fei Xiong
Jeong-O Jeong
Tianpeng Wang
Michael Latman
Mathias Unberath
VOS
246
0
0
24 May 2025
Efficient Vocabulary-Free Fine-Grained Visual Recognition in the Age of Multimodal LLMs
Hari Chandana Kuchibhotla
Sai Srinivas Kancheti
Abbavaram Gowtham Reddy
Vineeth N. Balasubramanian
169
0
0
02 May 2025
Taxonomy-Aware Evaluation of Vision-Language Models
Vésteinn Snæbjarnarson
Kevin Du
Niklas Stoehr
Serge Belongie
Ryan Cotterell
Nico Lang
Stella Frank
92
2
0
07 Apr 2025
On Large Multimodal Models as Open-World Image Classifiers
Alessandro Conti
Massimiliano Mancini
Enrico Fini
Yiming Wang
Paolo Rota
Elisa Ricci
VLM
Presented at
ResearchTrend Connect | VLM
on
07 May 2025
199
1
0
27 Mar 2025
Compositional Caching for Training-free Open-vocabulary Attribute Detection
Marco Garosi
Alessandro Conti
Gaowen Liu
Elisa Ricci
Massimiliano Mancini
ObjD
VLM
103
0
0
24 Mar 2025
CoA: Chain-of-Action for Generative Semantic Labels
Meng Wei
Zhongnian Li
Peng Ying
Xinzheng Xu
VLM
119
0
0
26 Nov 2024
Transforming Game Play: A Comparative Study of DCQN and DTQN Architectures in Reinforcement Learning
William A. Stigall
122
0
0
14 Oct 2024
Organizing Unstructured Image Collections using Natural Language
Mingxuan Liu
Zhun Zhong
Jun Li
Gianni Franchi
Subhankar Roy
Elisa Ricci
VLM
143
5
0
07 Oct 2024
A Hitchhikers Guide to Fine-Grained Face Forgery Detection Using Common Sense Reasoning
Niki Maria Foteinopoulou
Enjie Ghorbel
Djamila Aouada
136
4
0
01 Oct 2024
Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant
Guofeng Mei
Luigi Riz
Yiming Wang
Fabio Poiesi
ISeg
VLM
131
4
0
20 Aug 2024
Vocabulary-free Image Classification and Semantic Segmentation
Alessandro Conti
Enrico Fini
Massimiliano Mancini
Paolo Rota
Yiming Wang
Elisa Ricci
VLM
89
3
0
16 Apr 2024
TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias
Sang-Kee Jo
Soohyun Ryu
Sungyub Kim
Eunho Yang
Kyungsu Kim
98
2
0
30 Mar 2024
Renovating Names in Open-Vocabulary Segmentation Benchmarks
Haiwen Huang
Songyou Peng
Dan Zhang
Andreas Geiger
VLM
76
3
0
14 Mar 2024
Effectiveness Assessment of Recent Large Vision-Language Models
Yao Jiang
Xinyu Yan
Ge-Peng Ji
Keren Fu
Meijun Sun
Huan Xiong
Deng-Ping Fan
Fahad Shahbaz Khan
125
17
0
07 Mar 2024
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy
Simon Ging
M. A. Bravo
Thomas Brox
VLM
158
12
0
11 Feb 2024
Democratizing Fine-grained Visual Recognition with Large Language Models
Mingxuan Liu
Subhankar Roy
Wenjing Li
Zhun Zhong
N. Sebe
Elisa Ricci
VLM
106
13
0
24 Jan 2024
Osprey: Pixel Understanding with Visual Instruction Tuning
Yuqian Yuan
Wentong Li
Jian Liu
Dongqi Tang
Xinjie Luo
Chi Qin
Lei Zhang
Jianke Zhu
MLLM
VLM
127
87
0
15 Dec 2023
Object Recognition as Next Token Prediction
Kaiyu Yue
Borchun Chen
Jonas Geiping
Hengduo Li
Tom Goldstein
Ser-Nam Lim
93
9
0
04 Dec 2023
Towards Open-Ended Visual Recognition with Large Language Model
Qihang Yu
Xiaohui Shen
Liang-Chieh Chen
VLM
74
8
0
14 Nov 2023
Survey on video anomaly detection in dynamic scenes with moving cameras
Runyu Jiao
Yi Wan
Fabio Poiesi
Yiming Wang
104
11
0
14 Aug 2023
Billion-scale similarity search with GPUs
Jeff Johnson
Matthijs Douze
Hervé Jégou
458
3,754
0
28 Feb 2017
1