What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models

What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models

24 May 2024

Abdelrahman Abdelhamed

Papers citing "What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models"

8 / 8 papers shown

Title
Zero-shot Model-based Reinforcement Learning using Large Language Models Abdelhakim Benechehab Youssef Attia El Hili Ambroise Odonnat Oussama Zekri Albert Thomas Giuseppe Paolo Maurizio Filippone I. Redko Balázs Kégl OffRL 66 1 0 17 Feb 2025
Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importance Mostafa Hussien Mahmoud Afifi K. Nguyen M. Cheriet 26 0 0 21 Oct 2024
SynCo: Synthetic Hard Negatives in Contrastive Learning for Better Unsupervised Visual Representations Nikolaos Giakoumoglou Tania Stathaki SSL 46 0 0 03 Oct 2024
CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention Ziyu Guo Renrui Zhang Longtian Qiu Xianzheng Ma Xupeng Miao Xuming He Bin Cui VLM AAML 59 109 0 28 Sep 2022
UniCLIP: Unified Framework for Contrastive Language-Image Pre-training Janghyeon Lee Jongsuk Kim Hyounguk Shon Bumsoo Kim Seung Wook Kim Honglak Lee Junmo Kim CLIP VLM 50 53 0 27 Sep 2022
What does a platypus look like? Generating customized prompts for zero-shot image classification Sarah M Pratt Ian Covert Rosanne Liu Ali Farhadi VLM 131 212 0 07 Sep 2022
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling Renrui Zhang Rongyao Fang Wei Zhang Peng Gao Kunchang Li Jifeng Dai Yu Qiao Hongsheng Li VLM 192 385 0 06 Nov 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision Chao Jia Yinfei Yang Ye Xia Yi-Ting Chen Zarana Parekh Hieu H. Pham Quoc V. Le Yun-hsuan Sung Zhen Li Tom Duerig VLM CLIP 310 3,708 0 11 Feb 2021