CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP
Aligned Representation

CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representation

21 June 2024

Muhammad Ali

Papers citing "CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representation"

4 / 4 papers shown

Title
Captured by Captions: On Memorization and its Mitigation in CLIP Models Wenhao Wang Adam Dziedzic Grace C. Kim Michael Backes Franziska Boenisch 93 0 0 11 Feb 2025
Organizing Unstructured Image Collections using Natural Language Mingxuan Liu Zhun Zhong Jun Li Gianni Franchi Subhankar Roy Elisa Ricci VLM 39 3 0 07 Oct 2024
Speech-text based multi-modal training with bidirectional attention for improved speech recognition Yuhang Yang Haihua Xu Hao-Ming Huang E. Chng Sheng Li 38 7 0 01 Nov 2022
ImageNet-21K Pretraining for the Masses T. Ridnik Emanuel Ben-Baruch Asaf Noy Lihi Zelnik-Manor SSeg VLM CLIP 181 687 0 22 Apr 2021