arXiv: 2406.14830
CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representation
21 June 2024
Muhammad Ali
Salman Khan
VLM
Papers citing "CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representation"
4 / 4 papers shown
Captured by Captions: On Memorization and its Mitigation in CLIP Models
Wenhao Wang
Adam Dziedzic
Grace C. Kim
Michael Backes
Franziska Boenisch
93
0
0
11 Feb 2025
Organizing Unstructured Image Collections using Natural Language
Mingxuan Liu
Zhun Zhong
Jun Li
Gianni Franchi
Subhankar Roy
Elisa Ricci
VLM
39
3
0
07 Oct 2024
Speech-text based multi-modal training with bidirectional attention for improved speech recognition
Yuhang Yang
Haihua Xu
Hao-Ming Huang
E. Chng
Sheng Li
38
7
0
01 Nov 2022
ImageNet-21K Pretraining for the Masses
T. Ridnik
Emanuel Ben-Baruch
Asaf Noy
Lihi Zelnik-Manor
SSeg
VLM
CLIP
181
687
0
22 Apr 2021