Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.10972
Cited By
v1
v2
v3
v4 (latest)
ImageNet-21K Pretraining for the Masses
22 April 2021
T. Ridnik
Emanuel Ben-Baruch
Asaf Noy
Lihi Zelnik-Manor
SSeg
VLM
CLIP
Re-assign community
ArXiv (abs)
PDF
HTML
Github (765★)
Papers citing
"ImageNet-21K Pretraining for the Masses"
50 / 427 papers shown
Title
Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D
Mukund Varma
Peihao Wang
Zhiwen Fan
Zhangyang Wang
Hao Su
R. Ramamoorthi
VLM
88
8
0
27 Mar 2024
MisGUIDE : Defense Against Data-Free Deep Learning Model Extraction
Mahendra Gurve
S. Behera
Satyadev Ahlawat
Yamuna Prasad
MIACV
AAML
53
0
0
27 Mar 2024
SCOD: From Heuristics to Theory
Vojtech Franc
Jakub Paplhám
D. Prusa
65
2
0
25 Mar 2024
G-ACIL: Analytic Learning for Exemplar-Free Generalized Class Incremental Learning
Huiping Zhuang
Yizhu Chen
Di Fang
Run He
Kai Tong
Hongxin Wei
Huiping Zhuang
Cen Chen
CLL
85
12
0
23 Mar 2024
GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning
Xiaojie Li
Yibo Yang
Hefei Ling
Jianlong Wu
Yue Yu
Guohao Li
Min Zhang
SSL
101
6
0
18 Mar 2024
DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation
Yuanchen Wu
Xichen Ye
Kequan Yang
Jide Li
Xiaoqiang Li
92
10
0
17 Mar 2024
Frozen Feature Augmentation for Few-Shot Image Classification
Andreas Bär
N. Houlsby
Mostafa Dehghani
Manoj Kumar
VLM
86
4
0
15 Mar 2024
Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization
Zhao Wang
Aoxue Li
Fengwei Zhou
Zhenguo Li
Qi Dou
ObjD
VLM
124
2
0
14 Mar 2024
Synth
2
^2
2
: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings
Sahand Sharifzadeh
Christos Kaplanis
Shreya Pathak
D. Kumaran
Anastasija Ilić
Jovana Mitrović
Charles Blundell
Andrea Banino
VLM
97
12
0
12 Mar 2024
Annotations on a Budget: Leveraging Geo-Data Similarity to Balance Model Performance and Annotation Cost
Oana Ignat
Longju Bai
Joan Nwatu
Rada Mihalcea
76
6
0
12 Mar 2024
Fine-grained Prompt Tuning: A Parameter and Memory Efficient Transfer Learning Method for High-resolution Medical Image Classification
Yijin Huang
Pujin Cheng
Roger Tam
Xiaoying Tang
VLM
MedIm
102
1
0
12 Mar 2024
Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models
Minjie Zhu
Yichen Zhu
Xin Liu
Ning Liu
Zhiyuan Xu
Yaxin Peng
Chaomin Shen
Zhicai Ou
Feifei Feng
Jian Tang
VLM
100
22
0
10 Mar 2024
Latent Dataset Distillation with Diffusion Models
Brian B. Moser
Federico Raue
Sebastián M. Palacio
Stanislav Frolov
Andreas Dengel
DD
108
16
0
06 Mar 2024
Pre-training Differentially Private Models with Limited Public Data
Zhiqi Bu
Xinwei Zhang
Mingyi Hong
Sheng Zha
George Karypis
114
4
0
28 Feb 2024
Grounding Language Models for Visual Entity Recognition
Zilin Xiao
Ming Gong
Paola Cascante-Bonilla
Xingyao Zhang
Jie Wu
Vicente Ordonez
VLM
97
10
0
28 Feb 2024
Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation
Zhiwei Yang
Kexue Fu
Minghong Duan
Linhao Qu
Shuo Wang
Zhijian Song
76
15
0
28 Feb 2024
Self-Supervised Learning with Generative Adversarial Networks for Electron Microscopy
Bashir Kazimi
Karina Ruzaeva
Stefan Sandfeld
85
6
0
28 Feb 2024
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval
Haowei Liu
Yaya Shi
Haiyang Xu
Chunfen Yuan
Qinghao Ye
...
Mingshi Yan
Ji Zhang
Fei Huang
Bing Li
Weiming Hu
55
1
0
26 Feb 2024
A Comprehensive Survey of Convolutions in Deep Learning: Applications, Challenges, and Future Trends
Abolfazl Younesi
Mohsen Ansari
Mohammadamin Fazli
A. Ejlali
Muhammad Shafique
Joerg Henkel
3DV
117
51
0
23 Feb 2024
Does Combining Parameter-efficient Modules Improve Few-shot Transfer Accuracy?
Nader Asadi
Mahdi Beitollahi
Yasser H. Khalil
Yinchuan Li
Guojun Zhang
Xi Chen
MoMe
92
9
0
23 Feb 2024
All in One and One for All: A Simple yet Effective Method towards Cross-domain Graph Pretraining
Haihong Zhao
Aochuan Chen
Xiangguo Sun
Hong Cheng
Jia Li
98
41
0
15 Feb 2024
An Empirical Study Into What Matters for Calibrating Vision-Language Models
Weijie Tu
Weijian Deng
Dylan Campbell
Stephen Gould
Tom Gedeon
VLM
88
8
0
12 Feb 2024
Enhancing Embodied Object Detection through Language-Image Pre-training and Implicit Object Memory
N. H. Chapman
Feras Dayoub
Will N. Browne
Chris Lehnert
ObjD
VLM
LM&Ro
65
1
0
06 Feb 2024
Review of multimodal machine learning approaches in healthcare
"Felix H. Krones
Umar Marikkar
Guy Parsons
Adam Szmul
Adam Mahdi
120
32
0
04 Feb 2024
Learning Semantic Proxies from Visual Prompts for Parameter-Efficient Fine-Tuning in Deep Metric Learning
Li Ren
Chen Chen
Liqiang Wang
Kien Hua
75
5
0
04 Feb 2024
Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?
Cheng Han
Qifan Wang
Yiming Cui
Wenguan Wang
Lifu Huang
Siyuan Qi
Dongfang Liu
VLM
157
22
0
23 Jan 2024
Slicer Networks
Hang Zhang
Xiang Chen
Rongguang Wang
Renjiu Hu
Dongdong Liu
Gaolei Li
MedIm
87
3
0
18 Jan 2024
Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models
Dan G. Jacobellis
Daniel Cummings
N. Yadwadkar
58
2
0
15 Jan 2024
Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing
Jakob Hackstein
Gencer Sumbul
Kai Norman Clasen
Begüm Demir
108
7
0
15 Jan 2024
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs
Shengbang Tong
Zhuang Liu
Yuexiang Zhai
Yi-An Ma
Yann LeCun
Saining Xie
VLM
MLLM
151
349
0
11 Jan 2024
OTAS: An Elastic Transformer Serving System via Token Adaptation
Jinyu Chen
Wenchao Xu
Zicong Hong
Song Guo
Yining Qi
Jie Zhang
Deze Zeng
69
4
0
10 Jan 2024
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices
Xiangxiang Chu
Limeng Qiao
Xinyang Lin
Shuang Xu
Yang Yang
...
Fei Wei
Xinyu Zhang
Bo Zhang
Xiaolin Wei
Chunhua Shen
MLLM
130
44
0
28 Dec 2023
Infinite dSprites for Disentangled Continual Learning: Separating Memory Edits from Generalization
Sebastian Dziadzio
cCaugatay Yildiz
Gido M. van de Ven
Tomasz Trzciñski
Tinne Tuytelaars
Matthias Bethge
71
1
0
27 Dec 2023
Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty from Pre-trained Models
Gianni Franchi
Olivier Laurent
Maxence Leguéry
Andrei Bursuc
Andrea Pilzer
Angela Yao
UQCV
BDL
60
6
0
23 Dec 2023
Testing the Segment Anything Model on radiology data
J. Almeida
N. M. Rodrigues
Sara Silva
Nickolas Papanikolaou
MedIm
VLM
86
1
0
20 Dec 2023
Advancing Image Retrieval with Few-Shot Learning and Relevance Feedback
Boaz Lerner
N. Darshan
Rami Ben-Ari
70
1
0
18 Dec 2023
A Survey of Reasoning with Foundation Models
Jiankai Sun
Chuanyang Zheng
Enze Xie
Zhengying Liu
Ruihang Chu
...
Xipeng Qiu
Yi-Chen Guo
Hui Xiong
Qun Liu
Zhenguo Li
ReLM
LRM
AI4CE
207
85
0
17 Dec 2023
Read Between the Layers: Leveraging Multi-Layer Representations for Rehearsal-Free Continual Learning with Pre-Trained Models
Kyra Ahrens
Hans Hergen Lehmann
Jae Hee Lee
Stefan Wermter
CLL
98
7
0
13 Dec 2023
ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection
Joonhyun Jeong
Geondo Park
Jayeon Yoo
Hyungsik Jung
Heesu Kim
VLM
ObjD
92
11
0
12 Dec 2023
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Ouguzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
99
74
0
11 Dec 2023
Scaling Laws of Synthetic Images for Model Training ... for Now
Lijie Fan
Kaifeng Chen
Dilip Krishnan
Dina Katabi
Phillip Isola
Yonglong Tian
CLIP
VLM
82
68
0
07 Dec 2023
Scaling Laws for Adversarial Attacks on Language Model Activations
Stanislav Fort
63
16
0
05 Dec 2023
A Comprehensive Study of Vision Transformers in Image Classification Tasks
Mahmoud Khalil
Ahmad Khalil
A. Ngom
ViT
64
10
0
02 Dec 2023
Beyond Accuracy: Statistical Measures and Benchmark for Evaluation of Representation from Self-Supervised Learning
Jiantao Wu
Shentong Mo
Sara Atito
Josef Kittler
Zhenhua Feng
Muhammad Awais
SSL
70
3
0
02 Dec 2023
Language-conditioned Detection Transformer
Jang Hyun Cho
Philipp Krahenbuhl
VLM
ObjD
93
1
0
29 Nov 2023
Non-Visible Light Data Synthesis and Application: A Case Study for Synthetic Aperture Radar Imagery
Zichen Tian
Zhaozheng Chen
Qianru Sun
81
1
0
29 Nov 2023
Explaining CLIP's performance disparities on data from blind/low vision users
Daniela Massiceti
Camilla Longden
Agnieszka Slowik
Samuel Wills
Martin Grayson
C. Morrison
VLM
60
10
0
29 Nov 2023
Efficient Key-Based Adversarial Defense for ImageNet by Using Pre-trained Model
AprilPyone Maungmaung
Isao Echizen
Hitoshi Kiya
VLM
AAML
57
0
0
28 Nov 2023
SpliceMix: A Cross-scale and Semantic Blending Augmentation Strategy for Multi-label Image Classification
Lei Wang
Yibing Zhan
Leilei Ma
Dapeng Tao
Liang Ding
Chen Gong
77
1
0
26 Nov 2023
ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision Representation
Yangyi Chen
Xingyao Wang
Manling Li
Derek Hoiem
Heng Ji
81
12
0
22 Nov 2023
Previous
1
2
3
4
5
6
7
8
9
Next