Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.10972
Cited By
v1
v2
v3
v4 (latest)
ImageNet-21K Pretraining for the Masses
22 April 2021
T. Ridnik
Emanuel Ben-Baruch
Asaf Noy
Lihi Zelnik-Manor
SSeg
VLM
CLIP
Re-assign community
ArXiv (abs)
PDF
HTML
Github (765★)
Papers citing
"ImageNet-21K Pretraining for the Masses"
50 / 427 papers shown
Title
On the Generalizability and Predictability of Recommender Systems
Duncan C. McElfresh
Sujay Khandagale
Jonathan Valverde
John P. Dickerson
Colin White
74
10
0
23 Jun 2022
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications
Muhammad Maaz
Abdelrahman M. Shaker
Hisham Cholakkal
Salman Khan
Syed Waqas Zamir
Rao Muhammad Anwer
Fahad Shahbaz Khan
ViT
125
203
0
21 Jun 2022
Model-Agnostic Few-Shot Open-Set Recognition
Malik Boudiaf
Etienne Bennequin
Myriam Tami
C´eline Hudelot
Antoine Toubhans
Pablo Piantanida
Ismail Ben Ayed
BDL
107
1
0
18 Jun 2022
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Jiasen Lu
Christopher Clark
Rowan Zellers
Roozbeh Mottaghi
Aniruddha Kembhavi
ObjD
VLM
MLLM
171
412
0
17 Jun 2022
Efficient Adaptive Ensembling for Image Classification
A. Bruno
Davide Moroni
M. Martinelli
67
18
0
15 Jun 2022
Differentiable Top-k Classification Learning
Felix Petersen
Hilde Kuehne
Christian Borgelt
Oliver Deussen
125
32
0
15 Jun 2022
Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps
Seung Wook Kim
Karsten Kreis
Daiqing Li
Antonio Torralba
Sanja Fidler
GAN
95
6
0
06 Jun 2022
Separable Self-attention for Mobile Vision Transformers
Sachin Mehta
Mohammad Rastegari
ViT
MQ
105
265
0
06 Jun 2022
Which models are innately best at uncertainty estimation?
Ido Galil
Mohammed Dabbah
Ran El-Yaniv
UQCV
79
5
0
05 Jun 2022
Zero-Shot and Few-Shot Learning for Lung Cancer Multi-Label Classification using Vision Transformer
F. Guo
Yingfang Fan
ViT
MedIm
124
7
0
30 May 2022
Self-Supervised Pre-training of Vision Transformers for Dense Prediction Tasks
Jaonary Rabarisoa
Velentin Belissen
Florian Chabot
Q. C. Pham
VLM
ViT
SSL
MDE
45
3
0
30 May 2022
A Closer Look at Self-Supervised Lightweight Vision Transformers
Shaoru Wang
Jin Gao
Zeming Li
Jian Sun
Weiming Hu
ViT
148
46
0
28 May 2022
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Shoufa Chen
Chongjian Ge
Zhan Tong
Jiangliu Wang
Yibing Song
Jue Wang
Ping Luo
241
703
0
26 May 2022
An Empirical Study on Distribution Shift Robustness From the Perspective of Pre-Training and Data Augmentation
Ziquan Liu
Yi Tian Xu
Yuanhong Xu
Qi Qian
Hao Li
Rong Jin
Xiangyang Ji
Antoni B. Chan
OOD
89
16
0
25 May 2022
Vision Transformers in 2022: An Update on Tiny ImageNet
Ethan Huynh
ViT
86
11
0
21 May 2022
A Unified and Biologically-Plausible Relational Graph Representation of Vision Transformers
Yuzhong Chen
Yu Du
Zhe Xiao
Lin Zhao
Lu Zhang
...
Dajiang Zhu
Tuo Zhang
Xintao Hu
Tianming Liu
Xi Jiang
ViT
79
5
0
20 May 2022
Identical Image Retrieval using Deep Learning
Sayan Nath
Nikhil Nayak
VLM
52
1
0
10 May 2022
Continual Learning with Foundation Models: An Empirical Study of Latent Replay
O. Ostapenko
Timothée Lesort
P. Rodríguez
Md Rifat Arefin
Arthur Douillard
Irina Rish
Laurent Charlin
98
53
0
30 Apr 2022
Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval
Siyu Ren
Kenny Q. Zhu
VLM
34
7
0
29 Apr 2022
Where in the World is this Image? Transformer-based Geo-localization in the Wild
Shraman Pramanick
E. Nowara
Joshua Gleason
Carlos D. Castillo
Rama Chellappa
ViT
62
37
0
29 Apr 2022
Self-Supervised Learning of Object Parts for Semantic Segmentation
A. Ziegler
Yuki M. Asano
SSL
OCL
117
103
0
27 Apr 2022
The MeVer DeepFake Detection Service: Lessons Learnt from Developing and Deploying in the Wild
Spyridon Baxevanakis
Giorgos Kordopatis-Zilos
Panagiotis Galopoulos
Lazaros Apostolidis
Killian Levacher
Ipek B. Schlicht
Denis Teyssou
I. Kompatsiaris
Symeon Papadopoulos
72
8
0
27 Apr 2022
VISTA: Vision Transformer enhanced by U-Net and Image Colorfulness Frame Filtration for Automatic Retail Checkout
Md. Istiak Hossain Shihab
Nazia Tasnim
H. Zunair
L. Rupty
Nabeel Mohammed
86
8
0
23 Apr 2022
Learning to Scaffold: Optimizing Model Explanations for Teaching
Patrick Fernandes
Marcos Vinícius Treviso
Danish Pruthi
André F. T. Martins
Graham Neubig
FAtt
94
22
0
22 Apr 2022
DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning
Zifeng Wang
Zizhao Zhang
Sayna Ebrahimi
Ruoxi Sun
Han Zhang
...
Xiaoqi Ren
Guolong Su
Vincent Perot
Jennifer Dy
Tomas Pfister
CLL
VLM
VPVLM
132
504
0
10 Apr 2022
Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results
T. Ridnik
Hussam Lawen
Emanuel Ben-Baruch
Asaf Noy
104
11
0
07 Apr 2022
How stable are Transferability Metrics evaluations?
A. Agostinelli
Michal Pándy
J. Uijlings
Thomas Mensink
V. Ferrari
121
24
0
04 Apr 2022
CLIP-Mesh: Generating textured meshes from text using pretrained image-text models
N. Khalid
Tianhao Xie
Eugene Belilovsky
Tiberiu Popa
CLIP
102
302
0
24 Mar 2022
Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework
Botao Ye
Hong Chang
Bingpeng Ma
Shiguang Shan
Xilin Chen
ViT
119
485
0
22 Mar 2022
PACS: A Dataset for Physical Audiovisual CommonSense Reasoning
Samuel Yu
Peter Wu
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
LRM
117
16
0
21 Mar 2022
X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation
Yinan He
Gengshi Huang
Siyu Chen
Jianing Teng
Wang Kun
Zhen-fei Yin
Lu Sheng
Ziwei Liu
Yu Qiao
Jing Shao
VLM
SSL
ViT
107
7
0
16 Mar 2022
Sememe Prediction for BabelNet Synsets using Multilingual and Multimodal Information
Fanchao Qi
Chuancheng Lv
Zhiyuan Liu
Xiaojun Meng
Maosong Sun
Haitao Zheng
77
6
0
14 Mar 2022
Knowledge Distillation as Efficient Pre-training: Faster Convergence, Higher Data-efficiency, and Better Transferability
Ruifei He
Shuyang Sun
Jihan Yang
Song Bai
Xiaojuan Qi
96
37
0
10 Mar 2022
RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation
Hao He
Yuhui Yuan
Xiangyu Yue
Han Hu
VOS
VLM
82
14
0
08 Mar 2022
MetaFormer: A Unified Meta Framework for Fine-Grained Recognition
Qishuai Diao
Yi Jiang
Bin Wen
Jianxiang Sun
Zehuan Yuan
77
63
0
05 Mar 2022
Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification
Kai Yi
Xiaoqian Shen
Yunhao Gou
Mohamed Elhoseiny
93
21
0
02 Mar 2022
Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence
Frederik Pahde
Maximilian Dreyer
Leander Weber
Moritz Weckbecker
Christopher J. Anders
Thomas Wiegand
Wojciech Samek
Sebastian Lapuschkin
144
10
0
07 Feb 2022
Revisiting Weakly Supervised Pre-Training of Visual Perception Models
Mannat Singh
Laura Gustafson
Aaron B. Adcock
Vinicius de Freitas Reis
B. Gedik
Raj Prateek Kosaraju
D. Mahajan
Ross B. Girshick
Piotr Dollár
Laurens van der Maaten
VLM
104
130
0
20 Jan 2022
Adversarial vulnerability of powerful near out-of-distribution detection
Stanislav Fort
OODD
66
17
0
18 Jan 2022
It's All in the Head: Representation Knowledge Distillation through Classifier Sharing
Emanuel Ben-Baruch
M. Karklinsky
Yossi Biton
Avi Ben-Cohen
Hussam Lawen
Nadav Zamir
58
12
0
18 Jan 2022
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
141
107
0
16 Jan 2022
Parameter-free Online Test-time Adaptation
Malik Boudiaf
Romain Mueller
Ismail Ben Ayed
Luca Bertinetto
TTA
72
152
0
15 Jan 2022
Detecting Twenty-thousand Classes using Image-level Supervision
Xingyi Zhou
Rohit Girdhar
Armand Joulin
Phillip Krahenbuhl
Ishan Misra
CLIP
VLM
125
619
0
07 Jan 2022
MIA-Former: Efficient and Robust Vision Transformers via Multi-grained Input-Adaptation
Zhongzhi Yu
Y. Fu
Sicheng Li
Chaojian Li
Yingyan Lin
ViT
76
19
0
21 Dec 2021
Towards General and Efficient Active Learning
Yichen Xie
Masayoshi Tomizuka
Wei Zhan
VLM
95
10
0
15 Dec 2021
Transformaly -- Two (Feature Spaces) Are Better Than One
M. Cohen
S. Avidan
ViT
88
30
0
08 Dec 2021
Learning to Detect Every Thing in an Open World
Kuniaki Saito
Ping Hu
Trevor Darrell
Kate Saenko
ObjD
VLM
73
48
0
03 Dec 2021
DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
Yongming Rao
Wenliang Zhao
Guangyi Chen
Yansong Tang
Zheng Zhu
Guan Huang
Jie Zhou
Jiwen Lu
VLM
CLIP
224
582
0
02 Dec 2021
ML-Decoder: Scalable and Versatile Classification Head
T. Ridnik
Gilad Sharir
Avi Ben-Cohen
Emanuel Ben-Baruch
Asaf Noy
VLM
86
108
0
25 Nov 2021
Multi-label Classification with Partial Annotations using Class-aware Selective Loss
Emanuel Ben-Baruch
T. Ridnik
Itamar Friedman
Avi Ben-Cohen
Nadav Zamir
Asaf Noy
Lihi Zelnik-Manor
73
40
0
21 Oct 2021
Previous
1
2
3
4
5
6
7
8
9
Next