Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.10972
Cited By
v1
v2
v3
v4 (latest)
ImageNet-21K Pretraining for the Masses
22 April 2021
T. Ridnik
Emanuel Ben-Baruch
Asaf Noy
Lihi Zelnik-Manor
SSeg
VLM
CLIP
Re-assign community
ArXiv (abs)
PDF
HTML
Github (765★)
Papers citing
"ImageNet-21K Pretraining for the Masses"
50 / 427 papers shown
Title
Large-scale Dataset Pruning with Dynamic Uncertainty
Muyang He
Shuo Yang
Tiejun Huang
Bo Zhao
79
32
0
08 Jun 2023
ScaleDet: A Scalable Multi-Dataset Object Detector
Yanbei Chen
Manchen Wang
Abhay Mittal
Zhenlin Xu
Paolo Favaro
Joseph Tighe
Davide Modolo
ObjD
51
22
0
08 Jun 2023
Optimizing ViViT Training: Time and Memory Reduction for Action Recognition
Shreyank N. Gowda
Anurag Arnab
Jonathan Huang
ViT
62
4
0
07 Jun 2023
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks
Haiyang Xu
Qinghao Ye
Xuan-Wei Wu
Mingshi Yan
Yuan Miao
...
Qingfang Qian
Maofei Que
Ji Zhang
Xiaoyan Zeng
Feiyan Huang
VLM
MLLM
98
25
0
07 Jun 2023
Centered Self-Attention Layers
Ameen Ali
Tomer Galanti
Lior Wolf
140
8
0
02 Jun 2023
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval
Jiapeng Wang
Chengyu Wang
Xiaodan Wang
Jun Huang
Lianwen Jin
VLM
113
5
0
28 May 2023
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yezhou Yang
CLIP
VLM
129
27
0
23 May 2023
P-NOC: adversarial training of CAM generating networks for robust weakly supervised semantic segmentation priors
L. David
Hélio Pedrini
Z. Dias
GAN
81
1
0
21 May 2023
Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
Raman Dutt
Linus Ericsson
Pedro Sanchez
Sotirios A. Tsaftaris
Timothy M. Hospedales
MedIm
123
55
0
14 May 2023
Efficient Neural Network based Classification and Outlier Detection for Image Moderation using Compressed Sensing and Group Testing
Sabyasachi Ghosh
Sanyam Saxena
Ajit V. Rajwade
69
0
0
12 May 2023
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception
Hassan Akbari
Dan Kondratyuk
Huayu Chen
Rachel Hornung
Haoran Wang
Hartwig Adam
VLM
MoE
105
13
0
10 May 2023
A Survey on the Robustness of Computer Vision Models against Common Corruptions
Shunxin Wang
Raymond N. J. Veldhuis
Christoph Brune
N. Strisciuglio
OOD
VLM
139
14
0
10 May 2023
Enhancing Adversarial Contrastive Learning via Adversarial Invariant Regularization
Xilie Xu
Jingfeng Zhang
Feng Liu
Masashi Sugiyama
Mohan S. Kankanhalli
AAML
111
11
0
30 Apr 2023
Towards Robust Text-Prompted Semantic Criterion for In-the-Wild Video Quality Assessment
Haoning Wu
Liang Liao
Annan Wang
Chaofeng Chen
Jingwen Hou
Wenxiu Sun
Qiong Yan
Weisi Lin
102
15
0
28 Apr 2023
AutoFocusFormer: Image Segmentation off the Grid
Chen Ziwen
K. Patnaik
Shuangfei Zhai
Alvin Wan
Zhile Ren
Alex Schwing
Alex Colburn
Li Fuxin
103
12
0
24 Apr 2023
Benchmarking Low-Shot Robustness to Natural Distribution Shifts
Aaditya K. Singh
Kartik Sarangmath
Prithvijit Chattopadhyay
Judy Hoffman
OOD
90
1
0
21 Apr 2023
DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training
Yihao Chen
Xianbiao Qi
Jianan Wang
Lei Zhang
82
18
0
17 Apr 2023
Unified Out-Of-Distribution Detection: A Model-Specific Perspective
Reza Averly
Wei-Lun Chao
OODD
82
14
0
13 Apr 2023
Pinpointing Why Object Recognition Performance Degrades Across Income Levels and Geographies
Laura Gustafson
Megan Richards
Melissa Hall
C. Hazirbas
Diane Bouchacourt
Mark Ibrahim
66
7
0
11 Apr 2023
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition
Shuhuai Ren
Aston Zhang
Yi Zhu
Shuai Zhang
Shuai Zheng
Mu Li
Alexander J. Smola
Xu Sun
VPVLM
VLM
84
28
0
10 Apr 2023
SparseFormer: Sparse Visual Recognition via Limited Latent Tokens
Ziteng Gao
Zhan Tong
Limin Wang
Mike Zheng Shou
60
10
0
07 Apr 2023
Micron-BERT: BERT-based Facial Micro-Expression Recognition
Xuan-Bac Nguyen
C. Duong
Xin Li
Susan Gauch
Han-Seok Seo
Khoa Luu
87
57
0
06 Apr 2023
Learning Neural Eigenfunctions for Unsupervised Semantic Segmentation
Zhijie Deng
Yucen Luo
67
6
0
06 Apr 2023
What's in a Name? Beyond Class Indices for Image Recognition
Kai Han
Yandong Li
S. Vaze
Jie Li
Xuhui Jia
VLM
85
7
0
05 Apr 2023
Anatomically aware dual-hop learning for pulmonary embolism detection in CT pulmonary angiograms
Florin Condrea
S. Rapaka
Lucian Itu
Puneet Sharma
J. Sperl
Mohamed Ali
Marius Leordeanu
55
5
0
30 Mar 2023
Towards Understanding the Effect of Pretraining Label Granularity
Guanzhe Hong
Huayu Chen
Ariel Fuxman
Stanley H. Chan
Enming Luo
58
2
0
29 Mar 2023
Visually Wired NFTs: Exploring the Role of Inspiration in Non-Fungible Tokens
Lucio La Cava
Davide Costa
Andrea Tagarelli
82
7
0
29 Mar 2023
The effectiveness of MAE pre-pretraining for billion-scale pretraining
Mannat Singh
Quentin Duval
Kalyan Vasudev Alwala
Haoqi Fan
Vaibhav Aggarwal
...
Piotr Dollár
Christoph Feichtenhofer
Ross B. Girshick
Rohit Girdhar
Ishan Misra
LRM
182
71
0
23 Mar 2023
Large AI Models in Health Informatics: Applications, Challenges, and the Future
Jianing Qiu
Lin Li
Jiankai Sun
Jiachuan Peng
Peilun Shi
...
Bo Xiao
Wu Yuan
Ningli Wang
Dong Xu
Benny Lo
AI4MH
LM&MA
114
140
0
21 Mar 2023
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation
Saikat Roy
Gregor Koehler
Constantin Ulrich
Michael Baumgartner
Jens Petersen
Fabian Isensee
Paul F. Jaeger
Klaus Maier-Hein
ViT
MedIm
101
158
0
17 Mar 2023
CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos
Seungju Han
Jack Hessel
Nouha Dziri
Yejin Choi
Youngjae Yu
VGen
88
19
0
17 Mar 2023
A New Benchmark: On the Utility of Synthetic Data with Blender for Bare Supervised Learning and Downstream Domain Adaptation
Hui Tang
Kui Jia
OOD
84
14
0
16 Mar 2023
Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need
Da-Wei Zhou
Han-Jia Ye
De-Chuan Zhan
Ziwei Liu
CLL
106
111
0
13 Mar 2023
ViM: Vision Middleware for Unified Downstream Transferring
Yutong Feng
Biao Gong
Jianwen Jiang
Yiliang Lv
Yujun Shen
Deli Zhao
Jingren Zhou
98
1
0
13 Mar 2023
Stabilizing Transformer Training by Preventing Attention Entropy Collapse
Shuangfei Zhai
Tatiana Likhomanenko
Etai Littwin
Dan Busbridge
Jason Ramapuram
Yizhe Zhang
Jiatao Gu
J. Susskind
AAML
114
78
0
11 Mar 2023
SLCA: Slow Learner with Classifier Alignment for Continual Learning on a Pre-trained Model
Gengwei Zhang
Liyuan Wang
Guoliang Kang
Ling-Hao Chen
Yunchao Wei
CLL
88
119
0
09 Mar 2023
Out-of-distribution Detection with Implicit Outlier Transformation
Qizhou Wang
Junjie Ye
Feng Liu
Quanyu Dai
Marcus Kalander
Tongliang Liu
Jianye Hao
Bo Han
OODD
208
47
0
09 Mar 2023
Where We Are and What We're Looking At: Query Based Worldwide Image Geo-localization Using Hierarchies and Scenes
Brandon Clark
Alec Kerrigan
P. Kulkarni
V. Cepeda
M. Shah
75
27
0
07 Mar 2023
Predicted Embedding Power Regression for Large-Scale Out-of-Distribution Detection
Han Yang
William R. Gebhardt
Alexander Ororbia
Travis J. Desell
OODD
43
0
0
07 Mar 2023
Data-Efficient Training of CNNs and Transformers with Coresets: A Stability Perspective
Animesh Gupta
Irtiza Hassan
Dilip K. Prasad
D. K. Gupta
52
5
0
03 Mar 2023
Token Contrast for Weakly-Supervised Semantic Segmentation
Lixiang Ru
Heliang Zheng
Yibing Zhan
Bo Du
ViT
106
91
0
02 Mar 2023
Domain-adapted large language models for classifying nuclear medicine reports
Zachary Huemann
Changhee Lee
Junjie Hu
Steve Y. Cho
Tyler Bradshaw
LM&MA
VLM
MedIm
55
16
0
01 Mar 2023
The Trade-off between Universality and Label Efficiency of Representations from Contrastive Learning
Zhenmei Shi
Jiefeng Chen
Kunyang Li
Jayaram Raghuram
Xi Wu
Yingyu Liang
S. Jha
SSL
79
20
0
28 Feb 2023
ArtiFact: A Large-Scale Dataset with Artificial and Factual Images for Generalizable and Robust Synthetic Image Detection
Md Awsafur Rahman
Bishmoy Paul
Najibul Haque Sarker
Zaber Ibn Abdul Hakim
S. Fattah
87
32
0
23 Feb 2023
A framework for benchmarking class-out-of-distribution detection and its application to ImageNet
Ido Galil
Mohammed Dabbah
Ran El-Yaniv
UQCV
82
30
0
23 Feb 2023
What Can We Learn From The Selective Prediction And Uncertainty Estimation Performance Of 523 Imagenet Classifiers
Ido Galil
Mohammed Dabbah
Ran El-Yaniv
UQCV
78
31
0
23 Feb 2023
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?
Yang Chen
Hexiang Hu
Yi Luan
Haitian Sun
Soravit Changpinyo
Alan Ritter
Ming-Wei Chang
133
94
0
23 Feb 2023
Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia Entities
Hexiang Hu
Yi Luan
Yang Chen
Urvashi Khandelwal
Mandar Joshi
Kenton Lee
Kristina Toutanova
Ming-Wei Chang
VLM
124
61
0
22 Feb 2023
Towards Reliable Assessments of Demographic Disparities in Multi-Label Image Classifiers
Melissa Hall
Bobbie Chern
Laura Gustafson
Denisse Ventura
Harshad Kulkarni
Candace Ross
Nicolas Usunier
63
6
0
16 Feb 2023
Less is More: Selective Layer Finetuning with SubTuning
Gal Kaplun
Andrey Gurevich
Tal Swisa
Mazor David
Shai Shalev-Shwartz
Eran Malach
78
9
0
13 Feb 2023
Previous
1
2
3
4
5
6
7
8
9
Next