ResearchTrend.AI
ImageNet-21K Pretraining for the Masses
arXiv:2104.10972, v4 (latest)

22 April 2021
T. Ridnik
Emanuel Ben-Baruch
Asaf Noy
Lihi Zelnik-Manor
SSeg, VLM, CLIP
arXiv (abs), PDF, HTML, GitHub (765★)

Papers citing "ImageNet-21K Pretraining for the Masses"

50 / 427 papers shown
Title
Large-scale Dataset Pruning with Dynamic Uncertainty
Muyang He
Shuo Yang
Tiejun Huang
Bo Zhao
79
32
0
08 Jun 2023
ScaleDet: A Scalable Multi-Dataset Object Detector
Yanbei Chen
Manchen Wang
Abhay Mittal
Zhenlin Xu
Paolo Favaro
Joseph Tighe
Davide Modolo
ObjD
51
22
0
08 Jun 2023
Optimizing ViViT Training: Time and Memory Reduction for Action Recognition
Shreyank N. Gowda
Anurag Arnab
Jonathan Huang
ViT
62
4
0
07 Jun 2023
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks
Haiyang Xu
Qinghao Ye
Xuan-Wei Wu
Mingshi Yan
Yuan Miao
...
Qingfang Qian
Maofei Que
Ji Zhang
Xiaoyan Zeng
Feiyan Huang
VLM, MLLM
98
25
0
07 Jun 2023
Centered Self-Attention Layers
Ameen Ali
Tomer Galanti
Lior Wolf
140
8
0
02 Jun 2023
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval
Jiapeng Wang
Chengyu Wang
Xiaodan Wang
Jun Huang
Lianwen Jin
VLM
113
5
0
28 May 2023
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yezhou Yang
CLIP, VLM
129
27
0
23 May 2023
P-NOC: adversarial training of CAM generating networks for robust weakly supervised semantic segmentation priors
L. David
Hélio Pedrini
Z. Dias
GAN
81
1
0
21 May 2023
Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
Raman Dutt
Linus Ericsson
Pedro Sanchez
Sotirios A. Tsaftaris
Timothy M. Hospedales
MedIm
123
55
0
14 May 2023
Efficient Neural Network based Classification and Outlier Detection for Image Moderation using Compressed Sensing and Group Testing
Sabyasachi Ghosh
Sanyam Saxena
Ajit V. Rajwade
69
0
0
12 May 2023
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception
Hassan Akbari
Dan Kondratyuk
Huayu Chen
Rachel Hornung
Haoran Wang
Hartwig Adam
VLM, MoE
105
13
0
10 May 2023
A Survey on the Robustness of Computer Vision Models against Common Corruptions
Shunxin Wang
Raymond N. J. Veldhuis
Christoph Brune
N. Strisciuglio
OOD, VLM
139
14
0
10 May 2023
Enhancing Adversarial Contrastive Learning via Adversarial Invariant Regularization
Xilie Xu
Jingfeng Zhang
Feng Liu
Masashi Sugiyama
Mohan S. Kankanhalli
AAML
111
11
0
30 Apr 2023
Towards Robust Text-Prompted Semantic Criterion for In-the-Wild Video Quality Assessment
Haoning Wu
Liang Liao
Annan Wang
Chaofeng Chen
Jingwen Hou
Wenxiu Sun
Qiong Yan
Weisi Lin
102
15
0
28 Apr 2023
AutoFocusFormer: Image Segmentation off the Grid
Chen Ziwen
K. Patnaik
Shuangfei Zhai
Alvin Wan
Zhile Ren
Alex Schwing
Alex Colburn
Li Fuxin
103
12
0
24 Apr 2023
Benchmarking Low-Shot Robustness to Natural Distribution Shifts
Aaditya K. Singh
Kartik Sarangmath
Prithvijit Chattopadhyay
Judy Hoffman
OOD
90
1
0
21 Apr 2023
DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training
Yihao Chen
Xianbiao Qi
Jianan Wang
Lei Zhang
82
18
0
17 Apr 2023
Unified Out-Of-Distribution Detection: A Model-Specific Perspective
Reza Averly
Wei-Lun Chao
OODD
82
14
0
13 Apr 2023
Pinpointing Why Object Recognition Performance Degrades Across Income Levels and Geographies
Laura Gustafson
Megan Richards
Melissa Hall
C. Hazirbas
Diane Bouchacourt
Mark Ibrahim
66
7
0
11 Apr 2023
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition
Shuhuai Ren
Aston Zhang
Yi Zhu
Shuai Zhang
Shuai Zheng
Mu Li
Alexander J. Smola
Xu Sun
VPVLM, VLM
84
28
0
10 Apr 2023
SparseFormer: Sparse Visual Recognition via Limited Latent Tokens
Ziteng Gao
Zhan Tong
Limin Wang
Mike Zheng Shou
60
10
0
07 Apr 2023
Micron-BERT: BERT-based Facial Micro-Expression Recognition
Xuan-Bac Nguyen
C. Duong
Xin Li
Susan Gauch
Han-Seok Seo
Khoa Luu
87
57
0
06 Apr 2023
Learning Neural Eigenfunctions for Unsupervised Semantic Segmentation
Zhijie Deng
Yucen Luo
67
6
0
06 Apr 2023
What's in a Name? Beyond Class Indices for Image Recognition
Kai Han
Yandong Li
S. Vaze
Jie Li
Xuhui Jia
VLM
85
7
0
05 Apr 2023
Anatomically aware dual-hop learning for pulmonary embolism detection in CT pulmonary angiograms
Florin Condrea
S. Rapaka
Lucian Itu
Puneet Sharma
J. Sperl
Mohamed Ali
Marius Leordeanu
55
5
0
30 Mar 2023
Towards Understanding the Effect of Pretraining Label Granularity
Guanzhe Hong
Huayu Chen
Ariel Fuxman
Stanley H. Chan
Enming Luo
58
2
0
29 Mar 2023
Visually Wired NFTs: Exploring the Role of Inspiration in Non-Fungible Tokens
Lucio La Cava
Davide Costa
Andrea Tagarelli
82
7
0
29 Mar 2023
The effectiveness of MAE pre-pretraining for billion-scale pretraining
Mannat Singh
Quentin Duval
Kalyan Vasudev Alwala
Haoqi Fan
Vaibhav Aggarwal
...
Piotr Dollár
Christoph Feichtenhofer
Ross B. Girshick
Rohit Girdhar
Ishan Misra
LRM
182
71
0
23 Mar 2023
Large AI Models in Health Informatics: Applications, Challenges, and the Future
Jianing Qiu
Lin Li
Jiankai Sun
Jiachuan Peng
Peilun Shi
...
Bo Xiao
Wu Yuan
Ningli Wang
Dong Xu
Benny Lo
AI4MH, LM&MA
114
140
0
21 Mar 2023
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation
Saikat Roy
Gregor Koehler
Constantin Ulrich
Michael Baumgartner
Jens Petersen
Fabian Isensee
Paul F. Jaeger
Klaus Maier-Hein
ViT, MedIm
101
158
0
17 Mar 2023
CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos
Seungju Han
Jack Hessel
Nouha Dziri
Yejin Choi
Youngjae Yu
VGen
88
19
0
17 Mar 2023
A New Benchmark: On the Utility of Synthetic Data with Blender for Bare Supervised Learning and Downstream Domain Adaptation
Hui Tang
Kui Jia
OOD
84
14
0
16 Mar 2023
Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need
Da-Wei Zhou
Han-Jia Ye
De-Chuan Zhan
Ziwei Liu
CLL
106
111
0
13 Mar 2023
ViM: Vision Middleware for Unified Downstream Transferring
Yutong Feng
Biao Gong
Jianwen Jiang
Yiliang Lv
Yujun Shen
Deli Zhao
Jingren Zhou
98
1
0
13 Mar 2023
Stabilizing Transformer Training by Preventing Attention Entropy Collapse
Shuangfei Zhai
Tatiana Likhomanenko
Etai Littwin
Dan Busbridge
Jason Ramapuram
Yizhe Zhang
Jiatao Gu
J. Susskind
AAML
114
78
0
11 Mar 2023
SLCA: Slow Learner with Classifier Alignment for Continual Learning on a Pre-trained Model
Gengwei Zhang
Liyuan Wang
Guoliang Kang
Ling-Hao Chen
Yunchao Wei
CLL
88
119
0
09 Mar 2023
Out-of-distribution Detection with Implicit Outlier Transformation
Qizhou Wang
Junjie Ye
Feng Liu
Quanyu Dai
Marcus Kalander
Tongliang Liu
Jianye Hao
Bo Han
OODD
208
47
0
09 Mar 2023
Where We Are and What We're Looking At: Query Based Worldwide Image Geo-localization Using Hierarchies and Scenes
Brandon Clark
Alec Kerrigan
P. Kulkarni
V. Cepeda
M. Shah
75
27
0
07 Mar 2023
Predicted Embedding Power Regression for Large-Scale Out-of-Distribution Detection
Han Yang
William R. Gebhardt
Alexander Ororbia
Travis J. Desell
OODD
43
0
0
07 Mar 2023
Data-Efficient Training of CNNs and Transformers with Coresets: A Stability Perspective
Animesh Gupta
Irtiza Hassan
Dilip K. Prasad
D. K. Gupta
52
5
0
03 Mar 2023
Token Contrast for Weakly-Supervised Semantic Segmentation
Lixiang Ru
Heliang Zheng
Yibing Zhan
Bo Du
ViT
106
91
0
02 Mar 2023
Domain-adapted large language models for classifying nuclear medicine reports
Zachary Huemann
Changhee Lee
Junjie Hu
Steve Y. Cho
Tyler Bradshaw
LM&MA, VLM, MedIm
55
16
0
01 Mar 2023
The Trade-off between Universality and Label Efficiency of Representations from Contrastive Learning
Zhenmei Shi
Jiefeng Chen
Kunyang Li
Jayaram Raghuram
Xi Wu
Yingyu Liang
S. Jha
SSL
79
20
0
28 Feb 2023
ArtiFact: A Large-Scale Dataset with Artificial and Factual Images for Generalizable and Robust Synthetic Image Detection
Md Awsafur Rahman
Bishmoy Paul
Najibul Haque Sarker
Zaber Ibn Abdul Hakim
S. Fattah
87
32
0
23 Feb 2023
A framework for benchmarking class-out-of-distribution detection and its application to ImageNet
Ido Galil
Mohammed Dabbah
Ran El-Yaniv
UQCV
82
30
0
23 Feb 2023
What Can We Learn From The Selective Prediction And Uncertainty Estimation Performance Of 523 Imagenet Classifiers
Ido Galil
Mohammed Dabbah
Ran El-Yaniv
UQCV
78
31
0
23 Feb 2023
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?
Yang Chen
Hexiang Hu
Yi Luan
Haitian Sun
Soravit Changpinyo
Alan Ritter
Ming-Wei Chang
133
94
0
23 Feb 2023
Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia Entities
Hexiang Hu
Yi Luan
Yang Chen
Urvashi Khandelwal
Mandar Joshi
Kenton Lee
Kristina Toutanova
Ming-Wei Chang
VLM
124
61
0
22 Feb 2023
Towards Reliable Assessments of Demographic Disparities in Multi-Label Image Classifiers
Melissa Hall
Bobbie Chern
Laura Gustafson
Denisse Ventura
Harshad Kulkarni
Candace Ross
Nicolas Usunier
63
6
0
16 Feb 2023
Less is More: Selective Layer Finetuning with SubTuning
Gal Kaplun
Andrey Gurevich
Tal Swisa
Mazor David
Shai Shalev-Shwartz
Eran Malach
78
9
0
13 Feb 2023