v1v2v3v4 (latest)

ImageNet-21K Pretraining for the Masses

22 April 2021

ArXiv (abs)PDF HTML Github (765★)

Papers citing "ImageNet-21K Pretraining for the Masses"

50 / 427 papers shown

Title
Large-scale Dataset Pruning with Dynamic Uncertainty Muyang He Shuo Yang Tiejun Huang Bo Zhao 79 32 0 08 Jun 2023
ScaleDet: A Scalable Multi-Dataset Object Detector Yanbei Chen Manchen Wang Abhay Mittal Zhenlin Xu Paolo Favaro Joseph Tighe Davide Modolo ObjD 51 22 0 08 Jun 2023
Optimizing ViViT Training: Time and Memory Reduction for Action Recognition Shreyank N. Gowda Anurag Arnab Jonathan Huang ViT 62 4 0 07 Jun 2023
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks Haiyang Xu Qinghao Ye Xuan-Wei Wu Mingshi Yan Yuan Miao ... Qingfang Qian Maofei Que Ji Zhang Xiaoyan Zeng Feiyan Huang VLM MLLM 98 25 0 07 Jun 2023
Centered Self-Attention Layers Ameen Ali Tomer Galanti Lior Wolf 140 8 0 02 Jun 2023
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval Jiapeng Wang Chengyu Wang Xiaodan Wang Jun Huang Lianwen Jin VLM 113 5 0 28 May 2023
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model Shuai Zhao Xiaohan Wang Linchao Zhu Yezhou Yang CLIP VLM 129 27 0 23 May 2023
P-NOC: adversarial training of CAM generating networks for robust weakly supervised semantic segmentation priors L. David Hélio Pedrini Z. Dias GAN 81 1 0 21 May 2023
Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity Raman Dutt Linus Ericsson Pedro Sanchez Sotirios A. Tsaftaris Timothy M. Hospedales MedIm 123 55 0 14 May 2023
Efficient Neural Network based Classification and Outlier Detection for Image Moderation using Compressed Sensing and Group Testing Sabyasachi Ghosh Sanyam Saxena Ajit V. Rajwade 69 0 0 12 May 2023
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception Hassan Akbari Dan Kondratyuk Huayu Chen Rachel Hornung Haoran Wang Hartwig Adam VLM MoE 105 13 0 10 May 2023
A Survey on the Robustness of Computer Vision Models against Common Corruptions Shunxin Wang Raymond N. J. Veldhuis Christoph Brune N. Strisciuglio OOD VLM 139 14 0 10 May 2023
Enhancing Adversarial Contrastive Learning via Adversarial Invariant Regularization Xilie Xu Jingfeng Zhang Feng Liu Masashi Sugiyama Mohan S. Kankanhalli AAML 111 11 0 30 Apr 2023
Towards Robust Text-Prompted Semantic Criterion for In-the-Wild Video Quality Assessment Haoning Wu Liang Liao Annan Wang Chaofeng Chen Jingwen Hou Wenxiu Sun Qiong Yan Weisi Lin 102 15 0 28 Apr 2023
AutoFocusFormer: Image Segmentation off the Grid Chen Ziwen K. Patnaik Shuangfei Zhai Alvin Wan Zhile Ren Alex Schwing Alex Colburn Li Fuxin 103 12 0 24 Apr 2023
Benchmarking Low-Shot Robustness to Natural Distribution Shifts Aaditya K. Singh Kartik Sarangmath Prithvijit Chattopadhyay Judy Hoffman OOD 90 1 0 21 Apr 2023
DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training Yihao Chen Xianbiao Qi Jianan Wang Lei Zhang 82 18 0 17 Apr 2023
Unified Out-Of-Distribution Detection: A Model-Specific Perspective Reza Averly Wei-Lun Chao OODD 82 14 0 13 Apr 2023
Pinpointing Why Object Recognition Performance Degrades Across Income Levels and Geographies Laura Gustafson Megan Richards Melissa Hall C. Hazirbas Diane Bouchacourt Mark Ibrahim 66 7 0 11 Apr 2023
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition Shuhuai Ren Aston Zhang Yi Zhu Shuai Zhang Shuai Zheng Mu Li Alexander J. Smola Xu Sun VPVLM VLM 84 28 0 10 Apr 2023
SparseFormer: Sparse Visual Recognition via Limited Latent Tokens Ziteng Gao Zhan Tong Limin Wang Mike Zheng Shou 60 10 0 07 Apr 2023
Micron-BERT: BERT-based Facial Micro-Expression Recognition Xuan-Bac Nguyen C. Duong Xin Li Susan Gauch Han-Seok Seo Khoa Luu 87 57 0 06 Apr 2023
Learning Neural Eigenfunctions for Unsupervised Semantic Segmentation Zhijie Deng Yucen Luo 67 6 0 06 Apr 2023
What's in a Name? Beyond Class Indices for Image Recognition Kai Han Yandong Li S. Vaze Jie Li Xuhui Jia VLM 85 7 0 05 Apr 2023
Anatomically aware dual-hop learning for pulmonary embolism detection in CT pulmonary angiograms Florin Condrea S. Rapaka Lucian Itu Puneet Sharma J. Sperl Mohamed Ali Marius Leordeanu 55 5 0 30 Mar 2023
Towards Understanding the Effect of Pretraining Label Granularity Guanzhe Hong Huayu Chen Ariel Fuxman Stanley H. Chan Enming Luo 58 2 0 29 Mar 2023
Visually Wired NFTs: Exploring the Role of Inspiration in Non-Fungible Tokens Lucio La Cava Davide Costa Andrea Tagarelli 82 7 0 29 Mar 2023
The effectiveness of MAE pre-pretraining for billion-scale pretraining Mannat Singh Quentin Duval Kalyan Vasudev Alwala Haoqi Fan Vaibhav Aggarwal ... Piotr Dollár Christoph Feichtenhofer Ross B. Girshick Rohit Girdhar Ishan Misra LRM 182 71 0 23 Mar 2023
Large AI Models in Health Informatics: Applications, Challenges, and the Future Jianing Qiu Lin Li Jiankai Sun Jiachuan Peng Peilun Shi ... Bo Xiao Wu Yuan Ningli Wang Dong Xu Benny Lo AI4MH LM&MA 114 140 0 21 Mar 2023
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation Saikat Roy Gregor Koehler Constantin Ulrich Michael Baumgartner Jens Petersen Fabian Isensee Paul F. Jaeger Klaus Maier-Hein ViT MedIm 101 158 0 17 Mar 2023
CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos Seungju Han Jack Hessel Nouha Dziri Yejin Choi Youngjae Yu VGen 88 19 0 17 Mar 2023
A New Benchmark: On the Utility of Synthetic Data with Blender for Bare Supervised Learning and Downstream Domain Adaptation Hui Tang Kui Jia OOD 84 14 0 16 Mar 2023
Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need Da-Wei Zhou Han-Jia Ye De-Chuan Zhan Ziwei Liu CLL 106 111 0 13 Mar 2023
ViM: Vision Middleware for Unified Downstream Transferring Yutong Feng Biao Gong Jianwen Jiang Yiliang Lv Yujun Shen Deli Zhao Jingren Zhou 98 1 0 13 Mar 2023
Stabilizing Transformer Training by Preventing Attention Entropy Collapse Shuangfei Zhai Tatiana Likhomanenko Etai Littwin Dan Busbridge Jason Ramapuram Yizhe Zhang Jiatao Gu J. Susskind AAML 114 78 0 11 Mar 2023
SLCA: Slow Learner with Classifier Alignment for Continual Learning on a Pre-trained Model Gengwei Zhang Liyuan Wang Guoliang Kang Ling-Hao Chen Yunchao Wei CLL 88 119 0 09 Mar 2023
Out-of-distribution Detection with Implicit Outlier Transformation Qizhou Wang Junjie Ye Feng Liu Quanyu Dai Marcus Kalander Tongliang Liu Jianye Hao Bo Han OODD 208 47 0 09 Mar 2023
Where We Are and What We're Looking At: Query Based Worldwide Image Geo-localization Using Hierarchies and Scenes Brandon Clark Alec Kerrigan P. Kulkarni V. Cepeda M. Shah 75 27 0 07 Mar 2023
Predicted Embedding Power Regression for Large-Scale Out-of-Distribution Detection Han Yang William R. Gebhardt Alexander Ororbia Travis J. Desell OODD 43 0 0 07 Mar 2023
Data-Efficient Training of CNNs and Transformers with Coresets: A Stability Perspective Animesh Gupta Irtiza Hassan Dilip K. Prasad D. K. Gupta 52 5 0 03 Mar 2023
Token Contrast for Weakly-Supervised Semantic Segmentation Lixiang Ru Heliang Zheng Yibing Zhan Bo Du ViT 106 91 0 02 Mar 2023
Domain-adapted large language models for classifying nuclear medicine reports Zachary Huemann Changhee Lee Junjie Hu Steve Y. Cho Tyler Bradshaw LM&MA VLM MedIm 55 16 0 01 Mar 2023
The Trade-off between Universality and Label Efficiency of Representations from Contrastive Learning Zhenmei Shi Jiefeng Chen Kunyang Li Jayaram Raghuram Xi Wu Yingyu Liang S. Jha SSL 79 20 0 28 Feb 2023
ArtiFact: A Large-Scale Dataset with Artificial and Factual Images for Generalizable and Robust Synthetic Image Detection Md Awsafur Rahman Bishmoy Paul Najibul Haque Sarker Zaber Ibn Abdul Hakim S. Fattah 87 32 0 23 Feb 2023
A framework for benchmarking class-out-of-distribution detection and its application to ImageNet Ido Galil Mohammed Dabbah Ran El-Yaniv UQCV 82 30 0 23 Feb 2023
What Can We Learn From The Selective Prediction And Uncertainty Estimation Performance Of 523 Imagenet Classifiers Ido Galil Mohammed Dabbah Ran El-Yaniv UQCV 78 31 0 23 Feb 2023
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions? Yang Chen Hexiang Hu Yi Luan Haitian Sun Soravit Changpinyo Alan Ritter Ming-Wei Chang 133 94 0 23 Feb 2023
Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia Entities Hexiang Hu Yi Luan Yang Chen Urvashi Khandelwal Mandar Joshi Kenton Lee Kristina Toutanova Ming-Wei Chang VLM 124 61 0 22 Feb 2023
Towards Reliable Assessments of Demographic Disparities in Multi-Label Image Classifiers Melissa Hall Bobbie Chern Laura Gustafson Denisse Ventura Harshad Kulkarni Candace Ross Nicolas Usunier 63 6 0 16 Feb 2023
Less is More: Selective Layer Finetuning with SubTuning Gal Kaplun Andrey Gurevich Tal Swisa Mazor David Shai Shalev-Shwartz Eran Malach 78 9 0 13 Feb 2023