v1v2v3v4 (latest)

ImageNet-21K Pretraining for the Masses

22 April 2021

ArXiv (abs)PDF HTML Github (765★)

Papers citing "ImageNet-21K Pretraining for the Masses"

50 / 427 papers shown

Title
Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework Jiuyi Xu Meida Chen Andrew Feng Yangming Shi Zifan Yu 91 0 0 09 Dec 2024
Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval Leah Bar Boaz Lerner N. Darshan Rami Ben-Ari VLM 150 1 0 03 Dec 2024
Extending Video Masked Autoencoders to 128 frames N. B. Gundavarapu Luke Friedman Raghav Goyal Chaitra Hegde Eirikur Agustsson ... Mikhail Sirotenko Ming-Hsuan Yang Tobias Weyand Boqing Gong Leonid Sigal 118 1 0 20 Nov 2024
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation Yuheng Shi Minjing Dong Chang Xu VLM 118 3 0 14 Nov 2024
GCI-ViTAL: Gradual Confidence Improvement with Vision Transformers for Active Learning on Label Noise Moseli Motsóehli Kyungim Baek 90 1 0 08 Nov 2024
AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation Anil Kag Huseyin Coskun Jierun Chen Junli Cao Willi Menapace Aliaksandr Siarohin Sergey Tulyakov Jian Ren 93 3 0 07 Nov 2024
MambaPEFT: Exploring Parameter-Efficient Fine-Tuning for Mamba Masakazu Yoshimura Teruaki Hayashi Yota Maeda Mamba 300 2 0 06 Nov 2024
An Application-Agnostic Automatic Target Recognition System Using Vision Language Models Anthony Palladino Dana Gajewski Abigail Aronica Patryk Deptula Alexander Hamme ... Jeff Muri Todd Nelling Michael A. Riley Brian Wong Margaret Duff 39 1 0 05 Nov 2024
Local Lesion Generation is Effective for Capsule Endoscopy Image Data Augmentation in a Limited Data Setting Adrian B. Chłopowiec Adam R. Chłopowiec Krzysztof Galus Wojciech Cebula Martin Tabakov MedIm 58 0 0 05 Nov 2024
Confidence Calibration of Classifiers with Many Classes Adrien LeCoz Stéphane Herbin Faouzi Adjed UQCV 82 1 0 05 Nov 2024
ViTally Consistent: Scaling Biological Representation Learning for Cell Microscopy Kian Kenyon-Dean Zitong Jerry Wang John Urbanik Konstantin Donhauser Jason Hartford ... Safiye Celik Marta M. Fay Juan Sebastian Rodriguez Vera I. Haque Oren Z. Kraus MedIm 113 6 0 04 Nov 2024
Video Token Merging for Long-form Video Understanding Seon-Ho Lee Jue Wang Zhikang Zhang D. Fan Xinyu Li 95 6 0 31 Oct 2024
Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach Mathilde Caron Alireza Fathi Cordelia Schmid Ahmet Iscen 67 2 0 31 Oct 2024
Long-Tailed Out-of-Distribution Detection via Normalized Outlier Distribution Adaptation Wenjun Miao Guansong Pang Jin Zheng Xiao Bai OODD 134 3 0 28 Oct 2024
Vector Quantization Prompting for Continual Learning L. Jiao Qiuxia Lai Yu LI Qiang Xu VLM CLL 62 5 0 27 Oct 2024
PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context Maximilian Augustin Syed Shakib Sarwar Mostafa Elhoushi Sai Qian Zhang Yuecheng Li B. D. Salvo 66 1 0 23 Oct 2024
Closed-form merging of parameter-efficient modules for Federated Continual Learning Riccardo Salami Pietro Buzzega Matteo Mosconi Jacopo Bonato Luigi Sabetta Simone Calderara FedML MoMe CLL 111 4 0 23 Oct 2024
Are Large-scale Soft Labels Necessary for Large-scale Dataset Distillation? Lingao Xiao Yang He DD 91 7 0 21 Oct 2024
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary Hao-Tang Tsui Chien-Yao Wang H. Liao ObjD VLM 153 0 0 20 Oct 2024
LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning Yiming Shi Jiwei Wei Yujia Wu Ran Ran Chengwei Sun Shiyuan He Yang Yang ALM 97 1 0 17 Oct 2024
The Non-Local Model Merging Problem: Permutation Symmetries and Variance Collapse Ekansh Sharma Daniel M. Roy Gintare Karolina Dziugaite MoMe 77 4 0 16 Oct 2024
Stylistic Multi-Task Analysis of Ukiyo-e Woodblock Prints Selina Khan Nanne van Noord 124 4 0 16 Oct 2024
Locality Alignment Improves Vision-Language Models Ian Covert Tony Sun James Zou Tatsunori Hashimoto VLM 265 7 0 14 Oct 2024
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models Wenlong Deng Yize Zhao V. Vakilian Minghui Chen Xiaoxiao Li Christos Thrampoulidis 219 7 0 12 Oct 2024
CL3: A Collaborative Learning Framework for the Medical Data Ensuring Data Privacy in the Hyperconnected Environment Mohamamd Zavid Parvez R. Islam Md Zahidul Islam 36 0 0 10 Oct 2024
When the Small-Loss Trick is Not Enough: Multi-Label Image Classification with Noisy Labels Applied to CCTV Sewer Inspections Keryan Chelouche Marie Lachaize Marine Bernard Louise Olgiati Remi Cuingnet NoLa 58 0 0 10 Oct 2024
MONICA: Benchmarking on Long-tailed Medical Image Classification Lie Ju Siyuan Yan Yukun Zhou Yang Nan Xiaodan Xing Peibo Duan Zongyuan Ge 134 0 0 02 Oct 2024
Task-Oriented Pre-Training for Drivable Area Detection Fulong Ma Guoyang Zhao Weiqing Qi Ming Liu Jun Ma VLM 64 1 0 30 Sep 2024
CBAM-SwinT-BL: Small Rail Surface Defect Detection Method Based on Swin Transformer with Block Level CBAM Enhancement Jiayi Zhao Alison Wun-lam Yeung Ali Muhammad Songjiang Lai Vincent To-Yee NG 48 3 0 30 Sep 2024
Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation Kun Yuan V. Srivastav Nassir Navab N. Padoy 122 9 0 30 Sep 2024
Crafting Distribution Shifts for Validation and Training in Single Source Domain Generalization Nikos Efthymiadis Giorgos Tolias Ondřej Chum OOD 86 2 0 29 Sep 2024
DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation Soojin Jang Jungmin Yun Junehyoung Kwon Eunju Lee Youngbin Kim 104 3 0 24 Sep 2024
Lessons and Insights from a Unifying Study of Parameter-Efficient Fine-Tuning (PEFT) in Visual Recognition Zheda Mai Ping Zhang Cheng-Hao Tu Hong-You Chen Li Zhang Wei-Lun Chao 52 1 0 24 Sep 2024
Exploring Fine-grained Retail Product Discrimination with Zero-shot Object Classification Using Vision-Language Models Anil Osman Tur Alessandro Conti Cigdem Beyan Davide Boscaini Roberto Larcher S. Messelodi Fabio Poiesi Elisa Ricci VLM 108 0 0 23 Sep 2024
OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition Stephen Zhang Vardan Papyan VLM 162 3 0 20 Sep 2024
Revisiting Prompt Pretraining of Vision-Language Models Zhenyuan Chen Lingfeng Yang Shuo Chen Zhaowei Chen Jiajun Liang Xiang Li MLLM VPVLM VLM 121 2 0 10 Sep 2024
The AdEMAMix Optimizer: Better, Faster, Older Matteo Pagliardini Pierre Ablin David Grangier ODL 91 13 0 05 Sep 2024
SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation Alberto Bacchin Davide Allegro Stefano Ghidoni Emanuele Menegatti 83 1 0 02 Sep 2024
ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images Xiaoshuai Zhang Zhicheng Wang Howard Zhou Soham Ghosh Danushen Gnanapragasam Varun Jampani Hao Su Leonidas Guibas DD 91 5 0 30 Aug 2024
FungiTastic: A multi-modal dataset and benchmark for image categorization Lukás Picek Klara Janouskova Milan Šulc Jirí Matas 140 1 0 24 Aug 2024
SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training Gengwei Zhang Liyuan Wang Guoliang Kang Ling Chen Yunchao Wei VLM CLL 68 7 0 15 Aug 2024
Navigating Data Scarcity using Foundation Models: A Benchmark of Few-Shot and Zero-Shot Learning Approaches in Medical Imaging S. Woerner Christian F. Baumgartner VLM MedIm 55 0 0 15 Aug 2024
Masked Image Modeling: A Survey Vlad Hondru Florinel-Alin Croitoru Shervin Minaee Radu Tudor Ionescu N. Sebe 189 8 0 13 Aug 2024
Effect of Kernel Size on CNN-Vision-Transformer-Based Gaze Prediction Using Electroencephalography Data Chuhui Qiu Bugao Liang Matthew L. Key 97 0 0 06 Aug 2024
Human-inspired Explanations for Vision Transformers and Convolutional Neural Networks Mahadev Prasad Panda Matteo Tiezzi Martina Vilas Gemma Roig Bjoern M. Eskofier Dario Zanca ViT AAML 95 1 0 04 Aug 2024
Resilience and Security of Deep Neural Networks Against Intentional and Unintentional Perturbations: Survey and Research Challenges Sazzad Sayyed Milin Zhang Shahriar Rifat A. Swami Michael De Lucia Francesco Restuccia 106 1 0 31 Jul 2024
Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey Atsuyuki Miyai Jingkang Yang Jingyang Zhang Yifei Ming Sisir Dhakal ... Yixuan Li Hai "Helen" Li Ziwei Liu Toshihiko Yamasaki Kiyoharu Aizawa 135 13 0 31 Jul 2024
Parameter-Efficient Fine-Tuning via Circular Convolution Aochuan Chen Jiashun Cheng Zijing Liu Ziqi Gao Fugee Tsung Yu-Feng Li Jia Li 148 3 0 27 Jul 2024
PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery Fernando Julio Cendra Bingchen Zhao Kai Han VLM CLL 102 6 0 26 Jul 2024
Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective Jingren Liu Zhong Ji YunLong Yu Jiale Cao Yanwei Pang Jungong Han Xuelong Li CLL 142 5 0 24 Jul 2024