Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.10972
Cited By
v1
v2
v3
v4 (latest)
ImageNet-21K Pretraining for the Masses
22 April 2021
T. Ridnik
Emanuel Ben-Baruch
Asaf Noy
Lihi Zelnik-Manor
SSeg
VLM
CLIP
Re-assign community
ArXiv (abs)
PDF
HTML
Github (765★)
Papers citing
"ImageNet-21K Pretraining for the Masses"
50 / 427 papers shown
Title
Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework
Jiuyi Xu
Meida Chen
Andrew Feng
Yangming Shi
Zifan Yu
91
0
0
09 Dec 2024
Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval
Leah Bar
Boaz Lerner
N. Darshan
Rami Ben-Ari
VLM
150
1
0
03 Dec 2024
Extending Video Masked Autoencoders to 128 frames
N. B. Gundavarapu
Luke Friedman
Raghav Goyal
Chaitra Hegde
Eirikur Agustsson
...
Mikhail Sirotenko
Ming-Hsuan Yang
Tobias Weyand
Boqing Gong
Leonid Sigal
118
1
0
20 Nov 2024
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation
Yuheng Shi
Minjing Dong
Chang Xu
VLM
118
3
0
14 Nov 2024
GCI-ViTAL: Gradual Confidence Improvement with Vision Transformers for Active Learning on Label Noise
Moseli Motsóehli
Kyungim Baek
90
1
0
08 Nov 2024
AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
Anil Kag
Huseyin Coskun
Jierun Chen
Junli Cao
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
Jian Ren
93
3
0
07 Nov 2024
MambaPEFT: Exploring Parameter-Efficient Fine-Tuning for Mamba
Masakazu Yoshimura
Teruaki Hayashi
Yota Maeda
Mamba
300
2
0
06 Nov 2024
An Application-Agnostic Automatic Target Recognition System Using Vision Language Models
Anthony Palladino
Dana Gajewski
Abigail Aronica
Patryk Deptula
Alexander Hamme
...
Jeff Muri
Todd Nelling
Michael A. Riley
Brian Wong
Margaret Duff
39
1
0
05 Nov 2024
Local Lesion Generation is Effective for Capsule Endoscopy Image Data Augmentation in a Limited Data Setting
Adrian B. Chłopowiec
Adam R. Chłopowiec
Krzysztof Galus
Wojciech Cebula
Martin Tabakov
MedIm
58
0
0
05 Nov 2024
Confidence Calibration of Classifiers with Many Classes
Adrien LeCoz
Stéphane Herbin
Faouzi Adjed
UQCV
82
1
0
05 Nov 2024
ViTally Consistent: Scaling Biological Representation Learning for Cell Microscopy
Kian Kenyon-Dean
Zitong Jerry Wang
John Urbanik
Konstantin Donhauser
Jason Hartford
...
Safiye Celik
Marta M. Fay
Juan Sebastian Rodriguez Vera
I. Haque
Oren Z. Kraus
MedIm
113
6
0
04 Nov 2024
Video Token Merging for Long-form Video Understanding
Seon-Ho Lee
Jue Wang
Zhikang Zhang
D. Fan
Xinyu Li
95
6
0
31 Oct 2024
Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach
Mathilde Caron
Alireza Fathi
Cordelia Schmid
Ahmet Iscen
67
2
0
31 Oct 2024
Long-Tailed Out-of-Distribution Detection via Normalized Outlier Distribution Adaptation
Wenjun Miao
Guansong Pang
Jin Zheng
Xiao Bai
OODD
134
3
0
28 Oct 2024
Vector Quantization Prompting for Continual Learning
L. Jiao
Qiuxia Lai
Yu LI
Qiang Xu
VLM
CLL
62
5
0
27 Oct 2024
PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context
Maximilian Augustin
Syed Shakib Sarwar
Mostafa Elhoushi
Sai Qian Zhang
Yuecheng Li
B. D. Salvo
66
1
0
23 Oct 2024
Closed-form merging of parameter-efficient modules for Federated Continual Learning
Riccardo Salami
Pietro Buzzega
Matteo Mosconi
Jacopo Bonato
Luigi Sabetta
Simone Calderara
FedML
MoMe
CLL
111
4
0
23 Oct 2024
Are Large-scale Soft Labels Necessary for Large-scale Dataset Distillation?
Lingao Xiao
Yang He
DD
91
7
0
21 Oct 2024
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary
Hao-Tang Tsui
Chien-Yao Wang
H. Liao
ObjD
VLM
153
0
0
20 Oct 2024
LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning
Yiming Shi
Jiwei Wei
Yujia Wu
Ran Ran
Chengwei Sun
Shiyuan He
Yang Yang
ALM
97
1
0
17 Oct 2024
The Non-Local Model Merging Problem: Permutation Symmetries and Variance Collapse
Ekansh Sharma
Daniel M. Roy
Gintare Karolina Dziugaite
MoMe
77
4
0
16 Oct 2024
Stylistic Multi-Task Analysis of Ukiyo-e Woodblock Prints
Selina Khan
Nanne van Noord
124
4
0
16 Oct 2024
Locality Alignment Improves Vision-Language Models
Ian Covert
Tony Sun
James Zou
Tatsunori Hashimoto
VLM
265
7
0
14 Oct 2024
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
Wenlong Deng
Yize Zhao
V. Vakilian
Minghui Chen
Xiaoxiao Li
Christos Thrampoulidis
219
7
0
12 Oct 2024
CL3: A Collaborative Learning Framework for the Medical Data Ensuring Data Privacy in the Hyperconnected Environment
Mohamamd Zavid Parvez
R. Islam
Md Zahidul Islam
36
0
0
10 Oct 2024
When the Small-Loss Trick is Not Enough: Multi-Label Image Classification with Noisy Labels Applied to CCTV Sewer Inspections
Keryan Chelouche
Marie Lachaize
Marine Bernard
Louise Olgiati
Remi Cuingnet
NoLa
58
0
0
10 Oct 2024
MONICA: Benchmarking on Long-tailed Medical Image Classification
Lie Ju
Siyuan Yan
Yukun Zhou
Yang Nan
Xiaodan Xing
Peibo Duan
Zongyuan Ge
134
0
0
02 Oct 2024
Task-Oriented Pre-Training for Drivable Area Detection
Fulong Ma
Guoyang Zhao
Weiqing Qi
Ming Liu
Jun Ma
VLM
64
1
0
30 Sep 2024
CBAM-SwinT-BL: Small Rail Surface Defect Detection Method Based on Swin Transformer with Block Level CBAM Enhancement
Jiayi Zhao
Alison Wun-lam Yeung
Ali Muhammad
Songjiang Lai
Vincent To-Yee NG
48
3
0
30 Sep 2024
Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation
Kun Yuan
V. Srivastav
Nassir Navab
N. Padoy
122
9
0
30 Sep 2024
Crafting Distribution Shifts for Validation and Training in Single Source Domain Generalization
Nikos Efthymiadis
Giorgos Tolias
Ondřej Chum
OOD
86
2
0
29 Sep 2024
DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation
Soojin Jang
Jungmin Yun
Junehyoung Kwon
Eunju Lee
Youngbin Kim
104
3
0
24 Sep 2024
Lessons and Insights from a Unifying Study of Parameter-Efficient Fine-Tuning (PEFT) in Visual Recognition
Zheda Mai
Ping Zhang
Cheng-Hao Tu
Hong-You Chen
Li Zhang
Wei-Lun Chao
52
1
0
24 Sep 2024
Exploring Fine-grained Retail Product Discrimination with Zero-shot Object Classification Using Vision-Language Models
Anil Osman Tur
Alessandro Conti
Cigdem Beyan
Davide Boscaini
Roberto Larcher
S. Messelodi
Fabio Poiesi
Elisa Ricci
VLM
108
0
0
23 Sep 2024
OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition
Stephen Zhang
Vardan Papyan
VLM
162
3
0
20 Sep 2024
Revisiting Prompt Pretraining of Vision-Language Models
Zhenyuan Chen
Lingfeng Yang
Shuo Chen
Zhaowei Chen
Jiajun Liang
Xiang Li
MLLM
VPVLM
VLM
121
2
0
10 Sep 2024
The AdEMAMix Optimizer: Better, Faster, Older
Matteo Pagliardini
Pierre Ablin
David Grangier
ODL
91
13
0
05 Sep 2024
SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation
Alberto Bacchin
Davide Allegro
Stefano Ghidoni
Emanuele Menegatti
83
1
0
02 Sep 2024
ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images
Xiaoshuai Zhang
Zhicheng Wang
Howard Zhou
Soham Ghosh
Danushen Gnanapragasam
Varun Jampani
Hao Su
Leonidas Guibas
DD
91
5
0
30 Aug 2024
FungiTastic: A multi-modal dataset and benchmark for image categorization
Lukás Picek
Klara Janouskova
Milan Šulc
Jirí Matas
140
1
0
24 Aug 2024
SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training
Gengwei Zhang
Liyuan Wang
Guoliang Kang
Ling Chen
Yunchao Wei
VLM
CLL
68
7
0
15 Aug 2024
Navigating Data Scarcity using Foundation Models: A Benchmark of Few-Shot and Zero-Shot Learning Approaches in Medical Imaging
S. Woerner
Christian F. Baumgartner
VLM
MedIm
55
0
0
15 Aug 2024
Masked Image Modeling: A Survey
Vlad Hondru
Florinel-Alin Croitoru
Shervin Minaee
Radu Tudor Ionescu
N. Sebe
189
8
0
13 Aug 2024
Effect of Kernel Size on CNN-Vision-Transformer-Based Gaze Prediction Using Electroencephalography Data
Chuhui Qiu
Bugao Liang
Matthew L. Key
97
0
0
06 Aug 2024
Human-inspired Explanations for Vision Transformers and Convolutional Neural Networks
Mahadev Prasad Panda
Matteo Tiezzi
Martina Vilas
Gemma Roig
Bjoern M. Eskofier
Dario Zanca
ViT
AAML
95
1
0
04 Aug 2024
Resilience and Security of Deep Neural Networks Against Intentional and Unintentional Perturbations: Survey and Research Challenges
Sazzad Sayyed
Milin Zhang
Shahriar Rifat
A. Swami
Michael De Lucia
Francesco Restuccia
106
1
0
31 Jul 2024
Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey
Atsuyuki Miyai
Jingkang Yang
Jingyang Zhang
Yifei Ming
Sisir Dhakal
...
Yixuan Li
Hai "Helen" Li
Ziwei Liu
Toshihiko Yamasaki
Kiyoharu Aizawa
135
13
0
31 Jul 2024
Parameter-Efficient Fine-Tuning via Circular Convolution
Aochuan Chen
Jiashun Cheng
Zijing Liu
Ziqi Gao
Fugee Tsung
Yu-Feng Li
Jia Li
148
3
0
27 Jul 2024
PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery
Fernando Julio Cendra
Bingchen Zhao
Kai Han
VLM
CLL
102
6
0
26 Jul 2024
Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective
Jingren Liu
Zhong Ji
YunLong Yu
Jiale Cao
Yanwei Pang
Jungong Han
Xuelong Li
CLL
142
5
0
24 Jul 2024
Previous
1
2
3
4
5
6
7
8
9
Next