2104.10972
Cited By
ImageNet-21K Pretraining for the Masses
22 April 2021
T. Ridnik
Emanuel Ben-Baruch
Asaf Noy
Lihi Zelnik-Manor
SSeg
VLM
CLIP
Github (765★)
Papers citing
"ImageNet-21K Pretraining for the Masses"
50 / 427 papers shown
Title
Category-Extensible Out-of-Distribution Detection via Hierarchical Context Descriptions
Kai-Chun Liu
Zhihang Fu
Chao Chen
Sheng Jin
Ze Chen
Mingyuan Tao
Rongxin Jiang
Jieping Ye
VLM
OODD
106
5
0
23 Jul 2024
Exemplar-free Continual Representation Learning via Learnable Drift Compensation
Alex Gomez-Villa
Dipam Goswami
Kai Wang
Andrew D. Bagdanov
Bartlomiej Twardowski
Joost van de Weijer
CLL
SSL
74
11
0
11 Jul 2024
LEMoN: Label Error Detection using Multimodal Neighbors
Haoran Zhang
Aparna Balagopalan
Nassim Oufattole
Hyewon Jeong
Yan Wu
Jiacheng Zhu
Marzyeh Ghassemi
128
0
0
10 Jul 2024
A Single Transformer for Scalable Vision-Language Modeling
Yangyi Chen
Xingyao Wang
Hao Peng
Heng Ji
LRM
107
17
0
08 Jul 2024
Multi-label Learning with Random Circular Vectors
Ken Nishida
Kojiro Machi
Kazuma Onishi
Katsuhiko Hayashi
Hidetaka Kamigaito
89
0
0
08 Jul 2024
Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot Classification
Jiaying Shi
Xuetong Xue
Shenghui Xu
VLM
143
0
0
08 Jul 2024
HiDe-PET: Continual Learning via Hierarchical Decomposition of Parameter-Efficient Tuning
Liyuan Wang
Jingyi Xie
Xingxing Zhang
Hang Su
Jun Zhu
CLL
122
7
0
07 Jul 2024
Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition
Aditya K Surikuchi
Raquel Fernández
Sandro Pezzelle
61
6
0
05 Jul 2024
LayerShuffle: Enhancing Robustness in Vision Transformers by Randomizing Layer Execution Order
Matthias Anton Freiberger
Peter Kun
A. Løvlie
Sebastian Risi
84
0
0
05 Jul 2024
Learning to Be a Transformer to Pinpoint Anomalies
Alex Costanzino
Pierluigi Zama Ramirez
Giuseppe Lisanti
Luigi Di Stefano
95
0
0
04 Jul 2024
Deciphering the Definition of Adversarial Robustness for post-hoc OOD Detectors
Peter Lorenz
Mario Fernandez
Jens Müller
Ullrich Kothe
AAML
244
1
0
21 Jun 2024
CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representation
Muhammad Ali
Salman Khan
VLM
130
15
0
21 Jun 2024
Controlling Forgetting with Test-Time Data in Continual Learning
Vaibhav Singh
Rahaf Aljundi
Eugene Belilovsky
CLL
VLM
KELM
74
3
0
19 Jun 2024
Harnessing Massive Satellite Imagery with Efficient Masked Image Modeling
Fengxiang Wang
H. Wang
Di Wang
Zonghao Guo
Zhenyu Zhong
Long Lan
Wenjing Yang
Jing Zhang
76
3
0
17 Jun 2024
Rethinking the Evaluation of Out-of-Distribution Detection: A Sorites Paradox
Xingming Long
Jie Zhang
Shiguang Shan
Xilin Chen
OODD
81
2
0
14 Jun 2024
EEG-ImageNet: An Electroencephalogram Dataset and Benchmarks with Image Visual Stimuli of Multi-Granularity Labels
Shuqi Zhu
Ziyi Ye
Qingyao Ai
Yiqun Liu
61
2
0
11 Jun 2024
Visual Prompt Tuning in Null Space for Continual Learning
Yue Lu
Shizhou Zhang
De Cheng
Yinghui Xing
N. Wang
Peng Wang
Yanning Zhang
VLM
VPVLM
CLL
93
15
0
09 Jun 2024
Learning 1D Causal Visual Representation with De-focus Attention Networks
Chenxin Tao
Xizhou Zhu
Shiqian Su
Lewei Lu
Changyao Tian
...
Gao Huang
Hongsheng Li
Ping Luo
Jie Zhou
Jifeng Dai
123
1
0
06 Jun 2024
The 3D-PC: a benchmark for visual perspective taking in humans and machines
Drew Linsley
Peisen Zhou
A. Ashok
Akash Nagaraj
Gaurav Gaonkar
Francis E Lewis
Zygmunt Pizlo
Thomas Serre
135
6
0
06 Jun 2024
LADI v2: Multi-label Dataset and Classifiers for Low-Altitude Disaster Imagery
Samuel Scheele
Katherine Picchione
Jeffrey Liu
42
0
0
04 Jun 2024
Scaling Up Deep Clustering Methods Beyond ImageNet-1K
Nikolas Adaloglou
Félix D. P. Michels
Kaspar Senft
Diana Petrusheva
M. Kollmann
107
1
0
03 Jun 2024
From Seedling to Harvest: The GrowingSoy Dataset for Weed Detection in Soy Crops via Instance Segmentation
Raul Steinmetz
V. A. Kich
Henrique Krever
Joao D. Rigo Mazzarolo
Ricardo B. Grando
Vinicius Marini
Celio Trois
Ard Nieuwenhuizen
162
1
0
01 Jun 2024
Communication-Efficient Distributed Deep Learning via Federated Dynamic Averaging
Michail Theologitis
Georgios Frangias
Georgios Anestis
V. Samoladas
Antonios Deligiannakis
FedML
103
0
0
31 May 2024
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision
Junjie Wang
Bin Chen
Bin Kang
Yulin Li
Yichi Chen
Weizhi Xian
Huifeng Chang
VLM
ObjD
84
7
0
28 May 2024
TreeFormers -- An Exploration of Vision Transformers for Deforestation Driver Classification
Uche Ochuba
66
0
0
25 May 2024
Mixture of Experts Meets Prompt-Based Continual Learning
Minh Le
An Nguyen
Huy Nguyen
Trang Nguyen
Trang Pham
L. Ngo
Nhat Ho
CLL
133
14
0
23 May 2024
Configuring Data Augmentations to Reduce Variance Shift in Positional Embedding of Vision Transformers
Bum Jun Kim
Sang Woo Kim
ViT
61
1
0
23 May 2024
FeTT: Continual Class Incremental Learning via Feature Transformation Tuning
Sunyuan Qiang
Xuxin Lin
Yanyan Liang
Jun Wan
Du Zhang
CLL
96
1
0
20 May 2024
Hierarchical Selective Classification
Shani Goren
Ido Galil
Ran El-Yaniv
BDL
93
2
0
19 May 2024
Enhancing Fine-Grained Image Classifications via Cascaded Vision Language Models
Canshi Wei
VLM
62
0
0
18 May 2024
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
Mingxuan Liu
Tyler L. Hayes
Elisa Ricci
G. Csurka
Riccardo Volpi
ObjD
108
3
0
16 May 2024
ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset
Johannes Ruckert
Louise Bloch
Raphael Brüngel
Ahmad Idrissi-Yaghir
Henning Schafer
...
A. G. S. D. Herrera
Henning Müller
Peter A. Horn
F. Nensa
Christoph M. Friedrich
84
33
0
16 May 2024
EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone Training
Yulin Wang
Yang Yue
Rui Lu
Yizeng Han
Shiji Song
Gao Huang
VLM
114
12
0
14 May 2024
Energy-based Hopfield Boosting for Out-of-Distribution Detection
Claus Hofmann
Simon Schmid
Bernhard Lehner
Daniel Klotz
Sepp Hochreiter
OODD
103
9
0
14 May 2024
PUMA: margin-based data pruning
Javier Maroto
Pascal Frossard
AAML
79
1
0
10 May 2024
Parameter-Efficient Fine-Tuning with Discrete Fourier Transform
Ziqi Gao
Qichao Wang
Aochuan Chen
Zijing Liu
Bingzhe Wu
Liang Chen
Jia Li
103
35
0
05 May 2024
Understanding Retrieval-Augmented Task Adaptation for Vision-Language Models
Yifei Ming
Yixuan Li
VLM
127
8
0
02 May 2024
Training a high-performance retinal foundation model with half-the-data and 400 times less compute
Justin Engelmann
Miguel O. Bernabeu
MedIm
OOD
123
1
0
30 Apr 2024
OpenStreetView-5M: The Many Roads to Global Visual Geolocation
Guillaume Astruc
Nicolas Dufour
Ioannis Siglidis
Constantin Aronssohn
Nacim Bouia
...
Charles Raude
Elliot Vincent
Lintao Xu
Hongyu Zhou
Loic Landrieu
87
7
0
29 Apr 2024
Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation
Wenxuan Zhang
Youssef Mohamed
Guohao Li
Philip Torr
Adel Bibi
Mohamed Elhoseiny
CLL
102
5
0
19 Apr 2024
Masked Autoencoders for Microscopy are Scalable Learners of Cellular Biology
Oren Z. Kraus
Kian Kenyon-Dean
Saber Saberian
Maryam Fallah
Peter McLean
...
Chi Vicky Cheng
Kristen Morse
Maureen Makes
Ben Mabey
Berton Earnshaw
70
35
0
16 Apr 2024
Utility-Fairness Trade-Offs and How to Find Them
Sepehr Dehdashtian
Bashir Sadeghi
Vishnu Boddeti
80
6
0
15 Apr 2024
Adapting LLaMA Decoder to Vision Transformer
Jiahao Wang
Wenqi Shao
Mengzhao Chen
Chengyue Wu
Yong Liu
Taiqiang Wu
Kaipeng Zhang
Songyang Zhang
Kai-xiang Chen
Ping Luo
MLLM
85
4
0
10 Apr 2024
Calibrating Higher-Order Statistics for Few-Shot Class-Incremental Learning with Pre-trained Vision Transformers
Dipam Goswami
Bartlomiej Twardowski
Joost van de Weijer
81
4
0
09 Apr 2024
Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training
Ming-Kun Xie
Jia-Hao Xiao
Pei Peng
Gang Niu
Masashi Sugiyama
Sheng-Jun Huang
65
3
0
09 Apr 2024
Hyperbolic Learning with Synthetic Captions for Open-World Detection
Fanjie Kong
Yanbei Chen
Jiarui Cai
Davide Modolo
VLM
ObjD
67
7
0
07 Apr 2024
RaSim: A Range-aware High-fidelity RGB-D Data Simulation Pipeline for Real-world Applications
Xingyu Liu
Chenyangguang Zhang
Gu Wang
Ruida Zhang
Xiangyang Ji
69
1
0
05 Apr 2024
Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration
Akshay Dudhane
Omkar Thawakar
Syed Waqas Zamir
Salman Khan
Fahad Shahbaz Khan
Ming-Hsuan Yang
AI4CE
78
7
0
02 Apr 2024
Convolutional Prompting meets Language Models for Continual Learning
Anurag Roy
Riddhiman Moulick
Vinay Kumar Verma
Saptarshi Ghosh
Abir Das
VLM
CLL
LRM
69
18
0
29 Mar 2024
A Two-Phase Recall-and-Select Framework for Fast Model Selection
Jianwei Cui
Wenhang Shi
Honglin Tao
Wei Lu
Xiaoyong Du
93
0
0
28 Mar 2024