Are we done with ImageNet?

12 June 2020
Lucas Beyer
Olivier J. Hénaff
Alexander Kolesnikov
Xiaohua Zhai
Aaron van den Oord
    VLM

Papers citing "Are we done with ImageNet?"

50 / 86 papers shown
TULIP: Towards Unified Language-Image Pretraining
Zineng Tang
Long Lian
Seun Eisape
Xudong Wang
Roei Herzig
Adam Yala
Alane Suhr
Trevor Darrell
David M. Chan
VLM
CLIP
MLLM
103
3
0
19 Mar 2025
Detecting Systematic Weaknesses in Vision Models along Predefined Human-Understandable Dimensions
Sujan Sai Gannamaneni
Rohil Prakash Rao
Michael Mock
Maram Akila
Stefan Wrobel
AAML
136
0
0
17 Feb 2025
Self-supervised Benchmark Lottery on ImageNet: Do Marginal Improvements Translate to Improvements on Similar Datasets?
Utku Ozbulak
Esla Timothy Anzaku
Solha Kang
W. D. Neve
J. Vankerschaver
50
0
0
28 Jan 2025
Probabilistic Language-Image Pre-Training
Sanghyuk Chun
Wonjae Kim
Song Park
Sangdoo Yun
MLLM
VLM
CLIP
117
4
2
24 Oct 2024
An Embedding is Worth a Thousand Noisy Labels
Francesco Di Salvo
Sebastian Doerrich
Ines Rieger
Christian Ledig
NoLa
71
0
0
26 Aug 2024
FungiTastic: A multi-modal dataset and benchmark for image categorization
Lukáš Picek
Klara Janouskova
Milan Šulc
Jiří Matas
77
1
0
24 Aug 2024
Dataset Distillation in Medical Imaging: A Feasibility Study
Muyang Li
Can Cui
Quan Liu
Ruining Deng
Tianyuan Yao
Marilyn Lionts
Yuankai Huo
OOD
DD
51
2
0
19 Jul 2024
LookHere: Vision Transformers with Directed Attention Generalize and Extrapolate
A. Fuller
Daniel G. Kyrollos
Yousef Yassin
James R. Green
43
2
0
22 May 2024
Efficient Title Reranker for Fast and Improved Knowledge-Intense NLP
Ziyi Chen
Jize Jiang
Daqian Zuo
Heyi Tao
Jun Yang
Yuxiang Wei
26
0
0
19 Dec 2023
Improve Supervised Representation Learning with Masked Image Modeling
Kaifeng Chen
Daniel M. Salz
Huiwen Chang
Kihyuk Sohn
Dilip Krishnan
Mojtaba Seyedhosseini
SSL
ViT
37
3
0
01 Dec 2023
Distributionally Robust Classification on a Data Budget
Ben Feuer
Ameya Joshi
Minh Pham
C. Hegde
OOD
29
2
0
07 Aug 2023
Get the Best of Both Worlds: Improving Accuracy and Transferability by Grassmann Class Representation
Haoqi Wang
Zhizhong Li
Wayne Zhang
15
2
0
03 Aug 2023
An Empirical Study of Pre-trained Model Selection for Out-of-Distribution Generalization and Calibration
Hiroki Naganuma
Ryuichiro Hataya
Kotaro Yoshida
Ioannis Mitliagkas
OODD
89
1
0
17 Jul 2023
Robust Feature Learning Against Noisy Labels
Tsung-Ming Tai
Yun-Jie Jhang
Wen-Jyi Hwang
NoLa
18
1
0
10 Jul 2023
A Comprehensive Study on the Robustness of Image Classification and Object Detection in Remote Sensing: Surveying and Benchmarking
Shaohui Mei
Jiawei Lian
Xiaofei Wang
Yuru Su
Mingyang Ma
Lap-Pui Chau
AAML
23
11
0
21 Jun 2023
PaLI-X: On Scaling up a Multilingual Vision and Language Model
Xi Chen
Josip Djolonga
Piotr Padlewski
Basil Mustafa
Soravit Changpinyo
...
Mojtaba Seyedhosseini
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
VLM
46
187
0
29 May 2023
Diversifying Deep Ensembles: A Saliency Map Approach for Enhanced OOD Detection, Calibration, and Accuracy
Stanislav Dereka
I. Karpukhin
Maksim Zhdanov
Sergey Kolesnikov
30
0
0
19 May 2023
Incremental Image Labeling via Iterative Refinement
Fausto Giunchiglia
Xiaolei Diao
Mayukh Bagchi
11
1
0
18 Apr 2023
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
...
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLM
CLIP
SSL
101
3,017
0
14 Apr 2023
Learning Personalized Decision Support Policies
Umang Bhatt
Valerie Chen
Katherine M. Collins
Parameswaran Kamalaruban
Emma Kallina
Adrian Weller
Ameet Talwalkar
OffRL
48
10
0
13 Apr 2023
Bridging the Gap between Model Explanations in Partially Annotated Multi-label Classification
Youngwook Kim
Jae Myung Kim
Ji-Eun Jeong
Cordelia Schmid
Zeynep Akata
Jungwook Lee
19
7
0
04 Apr 2023
Sigmoid Loss for Language Image Pre-Training
Xiaohua Zhai
Basil Mustafa
Alexander Kolesnikov
Lucas Beyer
CLIP
VLM
19
935
0
27 Mar 2023
Vision Transformer with Quadrangle Attention
Qiming Zhang
Jing Zhang
Yufei Xu
Dacheng Tao
ViT
19
38
0
27 Mar 2023
A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking
Chang-Shu Liu
Yinpeng Dong
Wenzhao Xiang
X. Yang
Hang Su
Junyi Zhu
YueFeng Chen
Yuan He
H. Xue
Shibao Zheng
OOD
VLM
AAML
27
72
0
28 Feb 2023
Efficiency 360: Efficient Vision Transformers
Badri N. Patro
Vijay Srinivas Agneeswaran
26
6
0
16 Feb 2023
Symbolic Discovery of Optimization Algorithms
Xiangning Chen
Chen Liang
Da Huang
Esteban Real
Kaiyuan Wang
...
Xuanyi Dong
Thang Luong
Cho-Jui Hsieh
Yifeng Lu
Quoc V. Le
50
350
0
13 Feb 2023
Scaling Vision Transformers to 22 Billion Parameters
Mostafa Dehghani
Josip Djolonga
Basil Mustafa
Piotr Padlewski
Jonathan Heek
...
Mario Lučić
Xiaohua Zhai
Daniel Keysers
Jeremiah Harmsen
N. Houlsby
MLLM
61
569
0
10 Feb 2023
Benchmarking Robustness to Adversarial Image Obfuscations
Florian Stimberg
Ayan Chakrabarti
Chun-Ta Lu
Hussein Hazimeh
Otilia Stretcu
...
Merve Kaya
Cyrus Rashtchian
Ariel Fuxman
Mehmet Tek
Sven Gowal
AAML
24
10
0
30 Jan 2023
Diverse, Difficult, and Odd Instances (D2O): A New Test Set for Object Classification
Ali Borji
VLM
37
0
0
29 Jan 2023
Improving Zero-shot Generalization and Robustness of Multi-modal Models
Yunhao Ge
Jie Jessie Ren
Andrew Gallagher
Yuxiao Wang
Ming Yang
Hartwig Adam
Laurent Itti
Balaji Lakshminarayanan
Jiaping Zhao
VLM
22
33
0
04 Dec 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
CLIP
58
673
0
14 Nov 2022
A simple, efficient and scalable contrastive masked autoencoder for learning visual representations
Shlok Kumar Mishra
Joshua Robinson
Huiwen Chang
David Jacobs
Aaron Sarna
Aaron Maschinot
Dilip Krishnan
DiffM
43
30
0
30 Oct 2022
Distance Based Image Classification: A solution to generative classification's conundrum?
Wen-Yan Lin
Siying Liu
B. Dai
Hongdong Li
VLM
35
2
0
04 Oct 2022
A systematic review of the use of Deep Learning in Satellite Imagery for Agriculture
Brandon Victor
Zhen He
Aiden Nibali
22
9
0
03 Oct 2022
Stop Wasting My Time! Saving Days of ImageNet and BERT Training with Latest Weight Averaging
Jean Kaddour
MoMe
3DH
19
39
0
29 Sep 2022
Bugs in the Data: How ImageNet Misrepresents Biodiversity
A. Luccioni
David Rolnick
19
43
0
24 Aug 2022
Visual correspondence-based explanations improve AI robustness and human-AI team accuracy
Giang Nguyen
Mohammad Reza Taesiri
Anh Totti Nguyen
30
42
0
26 Jul 2022
On Label Granularity and Object Localization
Elijah Cole
Kimberly Wilber
Grant Van Horn
Xuan S. Yang
Marco Fornoni
Pietro Perona
Serge J. Belongie
Andrew G. Howard
Oisin Mac Aodha
WSOL
28
13
0
20 Jul 2022
TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers
Jihao Liu
B. Liu
Hang Zhou
Hongsheng Li
Yu Liu
ViT
10
66
0
18 Jul 2022
Is one annotation enough? A data-centric image classification benchmark for noisy and ambiguous label estimation
Lars Schmarje
Vasco Grossmann
Claudius Zelenka
S. Dippel
R. Kiko
...
M. Pastell
J. Stracke
A. Valros
N. Volkmann
Reinhard Koch
37
34
0
13 Jul 2022
Eliciting and Learning with Soft Labels from Every Annotator
K. M. Collins
Umang Bhatt
Adrian Weller
11
44
0
02 Jul 2022
Distilling Model Failures as Directions in Latent Space
Saachi Jain
Hannah Lawrence
Ankur Moitra
A. Madry
18
89
0
29 Jun 2022
Detecting Adversarial Examples in Batches -- a geometrical approach
Danush Kumar Venkatesh
Peter Steinbach
AAML
11
2
0
17 Jun 2022
On the Eigenvalues of Global Covariance Pooling for Fine-grained Visual Recognition
Yue Song
N. Sebe
Wei Wang
21
33
0
26 May 2022
When does dough become a bagel? Analyzing the remaining mistakes on ImageNet
Vijay Vasudevan
Benjamin Caine
Raphael Gontijo-Lopes
Sara Fridovich-Keil
Rebecca Roelofs
VLM
UQCV
33
57
0
09 May 2022
Google Scanned Objects: A High-Quality Dataset of 3D Scanned Household Items
Laura Downs
Anthony G. Francis
Nate Koenig
Brandon Kinman
R. Hickman
Krista Reymann
T. B. McHugh
Vincent Vanhoucke
LM&Ro
27
472
0
25 Apr 2022
VSA: Learning Varied-Size Window Attention in Vision Transformers
Qiming Zhang
Yufei Xu
Jing Zhang
Dacheng Tao
22
53
0
18 Apr 2022
MiniViT: Compressing Vision Transformers with Weight Multiplexing
Jinnian Zhang
Houwen Peng
Kan Wu
Mengchen Liu
Bin Xiao
Jianlong Fu
Lu Yuan
ViT
21
123
0
14 Apr 2022
Improving Vision Transformers by Revisiting High-frequency Components
Jiawang Bai
Liuliang Yuan
Shutao Xia
Shuicheng Yan
Zhifeng Li
W. Liu
ViT
8
90
0
03 Apr 2022
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
Mitchell Wortsman
Gabriel Ilharco
S. Gadre
Rebecca Roelofs
Raphael Gontijo-Lopes
...
Hongseok Namkoong
Ali Farhadi
Y. Carmon
Simon Kornblith
Ludwig Schmidt
MoMe
48
909
1
10 Mar 2022