ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.00476
  4. Cited By
ResNet strikes back: An improved training procedure in timm

ResNet strikes back: An improved training procedure in timm

1 October 2021
Ross Wightman
Hugo Touvron
Hervé Jégou
    AI4TS
ArXiv (abs)PDFHTML

Papers citing "ResNet strikes back: An improved training procedure in timm"

50 / 306 papers shown
Title
An Empirical Analysis of the Shift and Scale Parameters in BatchNorm
An Empirical Analysis of the Shift and Scale Parameters in BatchNorm
Y. Peerthum
Mark Stamp
136
6
0
22 Mar 2023
Equiangular Basis Vectors
Equiangular Basis Vectors
Yang Shen
Xuhao Sun
Xiuying Wei
86
7
0
21 Mar 2023
EVA-02: A Visual Representation for Neon Genesis
EVA-02: A Visual Representation for Neon Genesis
Yuxin Fang
Quan-Sen Sun
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLMViTCLIP
125
289
0
20 Mar 2023
Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness
  with Dataset Reinforcement
Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Reinforcement
Fartash Faghri
Hadi Pouransari
Sachin Mehta
Mehrdad Farajtabar
Ali Farhadi
Mohammad Rastegari
Oncel Tuzel
80
9
0
15 Mar 2023
InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning
InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning
Ziheng Qin
Kaidi Wang
Zangwei Zheng
Jianyang Gu
Xiang Peng
...
Daquan Zhou
Lei Shang
Baigui Sun
Xuansong Xie
Yang You
187
53
0
08 Mar 2023
Can We Scale Transformers to Predict Parameters of Diverse ImageNet
  Models?
Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?
Boris Knyazev
Doha Hwang
Simon Lacoste-Julien
AI4CE
88
21
0
07 Mar 2023
FFT-based Dynamic Token Mixer for Vision
FFT-based Dynamic Token Mixer for Vision
Yuki Tatsunami
Masato Taki
103
23
0
07 Mar 2023
MAST: Masked Augmentation Subspace Training for Generalizable
  Self-Supervised Priors
MAST: Masked Augmentation Subspace Training for Generalizable Self-Supervised Priors
Chen Huang
Hanlin Goh
Jiatao Gu
J. Susskind
SSLOOD
188
6
0
07 Mar 2023
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural
  Network
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network
Xuan Shen
Yaohua Wang
Ming Lin
Yi-Li Huang
Hao Tang
Xiuyu Sun
Yanzhi Wang
144
34
0
05 Mar 2023
Revisiting Adversarial Training for ImageNet: Architectures, Training
  and Generalization across Threat Models
Revisiting Adversarial Training for ImageNet: Architectures, Training and Generalization across Threat Models
Naman D. Singh
Francesco Croce
Matthias Hein
OOD
121
67
0
03 Mar 2023
Image as Set of Points
Image as Set of Points
Xu Ma
Yuqian Zhou
Huan Wang
Can Qin
Bin Sun
Chang Liu
Yun Fu
VLM
82
52
0
02 Mar 2023
Generic-to-Specific Distillation of Masked Autoencoders
Generic-to-Specific Distillation of Masked Autoencoders
Wei Huang
Zhiliang Peng
Li Dong
Furu Wei
Jianbin Jiao
QiXiang Ye
90
23
0
28 Feb 2023
Spatial Bias for Attention-free Non-local Neural Networks
Spatial Bias for Attention-free Non-local Neural Networks
Junhyung Go
Jongbin Ryu
SSL
59
10
0
24 Feb 2023
Learning Visual Representations via Language-Guided Sampling
Learning Visual Representations via Language-Guided Sampling
Mohamed El Banani
Karan Desai
Justin Johnson
SSLVLM
124
28
0
23 Feb 2023
TFormer: A Transmission-Friendly ViT Model for IoT Devices
TFormer: A Transmission-Friendly ViT Model for IoT Devices
Zhichao Lu
Chuntao Ding
Felix Juefei Xu
Vishnu Boddeti
Shangguang Wang
Yun Yang
75
17
0
15 Feb 2023
Convolutional Neural Operators for robust and accurate learning of PDEs
Convolutional Neural Operators for robust and accurate learning of PDEs
Bogdan Raonić
Roberto Molinaro
Tim De Ryck
Tobias Rohner
Francesca Bartolucci
Rima Alaifari
Siddhartha Mishra
Emmanuel de Bezenac
AAML
143
100
0
02 Feb 2023
Multi-dimensional concept discovery (MCD): A unifying framework with
  completeness guarantees
Multi-dimensional concept discovery (MCD): A unifying framework with completeness guarantees
Johanna Vielhaben
Stefan Blücher
Nils Strodthoff
88
42
0
27 Jan 2023
The Power of Linear Combinations: Learning with Random Convolutions
The Power of Linear Combinations: Learning with Random Convolutions
Paul Gavrikov
J. Keuper
82
2
0
26 Jan 2023
A Simple Adaptive Unfolding Network for Hyperspectral Image
  Reconstruction
A Simple Adaptive Unfolding Network for Hyperspectral Image Reconstruction
Junyu Wang
Shijie Wang
Wenyu Liu
Zengqiang Zheng
Xinggang Wang
63
3
0
24 Jan 2023
A Simple Recipe for Competitive Low-compute Self supervised Vision
  Models
A Simple Recipe for Competitive Low-compute Self supervised Vision Models
Quentin Duval
Ishan Misra
Nicolas Ballas
70
9
0
23 Jan 2023
Does progress on ImageNet transfer to real-world datasets?
Does progress on ImageNet transfer to real-world datasets?
Alex Fang
Simon Kornblith
Ludwig Schmidt
VLM
87
38
0
11 Jan 2023
Designing BERT for Convolutional Networks: Sparse and Hierarchical
  Masked Modeling
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling
Keyu Tian
Yi Jiang
Qishuai Diao
Chen Lin
Liwei Wang
Zehuan Yuan
84
105
0
09 Jan 2023
Rethinking Mobile Block for Efficient Attention-based Models
Rethinking Mobile Block for Efficient Attention-based Models
Jiangning Zhang
Xiangtai Li
Jian Li
Liang Liu
Zhucun Xue
Boshen Zhang
Zhe Jiang
Tianxin Huang
Yabiao Wang
Chengjie Wang
MQ
150
102
0
03 Jan 2023
A Close Look at Spatial Modeling: From Attention to Convolution
A Close Look at Spatial Modeling: From Attention to Convolution
Xu Ma
Huan Wang
Can Qin
Kunpeng Li
Xing Zhao
Jie Fu
Yun Fu
ViT3DPC
66
12
0
23 Dec 2022
Revisiting Residual Networks for Adversarial Robustness: An
  Architectural Perspective
Revisiting Residual Networks for Adversarial Robustness: An Architectural Perspective
Shihua Huang
Zhichao Lu
Kalyanmoy Deb
Vishnu Boddeti
OOD
102
45
0
21 Dec 2022
RangeAugment: Efficient Online Augmentation with Range Learning
RangeAugment: Efficient Online Augmentation with Range Learning
Sachin Mehta
Saeid Naderiparizi
Fartash Faghri
Maxwell Horton
Lailin Chen
Ali Farhadi
Oncel Tuzel
Mohammad Rastegari
53
6
0
20 Dec 2022
Better May Not Be Fairer: A Study on Subgroup Discrepancy in Image
  Classification
Better May Not Be Fairer: A Study on Subgroup Discrepancy in Image Classification
Ming-Chang Chiu
Pin-Yu Chen
Xuezhe Ma
83
6
0
16 Dec 2022
From Xception to NEXcepTion: New Design Decisions and Neural
  Architecture Search
From Xception to NEXcepTion: New Design Decisions and Neural Architecture Search
Hadar Shavit
Filip Jatelnicki
Pol Mor-Puigventós
W. Kowalczyk
45
2
0
16 Dec 2022
Fake it till you make it: Learning transferable representations from
  synthetic ImageNet clones
Fake it till you make it: Learning transferable representations from synthetic ImageNet clones
Mert Bulent Sariyildiz
Alahari Karteek
Diane Larlus
Yannis Kalantidis
DiffMVLM
103
161
0
16 Dec 2022
RTMDet: An Empirical Study of Designing Real-Time Object Detectors
RTMDet: An Empirical Study of Designing Real-Time Object Detectors
Chengqi Lyu
Wenwei Zhang
Haian Huang
Yue Zhou
Yudong Wang
Yanyi Liu
Shilong Zhang
Kai-xiang Chen
ObjD
108
407
0
14 Dec 2022
Comparing the Decision-Making Mechanisms by Transformers and CNNs via
  Explanation Methods
Comparing the Decision-Making Mechanisms by Transformers and CNNs via Explanation Methods
Ming-Xiu Jiang
Saeed Khorram
Li Fuxin
FAtt
112
11
0
13 Dec 2022
Error-aware Quantization through Noise Tempering
Error-aware Quantization through Noise Tempering
Zheng Wang
Juncheng Billy Li
Shuhui Qu
Florian Metze
Emma Strubell
MQ
48
2
0
11 Dec 2022
A Whac-A-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One
  Amplifies Others
A Whac-A-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others
Zhiheng Li
Ivan Evtimov
Albert Gordo
C. Hazirbas
Tal Hassner
Cristian Canton Ferrer
Chenliang Xu
Mark Ibrahim
86
78
0
09 Dec 2022
Co-training $2^L$ Submodels for Visual Recognition
Co-training 2L2^L2L Submodels for Visual Recognition
Hugo Touvron
Matthieu Cord
Maxime Oquab
Piotr Bojanowski
Jakob Verbeek
Hervé Jégou
VLM
72
10
0
09 Dec 2022
Learning Imbalanced Data with Vision Transformers
Learning Imbalanced Data with Vision Transformers
Zhengzhuo Xu
R. Liu
Shuo Yang
Zenghao Chai
Chun Yuan
103
36
0
05 Dec 2022
Towards Improved Input Masking for Convolutional Neural Networks
Towards Improved Input Masking for Convolutional Neural Networks
S. Balasubramanian
Soheil Feizi
AAML
70
4
0
26 Nov 2022
Receptive Field Refinement for Convolutional Neural Networks Reliably
  Improves Predictive Performance
Receptive Field Refinement for Convolutional Neural Networks Reliably Improves Predictive Performance
Mats L. Richter
C. Pal
70
3
0
26 Nov 2022
Learning on tree architectures outperforms a convolutional feedforward
  network
Learning on tree architectures outperforms a convolutional feedforward network
Yuval Meir
Itamar Ben-Noam
Yarden Tzach
Shiri Hodassman
Ido Kanter
AI4CE
43
6
0
21 Nov 2022
DiffusionDet: Diffusion Model for Object Detection
DiffusionDet: Diffusion Model for Object Detection
Shoufa Chen
Pei Sun
Yibing Song
Ping Luo
133
473
0
17 Nov 2022
ParCNetV2: Oversized Kernel with Enhanced Attention
ParCNetV2: Oversized Kernel with Enhanced Attention
Ruihan Xu
Haokui Zhang
Wenze Hu
Shiliang Zhang
Xiaoyu Wang
ViT
85
6
0
14 Nov 2022
Demystify Transformers & Convolutions in Modern Image Deep Networks
Demystify Transformers & Convolutions in Modern Image Deep Networks
Jifeng Dai
Min Shi
Weiyun Wang
Sitong Wu
Linjie Xing
...
Lewei Lu
Jie Zhou
Xiaogang Wang
Yu Qiao
Xiao-hua Hu
ViT
83
11
0
10 Nov 2022
MogaNet: Multi-order Gated Aggregation Network
MogaNet: Multi-order Gated Aggregation Network
Siyuan Li
Zedong Wang
Zicheng Liu
Cheng Tan
Haitao Lin
Di Wu
Zhiyuan Chen
Jiangbin Zheng
Stan Z. Li
107
65
0
07 Nov 2022
Studying inductive biases in image classification task
Studying inductive biases in image classification task
N. Arizumi
59
1
0
31 Oct 2022
Facial Action Unit Detection and Intensity Estimation from
  Self-supervised Representation
Facial Action Unit Detection and Intensity Estimation from Self-supervised Representation
Bowen Ma
Rudong An
Wei Zhang
Yu-qiong Ding
Zeng Zhao
Rongsheng Zhang
Tangjie Lv
Changjie Fan
Zhipeng Hu
CVBM
103
21
0
28 Oct 2022
The Robustness Limits of SoTA Vision Models to Natural Variation
The Robustness Limits of SoTA Vision Models to Natural Variation
Mark Ibrahim
Q. Garrido
Ari S. Morcos
Diane Bouchacourt
VLM
99
16
0
24 Oct 2022
MetaFormer Baselines for Vision
MetaFormer Baselines for Vision
Weihao Yu
Chenyang Si
Pan Zhou
Mi Luo
Yichen Zhou
Jiashi Feng
Shuicheng Yan
Xinchao Wang
MoE
99
169
0
24 Oct 2022
Similarity of Neural Architectures using Adversarial Attack
  Transferability
Similarity of Neural Architectures using Adversarial Attack Transferability
Ian Ryu
Dongyoon Han
Byeongho Heo
Song Park
Sanghyuk Chun
Jong-Seok Lee
AAML
136
2
0
20 Oct 2022
Packed-Ensembles for Efficient Uncertainty Estimation
Packed-Ensembles for Efficient Uncertainty Estimation
Olivier Laurent
Adrien Lafage
Enzo Tartaglione
Geoffrey Daniel
Jean-Marc Martinez
Andrei Bursuc
Gianni Franchi
OODD
145
32
0
17 Oct 2022
CAP: Correlation-Aware Pruning for Highly-Accurate Sparse Vision Models
CAP: Correlation-Aware Pruning for Highly-Accurate Sparse Vision Models
Denis Kuznedelev
Eldar Kurtic
Elias Frantar
Dan Alistarh
VLMViT
76
13
0
14 Oct 2022
Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities
Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities
Brian Bartoldson
B. Kailkhura
Davis W. Blalock
107
51
0
13 Oct 2022
Previous
1234567
Next