ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.00476
  4. Cited By
ResNet strikes back: An improved training procedure in timm

ResNet strikes back: An improved training procedure in timm

1 October 2021
Ross Wightman
Hugo Touvron
Hervé Jégou
    AI4TS
ArXiv (abs)PDFHTML

Papers citing "ResNet strikes back: An improved training procedure in timm"

50 / 306 papers shown
Title
Hierarchical Selective Classification
Hierarchical Selective Classification
Shani Goren
Ido Galil
Ran El-Yaniv
BDL
93
2
0
19 May 2024
FFF: Fixing Flawed Foundations in contrastive pre-training results in
  very strong Vision-Language models
FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models
Adrian Bulat
Yassine Ouali
Georgios Tzimiropoulos
VLM
104
5
0
16 May 2024
Differentiable Model Scaling using Differentiable Topk
Differentiable Model Scaling using Differentiable Topk
Kai Liu
Ruohui Wang
Jianfei Gao
Kai Chen
MedImVLM
82
2
0
12 May 2024
ToNNO: Tomographic Reconstruction of a Neural Network's Output for
  Weakly Supervised Segmentation of 3D Medical Images
ToNNO: Tomographic Reconstruction of a Neural Network's Output for Weakly Supervised Segmentation of 3D Medical Images
Marius Schmidt-Mengin
Alexis Benichoux
S. Belachew
N. Komodakis
Nikos Paragios
MedIm
75
2
0
19 Apr 2024
GhostNetV3: Exploring the Training Strategies for Compact Models
GhostNetV3: Exploring the Training Strategies for Compact Models
Zhenhua Liu
Zhiwei Hao
Kai Han
Yehui Tang
Yunhe Wang
71
17
0
17 Apr 2024
Post-Hoc Reversal: Are We Selecting Models Prematurely?
Post-Hoc Reversal: Are We Selecting Models Prematurely?
Rishabh Ranjan
Saurabh Garg
Mrigank Raman
Carlos Guestrin
Zachary Chase Lipton
72
0
0
11 Apr 2024
Can Biases in ImageNet Models Explain Generalization?
Can Biases in ImageNet Models Explain Generalization?
Paul Gavrikov
J. Keuper
OODVLM
65
15
0
01 Apr 2024
Efficient Modulation for Vision Networks
Efficient Modulation for Vision Networks
Xu Ma
Xiyang Dai
Jianwei Yang
Bin Xiao
Yinpeng Chen
Yun Fu
Lu Yuan
109
19
0
29 Mar 2024
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs
Donghyun Kim
Byeongho Heo
Dongyoon Han
85
17
0
28 Mar 2024
FOOL: Addressing the Downlink Bottleneck in Satellite Computing with Neural Feature Compression
FOOL: Addressing the Downlink Bottleneck in Satellite Computing with Neural Feature Compression
Alireza Furutanpey
Qiyang Zhang
Philipp Raith
Tobias Pfandzelter
Shangguang Wang
Schahram Dustdar
176
5
0
25 Mar 2024
ParFormer: Vision Transformer Baseline with Parallel Local Global Token
  Mixer and Convolution Attention Patch Embedding
ParFormer: Vision Transformer Baseline with Parallel Local Global Token Mixer and Convolution Attention Patch Embedding
Novendra Setyawan
Ghufron Wahyu Kurniawan
Chi-Chia Sun
Jun-Wei Hsieh
Hui-Kai Su
W. Kuo
ViTMoE
86
0
0
22 Mar 2024
Rotary Position Embedding for Vision Transformer
Rotary Position Embedding for Vision Transformer
Byeongho Heo
Song Park
Dongyoon Han
Sangdoo Yun
134
51
0
20 Mar 2024
When Do We Not Need Larger Vision Models?
When Do We Not Need Larger Vision Models?
Baifeng Shi
Ziyang Wu
Maolin Mao
Xin Wang
Trevor Darrell
VLMLRM
119
49
0
19 Mar 2024
depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning
  Researchers
depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning Researchers
Kaichao You
Runsheng Bai
Meng Cao
Jianmin Wang
Ion Stoica
Mingsheng Long
VLM
77
0
0
14 Mar 2024
On Transfer in Classification: How Well do Subsets of Classes
  Generalize?
On Transfer in Classification: How Well do Subsets of Classes Generalize?
Raphael Baena
Lucas Drumetz
Vincent Gripon
75
0
0
06 Mar 2024
Perceiving Longer Sequences With Bi-Directional Cross-Attention
  Transformers
Perceiving Longer Sequences With Bi-Directional Cross-Attention Transformers
Markus Hiller
Krista A. Ehinger
Tom Drummond
110
4
0
19 Feb 2024
How Flawed Is ECE? An Analysis via Logit Smoothing
How Flawed Is ECE? An Analysis via Logit Smoothing
Muthu Chidambaram
Holden Lee
Colin McSwiggen
Semon Rezchikov
UQCV
62
3
0
15 Feb 2024
Spike-driven Transformer V2: Meta Spiking Neural Network Architecture
  Inspiring the Design of Next-generation Neuromorphic Chips
Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips
Man Yao
Jiakui Hu
Tianxiang Hu
Yifan Xu
Zhaokun Zhou
Yonghong Tian
Boxing Xu
Guoqi Li
98
65
0
15 Feb 2024
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Siyuan Li
Zicheng Liu
Juanxi Tian
Ge Wang
Zedong Wang
...
Cheng Tan
Tao Lin
Yang Liu
Baigui Sun
Stan Z. Li
66
6
0
14 Feb 2024
Peeking Behind the Curtains of Residual Learning
Peeking Behind the Curtains of Residual Learning
Tunhou Zhang
Feng Yan
Hai Helen Li
Yiran Chen
21
0
0
13 Feb 2024
Precise Knowledge Transfer via Flow Matching
Precise Knowledge Transfer via Flow Matching
Shitong Shao
Zhiqiang Shen
Linrui Gong
Huanran Chen
Xu Dai
75
2
0
03 Feb 2024
A General Framework for Learning from Weak Supervision
A General Framework for Learning from Weak Supervision
Hao Chen
Jindong Wang
Lei Feng
Xiang Li
Yidong Wang
Xing Xie
Masashi Sugiyama
Rita Singh
Bhiksha Raj
89
4
0
02 Feb 2024
Decoupling Pixel Flipping and Occlusion Strategy for Consistent XAI
  Benchmarks
Decoupling Pixel Flipping and Occlusion Strategy for Consistent XAI Benchmarks
Stefan Blücher
Johanna Vielhaben
Nils Strodthoff
AAML
131
22
0
12 Jan 2024
FerKD: Surgical Label Adaptation for Efficient Distillation
FerKD: Surgical Label Adaptation for Efficient Distillation
Zhiqiang Shen
73
4
0
29 Dec 2023
Merging Vision Transformers from Different Tasks and Domains
Merging Vision Transformers from Different Tasks and Domains
Peng Ye
Chenyu Huang
Mingzhu Shen
Tao Chen
Yongqi Huang
Yuning Zhang
Wanli Ouyang
MoMe
77
12
0
25 Dec 2023
Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty
  from Pre-trained Models
Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty from Pre-trained Models
Gianni Franchi
Olivier Laurent
Maxence Leguéry
Andrei Bursuc
Andrea Pilzer
Angela Yao
UQCVBDL
60
6
0
23 Dec 2023
Factorization Vision Transformer: Modeling Long Range Dependency with
  Local Window Cost
Factorization Vision Transformer: Modeling Long Range Dependency with Local Window Cost
Haolin Qin
Daquan Zhou
Tingfa Xu
Ziyang Bian
Jianan Li
76
9
0
14 Dec 2023
ELSA: Partial Weight Freezing for Overhead-Free Sparse Network
  Deployment
ELSA: Partial Weight Freezing for Overhead-Free Sparse Network Deployment
Paniz Halvachi
Alexandra Peste
Dan Alistarh
Christoph H. Lampert
68
0
0
11 Dec 2023
NeuJeans: Private Neural Network Inference with Joint Optimization of Convolution and FHE Bootstrapping
NeuJeans: Private Neural Network Inference with Joint Optimization of Convolution and FHE Bootstrapping
Jae Hyung Ju
Jaiyoung Park
Jongmin Kim
Minsik Kang
Donghwan Kim
Jung Hee Cheon
Jung Ho Ahn
FedML
79
7
0
07 Dec 2023
Simplifying Neural Network Training Under Class Imbalance
Simplifying Neural Network Training Under Class Imbalance
Ravid Shwartz-Ziv
Micah Goldblum
Yucen Lily Li
C. Bayan Bruss
Andrew Gordon Wilson
111
17
0
05 Dec 2023
Improve Supervised Representation Learning with Masked Image Modeling
Improve Supervised Representation Learning with Masked Image Modeling
Kaifeng Chen
Daniel M. Salz
Huiwen Chang
Kihyuk Sohn
Dilip Krishnan
Mojtaba Seyedhosseini
SSLViT
67
3
0
01 Dec 2023
DiG-IN: Diffusion Guidance for Investigating Networks -- Uncovering
  Classifier Differences Neuron Visualisations and Visual Counterfactual
  Explanations
DiG-IN: Diffusion Guidance for Investigating Networks -- Uncovering Classifier Differences Neuron Visualisations and Visual Counterfactual Explanations
Maximilian Augustin
Yannic Neuhaus
Matthias Hein
DiffM
109
5
0
29 Nov 2023
Tailoring Mixup to Data for Calibration
Tailoring Mixup to Data for Calibration
Quentin Bouniot
Pavlo Mozharovskyi
Florence dÁlché-Buc
157
1
0
02 Nov 2023
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models
  across Computer Vision Tasks
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks
Micah Goldblum
Hossein Souri
Renkun Ni
Manli Shu
Viraj Prabhu
...
Adrien Bardes
Judy Hoffman
Ramalingam Chellappa
Andrew Gordon Wilson
Tom Goldstein
VLM
188
68
0
30 Oct 2023
ViR: Towards Efficient Vision Retention Backbones
ViR: Towards Efficient Vision Retention Backbones
Ali Hatamizadeh
Michael Ranzinger
Shiyi Lan
Jose M. Alvarez
Sanja Fidler
Jan Kautz
GNN
31
2
0
30 Oct 2023
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
Meng Lou
Hong-Yu Zhou
Sibei Yang
Yizhou Yu
Chuan Wu
Yizhou Yu
ViT
168
41
0
30 Oct 2023
TorchDEQ: A Library for Deep Equilibrium Models
TorchDEQ: A Library for Deep Equilibrium Models
Zhengyang Geng
J. Zico Kolter
VLM
155
12
0
28 Oct 2023
SpikingJelly: An open-source machine learning infrastructure platform
  for spike-based intelligence
SpikingJelly: An open-source machine learning infrastructure platform for spike-based intelligence
Wei Fang
Yanqing Chen
Jianhao Ding
Zhaofei Yu
T. Masquelier
Ding Chen
Liwei Huang
Huihui Zhou
Guoqi Li
Yonghong Tian
112
235
0
25 Oct 2023
Gramian Attention Heads are Strong yet Efficient Vision Learners
Gramian Attention Heads are Strong yet Efficient Vision Learners
Jongbin Ryu
Dongyoon Han
J. Lim
102
2
0
25 Oct 2023
PatchCURE: Improving Certifiable Robustness, Model Utility, and
  Computation Efficiency of Adversarial Patch Defenses
PatchCURE: Improving Certifiable Robustness, Model Utility, and Computation Efficiency of Adversarial Patch Defenses
Chong Xiang
Tong Wu
Sihui Dai
Jonathan Petit
Suman Jana
Prateek Mittal
122
6
0
19 Oct 2023
Real-Fake: Effective Training Data Synthesis Through Distribution
  Matching
Real-Fake: Effective Training Data Synthesis Through Distribution Matching
Jianhao Yuan
Jie Zhang
Shuyang Sun
Philip Torr
Bo Zhao
85
27
0
16 Oct 2023
KAKURENBO: Adaptively Hiding Samples in Deep Neural Network Training
KAKURENBO: Adaptively Hiding Samples in Deep Neural Network Training
Truong Thao Nguyen
Balazs Gerofi
Edgar Josafat Martinez-Noriega
Franccois Trahay
Mohamed Wahib
56
1
0
16 Oct 2023
FedConv: Enhancing Convolutional Neural Networks for Handling Data
  Heterogeneity in Federated Learning
FedConv: Enhancing Convolutional Neural Networks for Handling Data Heterogeneity in Federated Learning
Peiran Xu
Zeyu Wang
Jieru Mei
Liangqiong Qu
Alan Yuille
Cihang Xie
Yuyin Zhou
FedML
55
1
0
06 Oct 2023
Chunking: Continual Learning is not just about Distribution Shift
Chunking: Continual Learning is not just about Distribution Shift
Thomas L. Lee
Amos Storkey
76
1
0
03 Oct 2023
Can Pre-trained Networks Detect Familiar Out-of-Distribution Data?
Can Pre-trained Networks Detect Familiar Out-of-Distribution Data?
Atsuyuki Miyai
Qing Yu
Go Irie
Kiyoharu Aizawa
OODD
208
6
0
02 Oct 2023
The Sparsity Roofline: Understanding the Hardware Limits of Sparse
  Neural Networks
The Sparsity Roofline: Understanding the Hardware Limits of Sparse Neural Networks
Cameron Shinn
Collin McCarthy
Saurav Muralidharan
Muhammad Osama
John Douglas Owens
61
2
0
30 Sep 2023
Understanding and Mitigating the Label Noise in Pre-training on
  Downstream Tasks
Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks
Hao Chen
Jindong Wang
Ankit Shah
Ran Tao
Hongxin Wei
Berfin cSimcsek
Masashi Sugiyama
Bhiksha Raj
108
31
0
29 Sep 2023
Weak Supervision for Label Efficient Visual Bug Detection
Weak Supervision for Label Efficient Visual Bug Detection
F. Rahman
73
2
0
20 Sep 2023
Heterogeneous Generative Knowledge Distillation with Masked Image
  Modeling
Heterogeneous Generative Knowledge Distillation with Masked Image Modeling
Ziming Wang
Shumin Han
Xiaodi Wang
Jing Hao
Xianbin Cao
Baochang Zhang
VLM
70
0
0
18 Sep 2023
Introspective Deep Metric Learning
Introspective Deep Metric Learning
Cheng-Hao Wang
Wenzhao Zheng
Zheng Hua Zhu
Jie Zhou
Jiwen Lu
UQCV
80
12
0
11 Sep 2023
Previous
1234567
Next