2102.06171
Cited By
High-Performance Large-Scale Image Recognition Without Normalization
11 February 2021
Andrew Brock
Soham De
Samuel L. Smith
Karen Simonyan
VLM
Papers citing
"High-Performance Large-Scale Image Recognition Without Normalization"
50 / 81 papers shown
Myna: Masking-Based Contrastive Learning of Musical Representations
Ori Yonay
Tracy Hammond
Tianbao Yang
AAML
167
0
0
20 Feb 2025
Do Language Models Understand Time?
Xi Ding
Lei Wang
236
1
0
18 Dec 2024
A Parameter Update Balancing Algorithm for Multi-task Ranking Models in Recommendation Systems
Jun Yuan
Guohao Cai
Zhenhua Dong
148
0
0
08 Oct 2024
Deep Learning Alternatives of the Kolmogorov Superposition Theorem
Leonardo Ferreira Guilhoto
P. Perdikaris
77
7
0
02 Oct 2024
Differentially Private Active Learning: Balancing Effective Data Selection and Privacy
Kristian Schwethelm
Johannes Kaiser
Jonas Kuntzer
Mehmet Yigitsoy
Daniel Rueckert
Georgios Kaissis
90
0
0
01 Oct 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
227
52
0
23 May 2024
Regularized Gradient Clipping Provably Trains Wide and Deep Neural Networks
Matteo Tucat
Anirbit Mukherjee
Procheta Sen
Mingfei Sun
Omar Rivasplata
MLT
63
1
0
12 Apr 2024
LambdaNetworks: Modeling Long-Range Interactions Without Attention
Irwan Bello
317
180
0
17 Feb 2021
Bottleneck Transformers for Visual Recognition
A. Srinivas
Tsung-Yi Lin
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
SLR
344
989
0
27 Jan 2021
Characterizing signal propagation to close the performance gap in unnormalized ResNets
Andrew Brock
Soham De
Samuel L. Smith
116
123
0
21 Jan 2021
Training data-efficient image transformers & distillation through attention
Hugo Touvron
Matthieu Cord
Matthijs Douze
Francisco Massa
Alexandre Sablayrolles
Hervé Jégou
ViT
345
6,731
0
23 Dec 2020
ResizeMix: Mixing Data with Preserved Object Information and True Labels
Jie Qin
Jiemin Fang
Qian Zhang
Wenyu Liu
Xingang Wang
Xinggang Wang
62
86
0
21 Dec 2020
Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation
Golnaz Ghiasi
Yin Cui
A. Srinivas
Rui Qian
Tsung-Yi Lin
E. D. Cubuk
Quoc V. Le
Barret Zoph
ISeg
286
987
0
13 Dec 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
530
40,739
0
22 Oct 2020
Sharpness-Aware Minimization for Efficiently Improving Generalization
Pierre Foret
Ariel Kleiner
H. Mobahi
Behnam Neyshabur
AAML
184
1,344
0
03 Oct 2020
Normalization Techniques in Training DNNs: Methodology, Analysis and Application
Lei Huang
Jie Qin
Yi Zhou
Fan Zhu
Li Liu
Ling Shao
AI4CE
102
268
0
27 Sep 2020
The Hardware Lottery
Sara Hooker
58
209
0
14 Sep 2020
On the Generalization Benefit of Noise in Stochastic Gradient Descent
Samuel L. Smith
Erich Elsen
Soham De
MLT
49
99
0
26 Jun 2020
Array Programming with NumPy
Charles R. Harris
K. Millman
S. Walt
R. Gommers
Pauli Virtanen
...
Tyler Reddy
Warren Weckesser
Hameer Abbasi
C. Gohlke
T. Oliphant
131
14,883
0
18 Jun 2020
Designing Network Design Spaces
Ilija Radosavovic
Raj Prateek Kosaraju
Ross B. Girshick
Kaiming He
Piotr Dollár
GNN
96
1,680
0
30 Mar 2020
Meta Pseudo Labels
Hieu H. Pham
Zihang Dai
Qizhe Xie
Minh-Thang Luong
Quoc V. Le
VLM
335
667
0
23 Mar 2020
ReZero is All You Need: Fast Convergence at Large Depth
Thomas C. Bachlechner
Bodhisattwa Prasad Majumder
H. H. Mao
G. Cottrell
Julian McAuley
AI4CE
66
279
0
10 Mar 2020
MaxUp: A Simple Way to Improve Generalization of Neural Network Training
Chengyue Gong
Zhaolin Ren
Mao Ye
Qiang Liu
AAML
58
56
0
20 Feb 2020
A Simple Framework for Contrastive Learning of Visual Representations
Ting Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
325
18,721
0
13 Feb 2020
On the distance between two neural networks and the stability of learning
Jeremy Bernstein
Arash Vahdat
Yisong Yue
Ming-Yu Liu
ODL
227
58
0
09 Feb 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
526
4,773
0
23 Jan 2020
Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross B. Girshick
SSL
165
12,065
0
13 Nov 2019
Self-training with Noisy Student improves ImageNet classification
Qizhe Xie
Minh-Thang Luong
Eduard H. Hovy
Quoc V. Le
NoLa
286
2,387
0
11 Nov 2019
RandAugment: Practical automated data augmentation with a reduced search space
E. D. Cubuk
Barret Zoph
Jonathon Shlens
Quoc V. Le
MQ
208
3,480
0
30 Sep 2019
Non-discriminative data or weak model? On the relative importance of data and model resolution
Mark Sandler
Jonathan Baccash
A. Zhmoginov
Andrew G. Howard
46
31
0
07 Sep 2019
Order and Chaos: NTK views on DNN Normalization, Checkerboard and Boundary Artifacts
Arthur Jacot
Franck Gabriel
François Ged
Clément Hongler
57
23
0
11 Jul 2019
Fixing the train-test resolution discrepancy
Hugo Touvron
Andrea Vedaldi
Matthijs Douze
Hervé Jégou
113
420
0
14 Jun 2019
Four Things Everyone Should Know to Improve Batch Normalization
Cecilia Summers
M. Dinneen
50
52
0
09 Jun 2019
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Mingxing Tan
Quoc V. Le
3DV
MedIm
129
18,058
0
28 May 2019
Why gradient clipping accelerates training: A theoretical justification for adaptivity
Jingzhao Zhang
Tianxing He
S. Sra
Ali Jadbabaie
72
459
0
28 May 2019
CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features
Sangdoo Yun
Dongyoon Han
Seong Joon Oh
Sanghyuk Chun
Junsuk Choe
Y. Yoo
OOD
604
4,766
0
13 May 2019
EvalNorm: Estimating Batch Normalization Statistics for Evaluation
Saurabh Singh
Abhinav Shrivastava
43
51
0
12 Apr 2019
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes
Yang You
Jing Li
Sashank J. Reddi
Jonathan Hseu
Sanjiv Kumar
Srinadh Bhojanapalli
Xiaodan Song
J. Demmel
Kurt Keutzer
Cho-Jui Hsieh
ODL
208
993
0
01 Apr 2019
Micro-Batch Training with Batch-Channel Normalization and Weight Standardization
Siyuan Qiao
Huiyu Wang
Chenxi Liu
Wei Shen
Alan Yuille
MQ
83
144
0
25 Mar 2019
A Mean Field Theory of Batch Normalization
Greg Yang
Jeffrey Pennington
Vinay Rao
Jascha Narain Sohl-Dickstein
S. Schoenholz
60
178
0
21 Feb 2019
Fixup Initialization: Residual Learning Without Normalization
Hongyi Zhang
Yann N. Dauphin
Tengyu Ma
ODL
AI4CE
85
349
0
27 Jan 2019
Bag of Tricks for Image Classification with Convolutional Neural Networks
Tong He
Zhi Zhang
Hang Zhang
Zhongyue Zhang
Junyuan Xie
Mu Li
278
1,413
0
04 Dec 2018
Towards Understanding Regularization in Batch Normalization
Ping Luo
Xinjiang Wang
Wenqi Shao
Zhanglin Peng
MLT
AI4CE
53
180
0
04 Sep 2018
Understanding Batch Normalization
Johan Bjorck
Carla P. Gomes
B. Selman
Kilian Q. Weinberger
128
609
0
01 Jun 2018
How Does Batch Normalization Help Optimization?
Shibani Santurkar
Dimitris Tsipras
Andrew Ilyas
Aleksander Madry
ODL
92
1,537
0
29 May 2018
Self-Attention Generative Adversarial Networks
Han Zhang
Ian Goodfellow
Dimitris N. Metaxas
Augustus Odena
GAN
131
3,720
0
21 May 2018
Exploring the Limits of Weakly Supervised Pretraining
D. Mahajan
Ross B. Girshick
Vignesh Ramanathan
Kaiming He
Manohar Paluri
Yixuan Li
Ashwin R. Bharambe
Laurens van der Maaten
VLM
176
1,367
0
02 May 2018
Group Normalization
Yuxin Wu
Kaiming He
196
3,644
0
22 Mar 2018
How to Start Training: The Effect of Initialization and Architecture
Boris Hanin
David Rolnick
55
255
0
05 Mar 2018
MobileNetV2: Inverted Residuals and Linear Bottlenecks
Mark Sandler
Andrew G. Howard
Menglong Zhu
A. Zhmoginov
Liang-Chieh Chen
169
19,204
0
13 Jan 2018