Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.00071
Cited By
Training EfficientNets at Supercomputer Scale: 83% ImageNet Top-1 Accuracy in One Hour
30 October 2020
Arissa Wongpanich
Hieu H. Pham
J. Demmel
Mingxing Tan
Quoc V. Le
Yang You
Sameer Kumar
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Training EfficientNets at Supercomputer Scale: 83% ImageNet Top-1 Accuracy in One Hour"
2 / 2 papers shown
Title
Online Evolutionary Batch Size Orchestration for Scheduling Deep Learning Workloads in GPU Clusters
Chen Sun
Shenggui Li
Jinyue Wang
Jun Yu
54
47
0
08 Aug 2021
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
310
2,896
0
15 Sep 2016
1