
How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers
Papers citing "How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers"
Title |
---|
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes. Yang You, Jing Li, Sashank J. Reddi, Jonathan Hseu, Sanjiv Kumar, Srinadh Bhojanapalli, Xiaodan Song, J. Demmel, Kurt Keutzer, Cho-Jui Hsieh |