1808.01371
Cited By
Large Scale Language Modeling: Converging on 40GB of Text in Four Hours
3 August 2018
Raul Puri
Robert M. Kirby
Nikolai Yakovenko
Bryan Catanzaro
Papers citing
"Large Scale Language Modeling: Converging on 40GB of Text in Four Hours"
3 / 3 papers shown
Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism
Tim Tsz-Kit Lau
Weijian Li
Chenwei Xu
Han Liu
Mladen Kolar
185
0
0
30 Dec 2024
Scaling Neural Machine Translation
Myle Ott
Sergey Edunov
David Grangier
Michael Auli
AIMat
44
610
0
01 Jun 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
Tal Ben-Nun
Torsten Hoefler
GNN
33
702
0
26 Feb 2018