Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.10504
Cited By
ATOM: Asynchronous Training of Massive Models for Deep Learning in a Decentralized Environment
15 March 2024
Xiaofeng Wu
Jia Rao
Wei Chen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ATOM: Asynchronous Training of Massive Models for Deep Learning in a Decentralized Environment"
2 / 2 papers shown
Title
Varuna: Scalable, Low-cost Training of Massive Deep Learning Models
Sanjith Athlur
Nitika Saran
Muthian Sivathanu
Ramachandran Ramjee
Nipun Kwatra
GNN
31
80
0
07 Nov 2021
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
245
1,821
0
17 Sep 2019
1