Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.10616
Cited By
DiPaCo: Distributed Path Composition
15 March 2024
Arthur Douillard
Qixuang Feng
Andrei A. Rusu
A. Kuncoro
Yani Donchev
Rachita Chhaparia
Ionel Gog
MarcÁurelio Ranzato
Jiajun Shen
Arthur Szlam
MoE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DiPaCo: Distributed Path Composition"
2 / 2 papers shown
Title
Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo
Zachary B. Charles
Gabriel Teston
Lucio Dery
Keith Rush
Nova Fallen
Zachary Garrett
Arthur Szlam
Arthur Douillard
154
0
0
12 Mar 2025
The Future of Large Language Model Pre-training is Federated
Lorenzo Sani
Alexandru Iacob
Zeyu Cao
Bill Marino
Yan Gao
...
Wanru Zhao
William F. Shen
Preslav Aleksandrov
Xinchi Qiu
Nicholas D. Lane
AI4CE
35
12
0
17 May 2024
1