2504.14960
Cited By
MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core
21 April 2025
Dennis Liu
Zijie Yan
Xin Yao
Tong Liu
V. Korthikanti
Evan Wu
Shiqing Fan
Gao Deng
Hongxiao Bai
Jianbin Chang
Ashwath Aithal
M. Andersch
M. Shoeybi
Jiajie Yao
Chandler Zhou
David Wu
Xipeng Li
J. Yang
MoE
Papers citing
"MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core"
Title
No papers