Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.11432
Cited By
MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production
16 May 2025
Cheng Jin
Ziheng Jiang
Zhihao Bai
Zheng Zhong
Jing Liu
Xiang Li
Ningxin Zheng
Xi Wang
Cong Xie
Qi Huang
Wen Heng
Yiyuan Ma
Wenlei Bao
Size Zheng
Yanghua Peng
Xuanzhe Liu
Xuanzhe Liu
Xin Jin
Xin Liu
MoE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production"
Title
No papers