ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.11432
  4. Cited By
MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production

MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production

16 May 2025
Cheng Jin
Ziheng Jiang
Zhihao Bai
Zheng Zhong
Jing Liu
Xiang Li
Ningxin Zheng
Xi Wang
Cong Xie
Qi Huang
Wen Heng
Yiyuan Ma
Wenlei Bao
Size Zheng
Yanghua Peng
Xuanzhe Liu
Xuanzhe Liu
Xin Jin
Xin Liu
    MoE
ArXivPDFHTML

Papers citing "MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production"

Title
No papers