ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.15052
  4. Cited By
Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training

Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training

23 May 2024
Xianzhi Du
Tom Gunter
Xiang Kong
Mark Lee
Zirui Wang
Aonan Zhang
Nan Du
Ruoming Pang
    MoE
ArXivPDFHTML

Papers citing "Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training"

1 / 1 papers shown
Title
Mixture-of-Experts with Expert Choice Routing
Mixture-of-Experts with Expert Choice Routing
Yan-Quan Zhou
Tao Lei
Han-Chu Liu
Nan Du
Yanping Huang
Vincent Zhao
Andrew M. Dai
Zhifeng Chen
Quoc V. Le
James Laudon
MoE
160
333
0
18 Feb 2022
1