Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.07035
Cited By
HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts
12 December 2023
Giang Do
Le Khiem
Quang Pham
TrungTin Nguyen
Thanh-Nam Doan
Binh T. Nguyen
Chenghao Liu
Savitha Ramasamy
Xiaoli Li
Steven C. H. Hoi
MoE
Re-assign community
ArXiv (abs)
PDF
HTML
Github (33★)
Papers citing
"HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts"
2 / 2 papers shown
Title
Model Selection for Gaussian-gated Gaussian Mixture of Experts Using Dendrograms of Mixing Measures
Tuan Thai
TrungTin Nguyen
Dat Do
Nhat Ho
Christopher Drovandi
171
0
0
19 May 2025
Layerwise Recurrent Router for Mixture-of-Experts
Zihan Qiu
Zeyu Huang
Shuang Cheng
Yizhi Zhou
Zili Wang
Ivan Titov
Jie Fu
MoE
149
2
0
13 Aug 2024
1