Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.06211
Cited By
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning
10 September 2024
Jaeseong Lee
seung-won hwang
Aurick Qiao
Daniel F Campos
Z. Yao
Yuxiong He
Re-assign community
ArXiv
PDF
HTML
Papers citing
"STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning"
1 / 1 papers shown
Title
Faster MoE LLM Inference for Extremely Large Models
Haoqi Yang
Luohe Shi
Qiwei Li
Zuchao Li
Ping Wang
Bo Du
Mengjia Shen
Hai Zhao
MoE
63
0
0
06 May 2025
1