Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2506.07533
Cited By
MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts
9 June 2025
Wei Tao
Haocheng Lu
Xiaoyang Qu
Bin Zhang
Kai Lu
Jiguang Wan
Jianzong Wang
MQ
MoE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts"
Title
No papers